Transliteration Comparison: translit vs Unidecode vs anyascii

Comprehensive character-level comparison across all 83 supported languages. Every assigned codepoint in each language's Unicode block(s) is tested — no sampling.

Methodology

For each language:

1. All assigned codepoints in the relevant Unicode block(s) are enumerated
2. Unassigned, private-use, surrogate, and format characters are skipped
3. Each character is transliterated by all three libraries with the language's lang parameter
4. "Mapped" means at least one library produced meaningful ASCII output (not empty, not [?], not the original character)

This approach is deterministic and comprehensive — results do not depend on sample text selection.
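
The enumeration and the "mapped" test described above can be sketched in Python with only the standard library. The function names and the example range are illustrative assumptions, not the code that produced this report, and exact counts depend on the Unicode version bundled with the interpreter.

```python
import unicodedata

# General categories skipped by the methodology:
# Cn = unassigned, Co = private use, Cs = surrogate, Cf = format.
SKIPPED_CATEGORIES = {"Cn", "Co", "Cs", "Cf"}


def assigned_codepoints(ranges):
    """Yield each character in the inclusive (start, end) ranges that is not skipped."""
    for start, end in ranges:
        for cp in range(start, end + 1):
            ch = chr(cp)
            if unicodedata.category(ch) not in SKIPPED_CATEGORIES:
                yield ch


def is_meaningful(original, output):
    """Apply the report's "mapped" test to one library's output for one character."""
    return output not in ("", "[?]", original)


# Hypothetical block list for Greek: the Greek and Coptic block, U+0370..U+03FF.
greek = list(assigned_codepoints([(0x0370, 0x03FF)]))
print(len(greek))
```

With a Unicode 7.0 or later character database, the Greek range should give the 135 codepoints reported in the el row of the summary.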

Summary

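| Lang | Description | Block chars | Mapped | translit | Unidecode | anyascii | translit-only | Unidecode-only | Output diffs |
|---|---|---|---|---|---|---|---|---|---|
| bg | Bulgarian | 304 | 301 | 292 | 234 | 301 | 65 | 7 | 78 |
| ca | Catalan | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| cs | Czech | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| cy | Welsh | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| da | Danish | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 29 |
| de | German | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 30 |
| el | Greek | 135 | 135 | 135 | 106 | 135 | 29 | 0 | 34 |
| es | Spanish | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| et | Estonian | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 30 |
| fi | Finnish | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| fr | French | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| ga | Irish | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| hr | Croatian | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| hu | Hungarian | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| is | Icelandic | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 25 |
| it | Italian | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| lt | Lithuanian | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| lv | Latvian | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| mt | Maltese | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| nl | Dutch | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| no | Norwegian | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 29 |
| pl | Polish | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| pt | Portuguese | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| ro | Romanian | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| sk | Slovak | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| sl | Slovenian | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| sq | Albanian | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| sr | Serbian | 304 | 301 | 290 | 234 | 301 | 65 | 9 | 78 |
| sv | Swedish | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 28 |
| tr | Turkish | 400 | 400 | 400 | 398 | 400 | 2 | 0 | 24 |
| uk | Ukrainian | 304 | 301 | 292 | 234 | 301 | 65 | 7 | 80 |
| vi | Vietnamese | 656 | 656 | 647 | 645 | 656 | 2 | 0 | 25 |
| ja | Japanese | 248 | 248 | 237 | 240 | 246 | 4 | 7 | 12 |
| ja-kunrei | Japanese Kunrei | 189 | 189 | 181 | 181 | 188 | 4 | 4 | 9 |
| ko | Korean | 11172 | 11172 | 11172 | 11172 | 11172 | 0 | 0 | 3762 |
| zh | Chinese | 20992 | 20954 | 20924 | 20642 | 20954 | 291 | 9 | 20633 |
| ar | Arabic | 248 | 221 | 207 | 173 | 208 | 38 | 4 | 92 |
| fa | Persian | 391 | 331 | 207 | 173 | 318 | 38 | 4 | 97 |
| he | Hebrew | 88 | 53 | 46 | 49 | 53 | 1 | 4 | 15 |
| hi | Hindi | 128 | 127 | 117 | 103 | 123 | 19 | 5 | 68 |
| bn | Bengali | 96 | 95 | 90 | 87 | 95 | 5 | 2 | 59 |
| ta | Tamil | 72 | 71 | 63 | 61 | 71 | 3 | 1 | 36 |
| te | Telugu | 100 | 99 | 92 | 79 | 99 | 15 | 2 | 53 |
| gu | Gujarati | 91 | 87 | 83 | 77 | 87 | 7 | 1 | 50 |
| kn | Kannada | 91 | 90 | 85 | 79 | 90 | 8 | 2 | 53 |
| ml | Malayalam | 118 | 115 | 111 | 77 | 115 | 35 | 1 | 52 |
| mr | Marathi | 128 | 127 | 117 | 103 | 123 | 19 | 5 | 68 |
| ne | Nepali | 128 | 127 | 117 | 103 | 123 | 19 | 5 | 68 |
| or | Odia | 91 | 90 | 86 | 77 | 89 | 12 | 3 | 49 |
| pa | Punjabi | 80 | 78 | 74 | 72 | 76 | 5 | 3 | 49 |
| sa | Sanskrit | 128 | 127 | 117 | 103 | 123 | 19 | 5 | 68 |
| as | Assamese | 96 | 95 | 90 | 87 | 95 | 5 | 2 | 59 |
| hy | Armenian | 91 | 90 | 86 | 85 | 90 | 3 | 2 | 21 |
| ka | Georgian | 88 | 88 | 87 | 78 | 88 | 9 | 0 | 27 |
| si | Sinhala | 91 | 90 | 90 | 79 | 90 | 11 | 0 | 55 |
| th | Thai | 87 | 80 | 78 | 80 | 78 | 0 | 2 | 16 |
| lo | Lao | 83 | 76 | 75 | 58 | 75 | 18 | 1 | 12 |
| km | Khmer | 114 | 106 | 100 | 94 | 104 | 10 | 4 | 62 |
| my | Myanmar | 160 | 141 | 136 | 77 | 139 | 64 | 5 | 54 |
| bo | Tibetan | 211 | 201 | 155 | 147 | 195 | 22 | 14 | 115 |
| am | Amharic | 384 | 370 | 370 | 343 | 370 | 27 | 0 | 218 |
| ru | Russian | 304 | 301 | 294 | 234 | 301 | 65 | 5 | 76 |
| dv | Dhivehi | 50 | 49 | 48 | 48 | 48 | 0 | 0 | 3 |
| jv | Javanese | 91 | 90 | 75 | 0 | 90 | 75 | 0 | 0 |
| mn | Mongolian | 157 | 153 | 149 | 148 | 151 | 5 | 4 | 53 |
| su | Sundanese | 64 | 63 | 48 | 0 | 62 | 48 | 0 | 0 |
| nod | Tai Tham | 127 | 119 | 103 | 0 | 118 | 103 | 0 | 0 |
| cjm | Cham | 83 | 83 | 78 | 0 | 83 | 78 | 0 | 0 |
| btk | Batak | 56 | 54 | 50 | 0 | 52 | 50 | 0 | 0 |
| bug | Buginese | 30 | 30 | 30 | 0 | 29 | 30 | 0 | 0 |
| tl | Tagalog | 23 | 21 | 21 | 0 | 21 | 21 | 0 | 0 |
| hnn | Hanunoo | 23 | 22 | 22 | 0 | 22 | 22 | 0 | 0 |
| bku | Buhid | 20 | 20 | 20 | 0 | 20 | 20 | 0 | 0 |
| tbw | Tagbanwa | 18 | 18 | 18 | 0 | 18 | 18 | 0 | 0 |
| mni | Meetei Mayek | 79 | 76 | 73 | 0 | 75 | 73 | 0 | 0 |
| ber | Tifinagh | 59 | 58 | 58 | 0 | 58 | 58 | 0 | 0 |
| lis | Lisu | 48 | 48 | 48 | 0 | 47 | 48 | 0 | 0 |
| sat | Ol Chiki | 48 | 45 | 43 | 0 | 45 | 43 | 0 | 0 |
| bax | Bamum | 88 | 87 | 83 | 0 | 87 | 83 | 0 | 0 |
| bal | Balinese | 124 | 114 | 93 | 0 | 114 | 93 | 0 | 0 |
| nko | N'Ko | 62 | 54 | 50 | 0 | 53 | 50 | 0 | 0 |
| vai | Vai | 300 | 299 | 286 | 0 | 299 | 286 | 0 | 0 |
| cop | Coptic | 123 | 121 | 102 | 0 | 121 | 102 | 0 | 0 |
| TOTAL | | 50464 | 50157 | 49641 | 47408 | 50085 | 2362 | 129 | 27034 |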
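
The per-library counts and the two "only" columns are linked by a simple identity: the number of codepoints mapped by both translit and Unidecode can be computed either as translit minus translit-only or as Unidecode minus Unidecode-only. The sketch below checks that identity on a few rows copied from the summary; the tuple layout and the function name are illustrative, not part of any of the three libraries.

```python
# (lang, mapped, translit, unidecode, translit_only, unidecode_only),
# values copied from the summary table.
ROWS = [
    ("bg", 301, 292, 234, 65, 7),
    ("ja", 248, 237, 240, 4, 7),
    ("bo", 201, 155, 147, 22, 14),
    ("TOTAL", 50157, 49641, 47408, 2362, 129),
]


def mapped_by_both(translit, unidecode, translit_only, unidecode_only):
    """Return the count mapped by both libraries, checking the two derivations agree."""
    via_translit = translit - translit_only
    via_unidecode = unidecode - unidecode_only
    if via_translit != via_unidecode:
        raise ValueError(f"inconsistent row: {via_translit} != {via_unidecode}")
    return via_translit


for lang, mapped, t, u, t_only, u_only in ROWS:
    both = mapped_by_both(t, u, t_only, u_only)
    # Codepoints mapped by translit or Unidecode (or both).
    either = both + t_only + u_only
    # "Mapped" counts all three libraries, so the remainder is mapped by anyascii alone.
    print(f"{lang}: both={both} either={either} anyascii-only={mapped - either}")
```

For bg, for example, this gives 227 codepoints mapped by both libraries and 299 mapped by either, so 2 of the 301 mapped codepoints are covered by anyascii alone.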

Notable Differences

Latin-script languages (27 languages)

Languages: ca (Catalan), cs (Czech), cy (Welsh), da (Danish), de (German), es (Spanish), et (Estonian), fi (Finnish), fr (French), ga (Irish), hr (Croatian), hu (Hungarian), is (Icelandic), it (Italian), lt (Lithuanian), lv (Latvian), mt (Maltese), nl (Dutch), no (Norwegian), pl (Polish), pt (Portuguese), ro (Romanian), sk (Slovak), sl (Slovenian), sq (Albanian), sv (Swedish), tr (Turkish)

All 27 languages share the same Unicode blocks (Latin-1 Supplement + Latin Extended-A + Latin Extended-B) with 400 assigned codepoints, 400 mapped by at least one library.

Coverage: translit maps 400/400, Unidecode maps 398/400. 2 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
Ɂ U+0241 LATIN CAPITAL LETTER GLOTTAL STOP '
ɂ U+0242 LATIN SMALL LETTER GLOTTAL STOP '

Shared differences (same output across all 27 languages):

Char Codepoint Name translit Unidecode anyascii
ʼn U+0149 LATIN SMALL LETTER N PRECEDED BY APOSTROPHE n 'n 'n
Ŋ U+014A LATIN CAPITAL LETTER ENG N NG Ng
ŋ U+014B LATIN SMALL LETTER ENG n ng ng
Ƅ U+0184 LATIN CAPITAL LETTER TONE SIX B 6 6
ƅ U+0185 LATIN SMALL LETTER TONE SIX b 6 6
Ǝ U+018E LATIN CAPITAL LETTER REVERSED E D 3 E
Ə U+018F LATIN CAPITAL LETTER SCHWA A @ E
Ɯ U+019C LATIN CAPITAL LETTER TURNED M M W W
Ʀ U+01A6 LATIN LETTER YR R YR R
Ƨ U+01A7 LATIN CAPITAL LETTER TONE TWO S 2 2
ƨ U+01A8 LATIN SMALL LETTER TONE TWO s 2 2
Ʃ U+01A9 LATIN CAPITAL LETTER ESH Sh SH Sh
ƪ U+01AA LATIN LETTER REVERSED ESH LOOP s sh sh
Ʊ U+01B1 LATIN CAPITAL LETTER UPSILON U Y U
Ʒ U+01B7 LATIN CAPITAL LETTER EZH Zh ZH Zh
Ƹ U+01B8 LATIN CAPITAL LETTER EZH REVERSED Zh ZH ```
ǂ U+01C2 LATIN LETTER ALVEOLAR CLICK ! |= qc
ǝ U+01DD LATIN SMALL LETTER TURNED E e @ e
Ǯ U+01EE LATIN CAPITAL LETTER EZH WITH CARON Zh ZH Zh
Ƕ U+01F6 LATIN CAPITAL LETTER HWAIR Hv HV Hw
Ȝ U+021C LATIN CAPITAL LETTER YOGH Yh Y Y
ȝ U+021D LATIN SMALL LETTER YOGH yh y y
Ʌ U+0245 LATIN CAPITAL LETTER TURNED V V ^ A
Ɋ U+024A LATIN CAPITAL LETTER SMALL Q WITH HOOK TAIL Q q Q
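
One way to derive the "mapped only by" lists and the differences table is sketched below. The stand-in tables hold only a few values copied from the rows above; `classify` and the dict-backed lookups are hypothetical and do not call the real libraries. Treating an output difference as "both translit and Unidecode map the character but return different strings" is an assumption inferred from the rows shown.

```python
def is_mapped(original, output):
    """The report's "mapped" test: not empty, not [?], not the original character."""
    return output not in ("", "[?]", original)


def classify(chars, translit_fn, unidecode_fn):
    """Split characters into translit-only, Unidecode-only, and differing outputs."""
    translit_only, unidecode_only, diffs = [], [], []
    for ch in chars:
        t, u = translit_fn(ch), unidecode_fn(ch)
        t_ok, u_ok = is_mapped(ch, t), is_mapped(ch, u)
        if t_ok and not u_ok:
            translit_only.append(ch)
        elif u_ok and not t_ok:
            unidecode_only.append(ch)
        elif t_ok and u_ok and t != u:
            diffs.append((ch, t, u))
    return translit_only, unidecode_only, diffs


# Stand-in outputs taken from the tables above. "É" is not listed in either
# table, which implies the two libraries agree on it.
TRANSLIT = {"Ɂ": "'", "Ŋ": "N", "Ʃ": "Sh", "É": "E"}
UNIDECODE = {"Ɂ": "", "Ŋ": "NG", "Ʃ": "SH", "É": "E"}

result = classify(
    "ɁŊƩÉ",
    lambda ch: TRANSLIT.get(ch, ""),
    lambda ch: UNIDECODE.get(ch, ""),
)
print(result)
```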

Language-specific differences (due to language override tables):

da — Danish

Char Codepoint Name translit Unidecode anyascii
Å U+00C5 LATIN CAPITAL LETTER A WITH RING ABOVE Aa A A
Æ U+00C6 LATIN CAPITAL LETTER AE Ae AE Ae
Ø U+00D8 LATIN CAPITAL LETTER O WITH STROKE Oe O O
å U+00E5 LATIN SMALL LETTER A WITH RING ABOVE aa a a
ø U+00F8 LATIN SMALL LETTER O WITH STROKE oe o o

de — German

Char Codepoint Name translit Unidecode anyascii
Ä U+00C4 LATIN CAPITAL LETTER A WITH DIAERESIS Ae A A
Ö U+00D6 LATIN CAPITAL LETTER O WITH DIAERESIS Oe O O
Ü U+00DC LATIN CAPITAL LETTER U WITH DIAERESIS Ue U U
ä U+00E4 LATIN SMALL LETTER A WITH DIAERESIS ae a a
ö U+00F6 LATIN SMALL LETTER O WITH DIAERESIS oe o o
ü U+00FC LATIN SMALL LETTER U WITH DIAERESIS ue u u

et — Estonian

Char Codepoint Name translit Unidecode anyascii
Ä U+00C4 LATIN CAPITAL LETTER A WITH DIAERESIS Ae A A
Ö U+00D6 LATIN CAPITAL LETTER O WITH DIAERESIS Oe O O
Ü U+00DC LATIN CAPITAL LETTER U WITH DIAERESIS Ue U U
ä U+00E4 LATIN SMALL LETTER A WITH DIAERESIS ae a a
ö U+00F6 LATIN SMALL LETTER O WITH DIAERESIS oe o o
ü U+00FC LATIN SMALL LETTER U WITH DIAERESIS ue u u

is — Icelandic

Char Codepoint Name translit Unidecode anyascii
Æ U+00C6 LATIN CAPITAL LETTER AE Ae AE Ae

no — Norwegian

Char Codepoint Name translit Unidecode anyascii
Å U+00C5 LATIN CAPITAL LETTER A WITH RING ABOVE Aa A A
Æ U+00C6 LATIN CAPITAL LETTER AE Ae AE Ae
Ø U+00D8 LATIN CAPITAL LETTER O WITH STROKE Oe O O
å U+00E5 LATIN SMALL LETTER A WITH RING ABOVE aa a a
ø U+00F8 LATIN SMALL LETTER O WITH STROKE oe o o

sv — Swedish

Char Codepoint Name translit Unidecode anyascii
Ä U+00C4 LATIN CAPITAL LETTER A WITH DIAERESIS Ae A A
Ö U+00D6 LATIN CAPITAL LETTER O WITH DIAERESIS Oe O O
ä U+00E4 LATIN SMALL LETTER A WITH DIAERESIS ae a a
ö U+00F6 LATIN SMALL LETTER O WITH DIAERESIS oe o o
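
A language override table can be pictured as a small per-language mapping layered over a shared default table. The sketch below is a minimal illustration of that idea using the German and Swedish values from the tables above; the names `BASE`, `OVERRIDES` and `transliterate` are hypothetical and say nothing about how translit is actually implemented.

```python
# Default outputs shared by all Latin-script languages (the values the other
# libraries also produce for these letters in the tables above).
BASE = {"Ä": "A", "Ö": "O", "Ü": "U", "ä": "a", "ö": "o", "ü": "u"}

# Per-language overrides, copied from the de and sv tables.
OVERRIDES = {
    "de": {"Ä": "Ae", "Ö": "Oe", "Ü": "Ue", "ä": "ae", "ö": "oe", "ü": "ue"},
    "sv": {"Ä": "Ae", "Ö": "Oe", "ä": "ae", "ö": "oe"},
}


def transliterate(text, lang=None):
    """Map each character through the base table, with lang overrides taking priority."""
    table = {**BASE, **OVERRIDES.get(lang, {})}
    return "".join(table.get(ch, ch) for ch in text)


print(transliterate("Müller", "de"))     # German overrides ü
print(transliterate("Müller", "sv"))     # Swedish does not override ü
print(transliterate("Östersund", "sv"))  # Swedish overrides Ö
```

Merging the override on top of the base keeps each language table small: Swedish lists four entries and inherits the default for Ü and ü.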

bg — Bulgarian

Block: 304 assigned codepoints, 301 mapped by at least one library.

Coverage: translit maps 292/301, Unidecode maps 234/301. 65 mapped only by translit, 7 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
Ҋ U+048A CYRILLIC CAPITAL LETTER SHORT I WITH TAIL Y
ҋ U+048B CYRILLIC SMALL LETTER SHORT I WITH TAIL y
Ӆ U+04C5 CYRILLIC CAPITAL LETTER EL WITH TAIL L
ӆ U+04C6 CYRILLIC SMALL LETTER EL WITH TAIL l
Ӊ U+04C9 CYRILLIC CAPITAL LETTER EN WITH TAIL N
ӊ U+04CA CYRILLIC SMALL LETTER EN WITH TAIL n
Ӎ U+04CD CYRILLIC CAPITAL LETTER EM WITH TAIL M
ӎ U+04CE CYRILLIC SMALL LETTER EM WITH TAIL m
ӏ U+04CF CYRILLIC SMALL LETTER PALOCHKA i
Ӷ U+04F6 CYRILLIC CAPITAL LETTER GHE WITH DESCENDER G
ӷ U+04F7 CYRILLIC SMALL LETTER GHE WITH DESCENDER g
Ӻ U+04FA CYRILLIC CAPITAL LETTER GHE WITH STROKE AND HOOK G
ӻ U+04FB CYRILLIC SMALL LETTER GHE WITH STROKE AND HOOK g
Ӽ U+04FC CYRILLIC CAPITAL LETTER HA WITH HOOK Kh
ӽ U+04FD CYRILLIC SMALL LETTER HA WITH HOOK kh
Ӿ U+04FE CYRILLIC CAPITAL LETTER HA WITH STROKE Kh
ӿ U+04FF CYRILLIC SMALL LETTER HA WITH STROKE kh
Ԁ U+0500 CYRILLIC CAPITAL LETTER KOMI DE D
ԁ U+0501 CYRILLIC SMALL LETTER KOMI DE d
Ԃ U+0502 CYRILLIC CAPITAL LETTER KOMI DJE Dj
ԃ U+0503 CYRILLIC SMALL LETTER KOMI DJE dj
Ԅ U+0504 CYRILLIC CAPITAL LETTER KOMI ZJE Z
ԅ U+0505 CYRILLIC SMALL LETTER KOMI ZJE z
Ԇ U+0506 CYRILLIC CAPITAL LETTER KOMI DZJE Dz
ԇ U+0507 CYRILLIC SMALL LETTER KOMI DZJE dz
Ԉ U+0508 CYRILLIC CAPITAL LETTER KOMI LJE Lj
ԉ U+0509 CYRILLIC SMALL LETTER KOMI LJE lj
Ԋ U+050A CYRILLIC CAPITAL LETTER KOMI NJE Nj
ԋ U+050B CYRILLIC SMALL LETTER KOMI NJE nj
Ԍ U+050C CYRILLIC CAPITAL LETTER KOMI SJE Sj
...35 more

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
Ь U+042C CYRILLIC CAPITAL LETTER SOFT SIGN '
ь U+044C CYRILLIC SMALL LETTER SOFT SIGN '
҂ U+0482 CYRILLIC THOUSANDS SIGN *1000*
҈ U+0488 COMBINING CYRILLIC HUNDRED THOUSANDS SIGN *100.000*
҉ U+0489 COMBINING CYRILLIC MILLIONS SIGN *1.000.000*
Ҍ U+048C CYRILLIC CAPITAL LETTER SEMISOFT SIGN "
ҍ U+048D CYRILLIC SMALL LETTER SEMISOFT SIGN "

Output differences:

Char Codepoint Name translit Unidecode anyascii
Ѐ U+0400 CYRILLIC CAPITAL LETTER IE WITH GRAVE E Ie E
Ё U+0401 CYRILLIC CAPITAL LETTER IO Yo Io E
Ѓ U+0403 CYRILLIC CAPITAL LETTER GJE G Gj G
Є U+0404 CYRILLIC CAPITAL LETTER UKRAINIAN IE Ye Ie Ie
Ќ U+040C CYRILLIC CAPITAL LETTER KJE K Kj K
Й U+0419 CYRILLIC CAPITAL LETTER SHORT I Y I Y
Щ U+0429 CYRILLIC CAPITAL LETTER SHCHA Sht Shch Shch
Ъ U+042A CYRILLIC CAPITAL LETTER HARD SIGN A ' '
Ю U+042E CYRILLIC CAPITAL LETTER YU Yu Iu Yu
Я U+042F CYRILLIC CAPITAL LETTER YA Ya Ia Ya
й U+0439 CYRILLIC SMALL LETTER SHORT I y i y
щ U+0449 CYRILLIC SMALL LETTER SHCHA sht shch shch
ъ U+044A CYRILLIC SMALL LETTER HARD SIGN a ' '
ю U+044E CYRILLIC SMALL LETTER YU yu iu yu
я U+044F CYRILLIC SMALL LETTER YA ya ia ya
ѐ U+0450 CYRILLIC SMALL LETTER IE WITH GRAVE e ie e
ё U+0451 CYRILLIC SMALL LETTER IO yo io e
ѓ U+0453 CYRILLIC SMALL LETTER GJE g gj g
є U+0454 CYRILLIC SMALL LETTER UKRAINIAN IE ye ie ie
ќ U+045C CYRILLIC SMALL LETTER KJE k kj k
Ѣ U+0462 CYRILLIC CAPITAL LETTER YAT Ye E E
ѣ U+0463 CYRILLIC SMALL LETTER YAT ye e e
Ѹ U+0478 CYRILLIC CAPITAL LETTER UK U u U
Ҁ U+0480 CYRILLIC CAPITAL LETTER KOPPA K Q Q
ҁ U+0481 CYRILLIC SMALL LETTER KOPPA k q q
Ҏ U+048E CYRILLIC CAPITAL LETTER ER WITH TICK R R' Rh
ҏ U+048F CYRILLIC SMALL LETTER ER WITH TICK r r' rh
Ґ U+0490 CYRILLIC CAPITAL LETTER GHE WITH UPTURN G G' G
ґ U+0491 CYRILLIC SMALL LETTER GHE WITH UPTURN g g' g
Ғ U+0492 CYRILLIC CAPITAL LETTER GHE WITH STROKE G G' Gh
ғ U+0493 CYRILLIC SMALL LETTER GHE WITH STROKE g g' gh
Ҕ U+0494 CYRILLIC CAPITAL LETTER GHE WITH MIDDLE HOOK G G' Gh
ҕ U+0495 CYRILLIC SMALL LETTER GHE WITH MIDDLE HOOK g g' gh
Җ U+0496 CYRILLIC CAPITAL LETTER ZHE WITH DESCENDER Zh Zh' J
җ U+0497 CYRILLIC SMALL LETTER ZHE WITH DESCENDER zh zh' j
Ҙ U+0498 CYRILLIC CAPITAL LETTER ZE WITH DESCENDER Z Z' Z
ҙ U+0499 CYRILLIC SMALL LETTER ZE WITH DESCENDER z z' z
Қ U+049A CYRILLIC CAPITAL LETTER KA WITH DESCENDER K K' Q
қ U+049B CYRILLIC SMALL LETTER KA WITH DESCENDER k k' q
Ҝ U+049C CYRILLIC CAPITAL LETTER KA WITH VERTICAL STROKE K K' G
ҝ U+049D CYRILLIC SMALL LETTER KA WITH VERTICAL STROKE k k' g
Ҟ U+049E CYRILLIC CAPITAL LETTER KA WITH STROKE K K' Q
ҟ U+049F CYRILLIC SMALL LETTER KA WITH STROKE k k' q
Ҡ U+04A0 CYRILLIC CAPITAL LETTER BASHKIR KA K K' Q
ҡ U+04A1 CYRILLIC SMALL LETTER BASHKIR KA k k' q
Ң U+04A2 CYRILLIC CAPITAL LETTER EN WITH DESCENDER N N' Ng
ң U+04A3 CYRILLIC SMALL LETTER EN WITH DESCENDER n n' ng
Ҧ U+04A6 CYRILLIC CAPITAL LETTER PE WITH MIDDLE HOOK P P' Ph
ҧ U+04A7 CYRILLIC SMALL LETTER PE WITH MIDDLE HOOK p p' ph
Ҫ U+04AA CYRILLIC CAPITAL LETTER ES WITH DESCENDER S S' S
...28 more differences

el — Greek

Block: 135 assigned codepoints, 135 mapped by at least one library.

Coverage: translit maps 135/135, Unidecode maps 106/135. 29 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
Ͱ U+0370 GREEK CAPITAL LETTER HETA H
ͱ U+0371 GREEK SMALL LETTER HETA h
Ͳ U+0372 GREEK CAPITAL LETTER ARCHAIC SAMPI Ss
ͳ U+0373 GREEK SMALL LETTER ARCHAIC SAMPI ss
Ͷ U+0376 GREEK CAPITAL LETTER PAMPHYLIAN DIGAMMA W
ͷ U+0377 GREEK SMALL LETTER PAMPHYLIAN DIGAMMA w
ͺ U+037A GREEK YPOGEGRAMMENI i
ͻ U+037B GREEK SMALL REVERSED LUNATE SIGMA SYMBOL s
ͼ U+037C GREEK SMALL DOTTED LUNATE SIGMA SYMBOL s
ͽ U+037D GREEK SMALL REVERSED DOTTED LUNATE SIGMA SYMBOL s
; U+037E GREEK QUESTION MARK ;
Ϳ U+037F GREEK CAPITAL LETTER YOT J
΄ U+0384 GREEK TONOS '
΅ U+0385 GREEK DIALYTIKA TONOS "
Ϗ U+03CF GREEK CAPITAL KAI SYMBOL K
Ϙ U+03D8 GREEK LETTER ARCHAIC KOPPA Q
ϙ U+03D9 GREEK SMALL LETTER ARCHAIC KOPPA q
ϴ U+03F4 GREEK CAPITAL THETA SYMBOL Th
ϵ U+03F5 GREEK LUNATE EPSILON SYMBOL e
϶ U+03F6 GREEK REVERSED LUNATE EPSILON SYMBOL e
Ϸ U+03F7 GREEK CAPITAL LETTER SHO Sh
ϸ U+03F8 GREEK SMALL LETTER SHO sh
Ϲ U+03F9 GREEK CAPITAL LUNATE SIGMA SYMBOL S
Ϻ U+03FA GREEK CAPITAL LETTER SAN S
ϻ U+03FB GREEK SMALL LETTER SAN s
ϼ U+03FC GREEK RHO WITH STROKE SYMBOL r
Ͻ U+03FD GREEK CAPITAL REVERSED LUNATE SIGMA SYMBOL S
Ͼ U+03FE GREEK CAPITAL DOTTED LUNATE SIGMA SYMBOL S
Ͽ U+03FF GREEK CAPITAL REVERSED DOTTED LUNATE SIGMA SYMBOL S

Output differences:

Char Codepoint Name translit Unidecode anyascii
· U+0387 GREEK ANO TELEIA . ; ;
Ή U+0389 GREEK CAPITAL LETTER ETA WITH TONOS I E I
Ύ U+038E GREEK CAPITAL LETTER UPSILON WITH TONOS Y U Y
ΐ U+0390 GREEK SMALL LETTER IOTA WITH DIALYTIKA AND TONOS i I i
Η U+0397 GREEK CAPITAL LETTER ETA I E I
Ξ U+039E GREEK CAPITAL LETTER XI X Ks X
Υ U+03A5 GREEK CAPITAL LETTER UPSILON Y U Y
Φ U+03A6 GREEK CAPITAL LETTER PHI F Ph F
Χ U+03A7 GREEK CAPITAL LETTER CHI Ch Kh Ch
Ϋ U+03AB GREEK CAPITAL LETTER UPSILON WITH DIALYTIKA Y U Y
ή U+03AE GREEK SMALL LETTER ETA WITH TONOS i e i
ΰ U+03B0 GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND TONOS y u y
η U+03B7 GREEK SMALL LETTER ETA i e i
υ U+03C5 GREEK SMALL LETTER UPSILON y u y
φ U+03C6 GREEK SMALL LETTER PHI f ph f
χ U+03C7 GREEK SMALL LETTER CHI ch kh ch
ϋ U+03CB GREEK SMALL LETTER UPSILON WITH DIALYTIKA y u y
ύ U+03CD GREEK SMALL LETTER UPSILON WITH TONOS y u y
ϒ U+03D2 GREEK UPSILON WITH HOOK SYMBOL Y U Y
ϓ U+03D3 GREEK UPSILON WITH ACUTE AND HOOK SYMBOL Y U Y
ϔ U+03D4 GREEK UPSILON WITH DIAERESIS AND HOOK SYMBOL Y U Y
ϗ U+03D7 GREEK KAI SYMBOL k & &
Ϡ U+03E0 GREEK LETTER SAMPI Ss Sp S
ϡ U+03E1 GREEK SMALL LETTER SAMPI ss sp s
Ϣ U+03E2 COPTIC CAPITAL LETTER SHEI sh Sh Sh
Ϥ U+03E4 COPTIC CAPITAL LETTER FEI f F F
Ϧ U+03E6 COPTIC CAPITAL LETTER KHEI kh Kh X
Ϩ U+03E8 COPTIC CAPITAL LETTER HORI h H H
Ϫ U+03EA COPTIC CAPITAL LETTER GANGIA j G J
ϫ U+03EB COPTIC SMALL LETTER GANGIA j g j
Ϭ U+03EC COPTIC CAPITAL LETTER SHIMA c CH C
ϭ U+03ED COPTIC SMALL LETTER SHIMA c ch c
Ϯ U+03EE COPTIC CAPITAL LETTER DEI ti Ti Ti
ϲ U+03F2 GREEK LUNATE SIGMA SYMBOL s c s

sr — Serbian

Block: 304 assigned codepoints, 301 mapped by at least one library.

Coverage: translit maps 290/301, Unidecode maps 234/301. 65 mapped only by translit, 9 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
Ҋ U+048A CYRILLIC CAPITAL LETTER SHORT I WITH TAIL Y
ҋ U+048B CYRILLIC SMALL LETTER SHORT I WITH TAIL y
Ӆ U+04C5 CYRILLIC CAPITAL LETTER EL WITH TAIL L
ӆ U+04C6 CYRILLIC SMALL LETTER EL WITH TAIL l
Ӊ U+04C9 CYRILLIC CAPITAL LETTER EN WITH TAIL N
ӊ U+04CA CYRILLIC SMALL LETTER EN WITH TAIL n
Ӎ U+04CD CYRILLIC CAPITAL LETTER EM WITH TAIL M
ӎ U+04CE CYRILLIC SMALL LETTER EM WITH TAIL m
ӏ U+04CF CYRILLIC SMALL LETTER PALOCHKA i
Ӷ U+04F6 CYRILLIC CAPITAL LETTER GHE WITH DESCENDER G
ӷ U+04F7 CYRILLIC SMALL LETTER GHE WITH DESCENDER g
Ӻ U+04FA CYRILLIC CAPITAL LETTER GHE WITH STROKE AND HOOK G
ӻ U+04FB CYRILLIC SMALL LETTER GHE WITH STROKE AND HOOK g
Ӽ U+04FC CYRILLIC CAPITAL LETTER HA WITH HOOK Kh
ӽ U+04FD CYRILLIC SMALL LETTER HA WITH HOOK kh
Ӿ U+04FE CYRILLIC CAPITAL LETTER HA WITH STROKE Kh
ӿ U+04FF CYRILLIC SMALL LETTER HA WITH STROKE kh
Ԁ U+0500 CYRILLIC CAPITAL LETTER KOMI DE D
ԁ U+0501 CYRILLIC SMALL LETTER KOMI DE d
Ԃ U+0502 CYRILLIC CAPITAL LETTER KOMI DJE Dj
ԃ U+0503 CYRILLIC SMALL LETTER KOMI DJE dj
Ԅ U+0504 CYRILLIC CAPITAL LETTER KOMI ZJE Z
ԅ U+0505 CYRILLIC SMALL LETTER KOMI ZJE z
Ԇ U+0506 CYRILLIC CAPITAL LETTER KOMI DZJE Dz
ԇ U+0507 CYRILLIC SMALL LETTER KOMI DZJE dz
Ԉ U+0508 CYRILLIC CAPITAL LETTER KOMI LJE Lj
ԉ U+0509 CYRILLIC SMALL LETTER KOMI LJE lj
Ԋ U+050A CYRILLIC CAPITAL LETTER KOMI NJE Nj
ԋ U+050B CYRILLIC SMALL LETTER KOMI NJE nj
Ԍ U+050C CYRILLIC CAPITAL LETTER KOMI SJE Sj
...35 more

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
Ъ U+042A CYRILLIC CAPITAL LETTER HARD SIGN '
Ь U+042C CYRILLIC CAPITAL LETTER SOFT SIGN '
ъ U+044A CYRILLIC SMALL LETTER HARD SIGN '
ь U+044C CYRILLIC SMALL LETTER SOFT SIGN '
҂ U+0482 CYRILLIC THOUSANDS SIGN *1000*
҈ U+0488 COMBINING CYRILLIC HUNDRED THOUSANDS SIGN *100.000*
҉ U+0489 COMBINING CYRILLIC MILLIONS SIGN *1.000.000*
Ҍ U+048C CYRILLIC CAPITAL LETTER SEMISOFT SIGN "
ҍ U+048D CYRILLIC SMALL LETTER SEMISOFT SIGN "

Output differences:

Char Codepoint Name translit Unidecode anyascii
Ѐ U+0400 CYRILLIC CAPITAL LETTER IE WITH GRAVE E Ie E
Ё U+0401 CYRILLIC CAPITAL LETTER IO Yo Io E
Ѓ U+0403 CYRILLIC CAPITAL LETTER GJE G Gj G
Є U+0404 CYRILLIC CAPITAL LETTER UKRAINIAN IE Ye Ie Ie
Ћ U+040B CYRILLIC CAPITAL LETTER TSHE C Tsh C
Ќ U+040C CYRILLIC CAPITAL LETTER KJE K Kj K
Џ U+040F CYRILLIC CAPITAL LETTER DZHE Dz Dzh Dzh
Й U+0419 CYRILLIC CAPITAL LETTER SHORT I Y I Y
Ю U+042E CYRILLIC CAPITAL LETTER YU Yu Iu Yu
Я U+042F CYRILLIC CAPITAL LETTER YA Ya Ia Ya
й U+0439 CYRILLIC SMALL LETTER SHORT I y i y
ю U+044E CYRILLIC SMALL LETTER YU yu iu yu
я U+044F CYRILLIC SMALL LETTER YA ya ia ya
ѐ U+0450 CYRILLIC SMALL LETTER IE WITH GRAVE e ie e
ё U+0451 CYRILLIC SMALL LETTER IO yo io e
ѓ U+0453 CYRILLIC SMALL LETTER GJE g gj g
є U+0454 CYRILLIC SMALL LETTER UKRAINIAN IE ye ie ie
ћ U+045B CYRILLIC SMALL LETTER TSHE c tsh c
ќ U+045C CYRILLIC SMALL LETTER KJE k kj k
џ U+045F CYRILLIC SMALL LETTER DZHE dz dzh dzh
Ѣ U+0462 CYRILLIC CAPITAL LETTER YAT Ye E E
ѣ U+0463 CYRILLIC SMALL LETTER YAT ye e e
Ѹ U+0478 CYRILLIC CAPITAL LETTER UK U u U
Ҁ U+0480 CYRILLIC CAPITAL LETTER KOPPA K Q Q
ҁ U+0481 CYRILLIC SMALL LETTER KOPPA k q q
Ҏ U+048E CYRILLIC CAPITAL LETTER ER WITH TICK R R' Rh
ҏ U+048F CYRILLIC SMALL LETTER ER WITH TICK r r' rh
Ґ U+0490 CYRILLIC CAPITAL LETTER GHE WITH UPTURN G G' G
ґ U+0491 CYRILLIC SMALL LETTER GHE WITH UPTURN g g' g
Ғ U+0492 CYRILLIC CAPITAL LETTER GHE WITH STROKE G G' Gh
ғ U+0493 CYRILLIC SMALL LETTER GHE WITH STROKE g g' gh
Ҕ U+0494 CYRILLIC CAPITAL LETTER GHE WITH MIDDLE HOOK G G' Gh
ҕ U+0495 CYRILLIC SMALL LETTER GHE WITH MIDDLE HOOK g g' gh
Җ U+0496 CYRILLIC CAPITAL LETTER ZHE WITH DESCENDER Zh Zh' J
җ U+0497 CYRILLIC SMALL LETTER ZHE WITH DESCENDER zh zh' j
Ҙ U+0498 CYRILLIC CAPITAL LETTER ZE WITH DESCENDER Z Z' Z
ҙ U+0499 CYRILLIC SMALL LETTER ZE WITH DESCENDER z z' z
Қ U+049A CYRILLIC CAPITAL LETTER KA WITH DESCENDER K K' Q
қ U+049B CYRILLIC SMALL LETTER KA WITH DESCENDER k k' q
Ҝ U+049C CYRILLIC CAPITAL LETTER KA WITH VERTICAL STROKE K K' G
ҝ U+049D CYRILLIC SMALL LETTER KA WITH VERTICAL STROKE k k' g
Ҟ U+049E CYRILLIC CAPITAL LETTER KA WITH STROKE K K' Q
ҟ U+049F CYRILLIC SMALL LETTER KA WITH STROKE k k' q
Ҡ U+04A0 CYRILLIC CAPITAL LETTER BASHKIR KA K K' Q
ҡ U+04A1 CYRILLIC SMALL LETTER BASHKIR KA k k' q
Ң U+04A2 CYRILLIC CAPITAL LETTER EN WITH DESCENDER N N' Ng
ң U+04A3 CYRILLIC SMALL LETTER EN WITH DESCENDER n n' ng
Ҧ U+04A6 CYRILLIC CAPITAL LETTER PE WITH MIDDLE HOOK P P' Ph
ҧ U+04A7 CYRILLIC SMALL LETTER PE WITH MIDDLE HOOK p p' ph
Ҫ U+04AA CYRILLIC CAPITAL LETTER ES WITH DESCENDER S S' S
...28 more differences

uk — Ukrainian

Block: 304 assigned codepoints, 301 mapped by at least one library.

Coverage: translit maps 292/301, Unidecode maps 234/301. 65 mapped only by translit, 7 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
Ҋ U+048A CYRILLIC CAPITAL LETTER SHORT I WITH TAIL Y
ҋ U+048B CYRILLIC SMALL LETTER SHORT I WITH TAIL y
Ӆ U+04C5 CYRILLIC CAPITAL LETTER EL WITH TAIL L
ӆ U+04C6 CYRILLIC SMALL LETTER EL WITH TAIL l
Ӊ U+04C9 CYRILLIC CAPITAL LETTER EN WITH TAIL N
ӊ U+04CA CYRILLIC SMALL LETTER EN WITH TAIL n
Ӎ U+04CD CYRILLIC CAPITAL LETTER EM WITH TAIL M
ӎ U+04CE CYRILLIC SMALL LETTER EM WITH TAIL m
ӏ U+04CF CYRILLIC SMALL LETTER PALOCHKA i
Ӷ U+04F6 CYRILLIC CAPITAL LETTER GHE WITH DESCENDER G
ӷ U+04F7 CYRILLIC SMALL LETTER GHE WITH DESCENDER g
Ӻ U+04FA CYRILLIC CAPITAL LETTER GHE WITH STROKE AND HOOK G
ӻ U+04FB CYRILLIC SMALL LETTER GHE WITH STROKE AND HOOK g
Ӽ U+04FC CYRILLIC CAPITAL LETTER HA WITH HOOK Kh
ӽ U+04FD CYRILLIC SMALL LETTER HA WITH HOOK kh
Ӿ U+04FE CYRILLIC CAPITAL LETTER HA WITH STROKE Kh
ӿ U+04FF CYRILLIC SMALL LETTER HA WITH STROKE kh
Ԁ U+0500 CYRILLIC CAPITAL LETTER KOMI DE D
ԁ U+0501 CYRILLIC SMALL LETTER KOMI DE d
Ԃ U+0502 CYRILLIC CAPITAL LETTER KOMI DJE Dj
ԃ U+0503 CYRILLIC SMALL LETTER KOMI DJE dj
Ԅ U+0504 CYRILLIC CAPITAL LETTER KOMI ZJE Z
ԅ U+0505 CYRILLIC SMALL LETTER KOMI ZJE z
Ԇ U+0506 CYRILLIC CAPITAL LETTER KOMI DZJE Dz
ԇ U+0507 CYRILLIC SMALL LETTER KOMI DZJE dz
Ԉ U+0508 CYRILLIC CAPITAL LETTER KOMI LJE Lj
ԉ U+0509 CYRILLIC SMALL LETTER KOMI LJE lj
Ԋ U+050A CYRILLIC CAPITAL LETTER KOMI NJE Nj
ԋ U+050B CYRILLIC SMALL LETTER KOMI NJE nj
Ԍ U+050C CYRILLIC CAPITAL LETTER KOMI SJE Sj
...35 more

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
Ъ U+042A CYRILLIC CAPITAL LETTER HARD SIGN '
ъ U+044A CYRILLIC SMALL LETTER HARD SIGN '
҂ U+0482 CYRILLIC THOUSANDS SIGN *1000*
҈ U+0488 COMBINING CYRILLIC HUNDRED THOUSANDS SIGN *100.000*
҉ U+0489 COMBINING CYRILLIC MILLIONS SIGN *1.000.000*
Ҍ U+048C CYRILLIC CAPITAL LETTER SEMISOFT SIGN "
ҍ U+048D CYRILLIC SMALL LETTER SEMISOFT SIGN "

Output differences:

Char Codepoint Name translit Unidecode anyascii
Ѐ U+0400 CYRILLIC CAPITAL LETTER IE WITH GRAVE E Ie E
Ё U+0401 CYRILLIC CAPITAL LETTER IO Yo Io E
Ѓ U+0403 CYRILLIC CAPITAL LETTER GJE G Gj G
Є U+0404 CYRILLIC CAPITAL LETTER UKRAINIAN IE Ye Ie Ie
Ї U+0407 CYRILLIC CAPITAL LETTER YI I Yi I
Ќ U+040C CYRILLIC CAPITAL LETTER KJE K Kj K
Г U+0413 CYRILLIC CAPITAL LETTER GHE H G G
И U+0418 CYRILLIC CAPITAL LETTER I Y I I
Й U+0419 CYRILLIC CAPITAL LETTER SHORT I Y I Y
Ю U+042E CYRILLIC CAPITAL LETTER YU Yu Iu Yu
Я U+042F CYRILLIC CAPITAL LETTER YA Ya Ia Ya
г U+0433 CYRILLIC SMALL LETTER GHE h g g
и U+0438 CYRILLIC SMALL LETTER I y i i
й U+0439 CYRILLIC SMALL LETTER SHORT I y i y
ю U+044E CYRILLIC SMALL LETTER YU yu iu yu
я U+044F CYRILLIC SMALL LETTER YA ya ia ya
ѐ U+0450 CYRILLIC SMALL LETTER IE WITH GRAVE e ie e
ё U+0451 CYRILLIC SMALL LETTER IO yo io e
ѓ U+0453 CYRILLIC SMALL LETTER GJE g gj g
є U+0454 CYRILLIC SMALL LETTER UKRAINIAN IE ye ie ie
ї U+0457 CYRILLIC SMALL LETTER YI i yi i
ќ U+045C CYRILLIC SMALL LETTER KJE k kj k
Ѣ U+0462 CYRILLIC CAPITAL LETTER YAT Ye E E
ѣ U+0463 CYRILLIC SMALL LETTER YAT ye e e
Ѹ U+0478 CYRILLIC CAPITAL LETTER UK U u U
Ҁ U+0480 CYRILLIC CAPITAL LETTER KOPPA K Q Q
ҁ U+0481 CYRILLIC SMALL LETTER KOPPA k q q
Ҏ U+048E CYRILLIC CAPITAL LETTER ER WITH TICK R R' Rh
ҏ U+048F CYRILLIC SMALL LETTER ER WITH TICK r r' rh
Ґ U+0490 CYRILLIC CAPITAL LETTER GHE WITH UPTURN G G' G
ґ U+0491 CYRILLIC SMALL LETTER GHE WITH UPTURN g g' g
Ғ U+0492 CYRILLIC CAPITAL LETTER GHE WITH STROKE G G' Gh
ғ U+0493 CYRILLIC SMALL LETTER GHE WITH STROKE g g' gh
Ҕ U+0494 CYRILLIC CAPITAL LETTER GHE WITH MIDDLE HOOK G G' Gh
ҕ U+0495 CYRILLIC SMALL LETTER GHE WITH MIDDLE HOOK g g' gh
Җ U+0496 CYRILLIC CAPITAL LETTER ZHE WITH DESCENDER Zh Zh' J
җ U+0497 CYRILLIC SMALL LETTER ZHE WITH DESCENDER zh zh' j
Ҙ U+0498 CYRILLIC CAPITAL LETTER ZE WITH DESCENDER Z Z' Z
ҙ U+0499 CYRILLIC SMALL LETTER ZE WITH DESCENDER z z' z
Қ U+049A CYRILLIC CAPITAL LETTER KA WITH DESCENDER K K' Q
қ U+049B CYRILLIC SMALL LETTER KA WITH DESCENDER k k' q
Ҝ U+049C CYRILLIC CAPITAL LETTER KA WITH VERTICAL STROKE K K' G
ҝ U+049D CYRILLIC SMALL LETTER KA WITH VERTICAL STROKE k k' g
Ҟ U+049E CYRILLIC CAPITAL LETTER KA WITH STROKE K K' Q
ҟ U+049F CYRILLIC SMALL LETTER KA WITH STROKE k k' q
Ҡ U+04A0 CYRILLIC CAPITAL LETTER BASHKIR KA K K' Q
ҡ U+04A1 CYRILLIC SMALL LETTER BASHKIR KA k k' q
Ң U+04A2 CYRILLIC CAPITAL LETTER EN WITH DESCENDER N N' Ng
ң U+04A3 CYRILLIC SMALL LETTER EN WITH DESCENDER n n' ng
Ҧ U+04A6 CYRILLIC CAPITAL LETTER PE WITH MIDDLE HOOK P P' Ph
...30 more differences

vi — Vietnamese

Block: 656 assigned codepoints, 656 mapped by at least one library.

Coverage: translit maps 647/656, Unidecode maps 645/656. 2 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
Ɂ U+0241 LATIN CAPITAL LETTER GLOTTAL STOP '
ɂ U+0242 LATIN SMALL LETTER GLOTTAL STOP '

Output differences:

Char Codepoint Name translit Unidecode anyascii
ʼn U+0149 LATIN SMALL LETTER N PRECEDED BY APOSTROPHE n 'n 'n
Ŋ U+014A LATIN CAPITAL LETTER ENG N NG Ng
ŋ U+014B LATIN SMALL LETTER ENG n ng ng
Ƅ U+0184 LATIN CAPITAL LETTER TONE SIX B 6 6
ƅ U+0185 LATIN SMALL LETTER TONE SIX b 6 6
Ǝ U+018E LATIN CAPITAL LETTER REVERSED E D 3 E
Ə U+018F LATIN CAPITAL LETTER SCHWA A @ E
Ɯ U+019C LATIN CAPITAL LETTER TURNED M M W W
Ʀ U+01A6 LATIN LETTER YR R YR R
Ƨ U+01A7 LATIN CAPITAL LETTER TONE TWO S 2 2
ƨ U+01A8 LATIN SMALL LETTER TONE TWO s 2 2
Ʃ U+01A9 LATIN CAPITAL LETTER ESH Sh SH Sh
ƪ U+01AA LATIN LETTER REVERSED ESH LOOP s sh sh
Ʊ U+01B1 LATIN CAPITAL LETTER UPSILON U Y U
Ʒ U+01B7 LATIN CAPITAL LETTER EZH Zh ZH Zh
Ƹ U+01B8 LATIN CAPITAL LETTER EZH REVERSED Zh ZH ```
ǂ U+01C2 LATIN LETTER ALVEOLAR CLICK ! |= qc
ǝ U+01DD LATIN SMALL LETTER TURNED E e @ e
Ǯ U+01EE LATIN CAPITAL LETTER EZH WITH CARON Zh ZH Zh
Ƕ U+01F6 LATIN CAPITAL LETTER HWAIR Hv HV Hw
Ȝ U+021C LATIN CAPITAL LETTER YOGH Yh Y Y
ȝ U+021D LATIN SMALL LETTER YOGH yh y y
Ʌ U+0245 LATIN CAPITAL LETTER TURNED V V ^ A
Ɋ U+024A LATIN CAPITAL LETTER SMALL Q WITH HOOK TAIL Q q Q
ẛ U+1E9B LATIN SMALL LETTER LONG S WITH DOT ABOVE s S s

ja — Japanese

Block: 248 assigned codepoints, 248 mapped by at least one library.

Coverage: translit maps 237/248, Unidecode maps 240/248. 4 mapped only by translit, 7 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
ゕ U+3095 HIRAGANA LETTER SMALL KA ka
ゖ U+3096 HIRAGANA LETTER SMALL KE ke
ゟ U+309F HIRAGANA DIGRAPH YORI yori
ヿ U+30FF KATAKANA DIGRAPH KOTO koto

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
ゝ U+309D HIRAGANA ITERATION MARK "
ゞ U+309E HIRAGANA VOICED ITERATION MARK "
ー U+30FC KATAKANA-HIRAGANA PROLONGED SOUND MARK -
ヽ U+30FD KATAKANA ITERATION MARK "
ヾ U+30FE KATAKANA VOICED ITERATION MARK "
ﾞ U+FF9E HALFWIDTH KATAKANA VOICED SOUND MARK :
ﾟ U+FF9F HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK ;

Output differences:

Char Codepoint Name translit Unidecode anyascii
じ U+3058 HIRAGANA LETTER ZI ji zi ji
ふ U+3075 HIRAGANA LETTER HU fu hu fu
ジ U+30B8 KATAKANA LETTER ZI ji zi ji
フ U+30D5 KATAKANA LETTER HU fu hu fu
・ U+30FB KATAKANA MIDDLE DOT * -
･ U+FF65 HALFWIDTH KATAKANA MIDDLE DOT * -
ｯ U+FF6F HALFWIDTH KATAKANA LETTER SMALL TU tsu tu t
ｰ U+FF70 HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK - +
ｼ U+FF7C HALFWIDTH KATAKANA LETTER SI shi si shi
ﾁ U+FF81 HALFWIDTH KATAKANA LETTER TI chi ti chi
ﾂ U+FF82 HALFWIDTH KATAKANA LETTER TU tsu tu tsu
ﾌ U+FF8C HALFWIDTH KATAKANA LETTER HU fu hu fu

ja-kunrei — Japanese Kunrei

Block: 189 assigned codepoints, 189 mapped by at least one library.

Coverage: translit maps 181/189, Unidecode maps 181/189. 4 mapped only by translit, 4 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
ゕ U+3095 HIRAGANA LETTER SMALL KA ka
ゖ U+3096 HIRAGANA LETTER SMALL KE ke
ゟ U+309F HIRAGANA DIGRAPH YORI yori
ヿ U+30FF KATAKANA DIGRAPH KOTO koto

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
ゝ U+309D HIRAGANA ITERATION MARK "
ゞ U+309E HIRAGANA VOICED ITERATION MARK "
ヽ U+30FD KATAKANA ITERATION MARK "
ヾ U+30FE KATAKANA VOICED ITERATION MARK "

Output differences:

Char Codepoint Name translit Unidecode anyascii
し U+3057 HIRAGANA LETTER SI si shi shi
ち U+3061 HIRAGANA LETTER TI ti chi chi
っ U+3063 HIRAGANA LETTER SMALL TU tu tsu t
つ U+3064 HIRAGANA LETTER TU tu tsu tsu
シ U+30B7 KATAKANA LETTER SI si shi shi
チ U+30C1 KATAKANA LETTER TI ti chi chi
ッ U+30C3 KATAKANA LETTER SMALL TU tu tsu t
ツ U+30C4 KATAKANA LETTER TU tu tsu tsu
・ U+30FB KATAKANA MIDDLE DOT * -

ko — Korean

Block: 11172 assigned codepoints, 11172 mapped by at least one library.

Char Codepoint Name translit Unidecode anyascii
U+AC02 HANGUL SYLLABLE GAGG gakk gagg Gakk
U+AC17 HANGUL SYLLABLE GAC gach gac Gach
U+AC1E HANGUL SYLLABLE GAEGG gaekk gaegg Gaekk
U+AC33 HANGUL SYLLABLE GAEC gaech gaec Gaech
U+AC3A HANGUL SYLLABLE GYAGG gyakk gyagg Gyakk
U+AC4F HANGUL SYLLABLE GYAC gyach gyac Gyach
U+AC56 HANGUL SYLLABLE GYAEGG gyaekk gyaegg Gyaekk
U+AC6B HANGUL SYLLABLE GYAEC gyaech gyaec Gyaech
U+AC72 HANGUL SYLLABLE GEOGG geokk geogg Geokk
U+AC87 HANGUL SYLLABLE GEOC geoch geoc Geoch
U+AC8E HANGUL SYLLABLE GEGG gekk gegg Gekk
U+ACA3 HANGUL SYLLABLE GEC gech gec Gech
U+ACAA HANGUL SYLLABLE GYEOGG gyeokk gyeogg Gyeokk
겿 U+ACBF HANGUL SYLLABLE GYEOC gyeoch gyeoc Gyeoch
U+ACC6 HANGUL SYLLABLE GYEGG gyekk gyegg Gyekk
U+ACDB HANGUL SYLLABLE GYEC gyech gyec Gyech
U+ACE2 HANGUL SYLLABLE GOGG gokk gogg Gokk
U+ACF7 HANGUL SYLLABLE GOC goch goc Goch
U+ACFE HANGUL SYLLABLE GWAGG gwakk gwagg Gwakk
U+AD13 HANGUL SYLLABLE GWAC gwach gwac Gwach
U+AD1A HANGUL SYLLABLE GWAEGG gwaekk gwaegg Gwaekk
U+AD2F HANGUL SYLLABLE GWAEC gwaech gwaec Gwaech
U+AD36 HANGUL SYLLABLE GOEGG goekk goegg Goekk
U+AD4B HANGUL SYLLABLE GOEC goech goec Goech
U+AD52 HANGUL SYLLABLE GYOGG gyokk gyogg Gyokk
U+AD67 HANGUL SYLLABLE GYOC gyoch gyoc Gyoch
U+AD6E HANGUL SYLLABLE GUGG gukk gugg Gukk
U+AD83 HANGUL SYLLABLE GUC guch guc Guch
U+AD88 HANGUL SYLLABLE GWEO gwo gweo Gwo
U+AD89 HANGUL SYLLABLE GWEOG gwog gweog Gwog
U+AD8A HANGUL SYLLABLE GWEOGG gwokk gweogg Gwokk
U+AD8B HANGUL SYLLABLE GWEOGS gwogs gweogs Gwogs
U+AD8C HANGUL SYLLABLE GWEON gwon gweon Gwon
U+AD8D HANGUL SYLLABLE GWEONJ gwonj gweonj Gwonj
U+AD8E HANGUL SYLLABLE GWEONH gwonh gweonh Gwonh
U+AD8F HANGUL SYLLABLE GWEOD gwod gweod Gwod
U+AD90 HANGUL SYLLABLE GWEOL gwol gweol Gwol
U+AD91 HANGUL SYLLABLE GWEOLG gwolg gweolg Gwolg
U+AD92 HANGUL SYLLABLE GWEOLM gwolm gweolm Gwolm
U+AD93 HANGUL SYLLABLE GWEOLB gwolb gweolb Gwolb
U+AD94 HANGUL SYLLABLE GWEOLS gwols gweols Gwols
U+AD95 HANGUL SYLLABLE GWEOLT gwolt gweolt Gwolt
U+AD96 HANGUL SYLLABLE GWEOLP gwolp gweolp Gwolp
U+AD97 HANGUL SYLLABLE GWEOLH gwolh gweolh Gwolh
U+AD98 HANGUL SYLLABLE GWEOM gwom gweom Gwom
U+AD99 HANGUL SYLLABLE GWEOB gwob gweob Gwob
U+AD9A HANGUL SYLLABLE GWEOBS gwobs gweobs Gwobs
U+AD9B HANGUL SYLLABLE GWEOS gwos gweos Gwos
U+AD9C HANGUL SYLLABLE GWEOSS gwoss gweoss Gwoss
U+AD9D HANGUL SYLLABLE GWEONG gwong gweong Gwong
...3712 more differences
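
The syllable names in the table follow from the arithmetic decomposition of precomposed Hangul syllables defined by the Unicode Standard: each codepoint packs a lead consonant index, a vowel index and an optional final consonant index. The sketch below shows that arithmetic as an aid to reading rows such as GAGG or GWEO; it is not a description of how any of the three libraries works internally, and `decompose` is an illustrative name.

```python
import unicodedata

S_BASE = 0xAC00   # first precomposed syllable
L_COUNT = 19      # lead consonants
V_COUNT = 21      # vowels
T_COUNT = 28      # final consonants, index 0 meaning "no final"


def decompose(cp):
    """Return (lead, vowel, final) indices for a precomposed Hangul syllable."""
    index = cp - S_BASE
    if not 0 <= index < L_COUNT * V_COUNT * T_COUNT:
        raise ValueError(f"U+{cp:04X} is not a precomposed Hangul syllable")
    lead = index // (V_COUNT * T_COUNT)
    vowel = (index % (V_COUNT * T_COUNT)) // T_COUNT
    final = index % T_COUNT
    return lead, vowel, final


for cp in (0xAC02, 0xAC17, 0xAD88):
    print(f"U+{cp:04X}", unicodedata.name(chr(cp)), decompose(cp))
```

In the rows shown, final index 2 (GG) is where translit and anyascii write kk and Unidecode writes gg, final index 23 (C) is ch versus c, and vowel index 14 (WEO) is wo versus weo.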

zh — Chinese

Block: 20992 assigned codepoints, 20954 mapped by at least one library.

Coverage: translit maps 20924/20954, Unidecode maps 20642/20954. 291 mapped only by translit, 9 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+4E06 CJK UNIFIED IDEOGRAPH-4E06 han
U+4E4A CJK UNIFIED IDEOGRAPH-4E4A yi
U+4E5B CJK UNIFIED IDEOGRAPH-4E5B ya
U+4E65 CJK UNIFIED IDEOGRAPH-4E65 hu
U+4E6E CJK UNIFIED IDEOGRAPH-4E6E mao
U+4E7D CJK UNIFIED IDEOGRAPH-4E7D zhe
U+4EAA CJK UNIFIED IDEOGRAPH-4EAA ye
U+4EE9 CJK UNIFIED IDEOGRAPH-4EE9 chang
U+4F2C CJK UNIFIED IDEOGRAPH-4F2C ze
U+4F66 CJK UNIFIED IDEOGRAPH-4F66 shi
U+4F68 CJK UNIFIED IDEOGRAPH-4F68 bao
U+4FE7 CJK UNIFIED IDEOGRAPH-4FE7 zhi
U+4FEC CJK UNIFIED IDEOGRAPH-4FEC si
U+503F CJK UNIFIED IDEOGRAPH-503F ning
U+50A6 CJK UNIFIED IDEOGRAPH-50A6 gu
U+50F2 CJK UNIFIED IDEOGRAPH-50F2 xian
U+510F CJK UNIFIED IDEOGRAPH-510F can
U+516F CJK UNIFIED IDEOGRAPH-516F han
U+5307 CJK UNIFIED IDEOGRAPH-5307 yi
U+5381 CJK UNIFIED IDEOGRAPH-5381 san
U+5391 CJK UNIFIED IDEOGRAPH-5391 ya
U+53FE CJK UNIFIED IDEOGRAPH-53FE liao
U+545A CJK UNIFIED IDEOGRAPH-545A wen
U+54D6 CJK UNIFIED IDEOGRAPH-54D6 nian
U+54DB CJK UNIFIED IDEOGRAPH-54DB fen
U+5579 CJK UNIFIED IDEOGRAPH-5579 ju
U+55F4 CJK UNIFIED IDEOGRAPH-55F4 qiang
U+5691 CJK UNIFIED IDEOGRAPH-5691 xun
U+5692 CJK UNIFIED IDEOGRAPH-5692 me
U+56D5 CJK UNIFIED IDEOGRAPH-56D5 lan
...261 more

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+5159 CJK UNIFIED IDEOGRAPH-5159 Shi
U+5161 CJK UNIFIED IDEOGRAPH-5161 Bai
U+55E7 CJK UNIFIED IDEOGRAPH-55E7 Jia
U+685B CJK UNIFIED IDEOGRAPH-685B Kasei
U+74E7 CJK UNIFIED IDEOGRAPH-74E7 Dekaguramu
U+74F0 CJK UNIFIED IDEOGRAPH-74F0 Deshiguramu
U+74F1 CJK UNIFIED IDEOGRAPH-74F1 Miriguramu
U+74FC CJK UNIFIED IDEOGRAPH-74FC Sarake
U+7505 CJK UNIFIED IDEOGRAPH-7505 Senchigura

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+4E00 CJK UNIFIED IDEOGRAPH-4E00 yi Yi Yi
U+4E01 CJK UNIFIED IDEOGRAPH-4E01 ding Ding Ding
U+4E02 CJK UNIFIED IDEOGRAPH-4E02 kao Kao Kao
U+4E03 CJK UNIFIED IDEOGRAPH-4E03 qi Qi Qi
U+4E04 CJK UNIFIED IDEOGRAPH-4E04 shang Shang Shang
U+4E05 CJK UNIFIED IDEOGRAPH-4E05 xia Xia Xia
U+4E07 CJK UNIFIED IDEOGRAPH-4E07 wan Mo Wan
U+4E08 CJK UNIFIED IDEOGRAPH-4E08 zhang Zhang Zhang
U+4E09 CJK UNIFIED IDEOGRAPH-4E09 san San San
U+4E0A CJK UNIFIED IDEOGRAPH-4E0A shang Shang Shang
U+4E0B CJK UNIFIED IDEOGRAPH-4E0B xia Xia Xia
U+4E0C CJK UNIFIED IDEOGRAPH-4E0C ji Ji Ji
U+4E0D CJK UNIFIED IDEOGRAPH-4E0D bu Bu Bu
U+4E0E CJK UNIFIED IDEOGRAPH-4E0E yu Yu Yu
U+4E0F CJK UNIFIED IDEOGRAPH-4E0F mian Mian Mian
U+4E10 CJK UNIFIED IDEOGRAPH-4E10 gai Gai Gai
U+4E11 CJK UNIFIED IDEOGRAPH-4E11 chou Chou Chou
U+4E12 CJK UNIFIED IDEOGRAPH-4E12 chou Chou Chou
U+4E13 CJK UNIFIED IDEOGRAPH-4E13 zhuan Zhuan Zhuan
U+4E14 CJK UNIFIED IDEOGRAPH-4E14 qie Qie Qie
U+4E15 CJK UNIFIED IDEOGRAPH-4E15 pi Pi Pi
U+4E16 CJK UNIFIED IDEOGRAPH-4E16 shi Shi Shi
U+4E17 CJK UNIFIED IDEOGRAPH-4E17 shi Shi Shi
U+4E18 CJK UNIFIED IDEOGRAPH-4E18 qiu Qiu Qiu
U+4E19 CJK UNIFIED IDEOGRAPH-4E19 bing Bing Bing
U+4E1A CJK UNIFIED IDEOGRAPH-4E1A ye Ye Ye
U+4E1B CJK UNIFIED IDEOGRAPH-4E1B cong Cong Cong
U+4E1C CJK UNIFIED IDEOGRAPH-4E1C dong Dong Dong
U+4E1D CJK UNIFIED IDEOGRAPH-4E1D si Si Si
U+4E1E CJK UNIFIED IDEOGRAPH-4E1E cheng Cheng Cheng
U+4E1F CJK UNIFIED IDEOGRAPH-4E1F diu Diu Diu
U+4E20 CJK UNIFIED IDEOGRAPH-4E20 qiu Qiu Qiu
U+4E21 CJK UNIFIED IDEOGRAPH-4E21 liang Liang Liang
U+4E22 CJK UNIFIED IDEOGRAPH-4E22 diu Diu Diu
U+4E23 CJK UNIFIED IDEOGRAPH-4E23 you You You
U+4E24 CJK UNIFIED IDEOGRAPH-4E24 liang Liang Liang
U+4E25 CJK UNIFIED IDEOGRAPH-4E25 yan Yan Yan
U+4E26 CJK UNIFIED IDEOGRAPH-4E26 bing Bing Bing
U+4E27 CJK UNIFIED IDEOGRAPH-4E27 sang Sang Sang
U+4E28 CJK UNIFIED IDEOGRAPH-4E28 gun Gun Gun
U+4E29 CJK UNIFIED IDEOGRAPH-4E29 jiu Jiu Jiu
U+4E2A CJK UNIFIED IDEOGRAPH-4E2A ge Ge Ge
U+4E2B CJK UNIFIED IDEOGRAPH-4E2B ya Ya Ya
U+4E2C CJK UNIFIED IDEOGRAPH-4E2C qiang Qiang Qiang
U+4E2D CJK UNIFIED IDEOGRAPH-4E2D zhong Zhong Zhong
U+4E2E CJK UNIFIED IDEOGRAPH-4E2E ji Ji Ji
U+4E2F CJK UNIFIED IDEOGRAPH-4E2F jie Jie Jie
U+4E30 CJK UNIFIED IDEOGRAPH-4E30 feng Feng Feng
U+4E31 CJK UNIFIED IDEOGRAPH-4E31 guan Guan Guan
U+4E32 CJK UNIFIED IDEOGRAPH-4E32 chuan Chuan Chuan
...20583 more differences
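
Most of the rows listed above differ only in capitalization (for example yi vs. Yi), while a few, such as U+4E07 (wan vs. Mo), differ in substance. The sketch below shows one way to separate the two kinds when reading a table like this. `diff_kind` is a hypothetical helper and is not provided by any of the three libraries.

```python
def diff_kind(a: str, b: str) -> str:
    """Classify a pair of transliterations as same, case-only, or different."""
    if a == b:
        return "same"
    if a.casefold() == b.casefold():
        return "case-only"
    return "different"


# (translit, Unidecode, anyascii) values copied from two rows of the table above.
rows = {
    "U+4E00": ("yi", "Yi", "Yi"),
    "U+4E07": ("wan", "Mo", "Wan"),
}

# Compare translit against Unidecode for each row.
kinds = {cp: diff_kind(t, u) for cp, (t, u, _a) in rows.items()}
print(kinds)
```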

ar — Arabic

Block: 248 assigned codepoints, 221 mapped by at least one library.

Coverage: translit maps 207/221, Unidecode maps 173/221. 38 mapped only by translit, 4 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
؉ U+0609 ARABIC-INDIC PER MILLE SIGN %o
؊ U+060A ARABIC-INDIC PER TEN THOUSAND SIGN %oo
؋ U+060B AFGHANI SIGN Af
؍ U+060D ARABIC DATE SEPARATOR /
ؖ U+0616 ARABIC SMALL HIGH LIGATURE ALEF WITH LAM WITH YEH aly
ؘ U+0618 ARABIC SMALL FATHA a
ؙ U+0619 ARABIC SMALL DAMMA u
ؚ U+061A ARABIC SMALL KASRA i
؝ U+061D ARABIC END OF TEXT MARK .
؞ U+061E ARABIC TRIPLE DOT PUNCTUATION MARK ...
ؠ U+0620 ARABIC LETTER KASHMIRI YEH y
ء U+0621 ARABIC LETTER HAMZA '
إ U+0625 ARABIC LETTER ALEF WITH HAMZA BELOW a
ا U+0627 ARABIC LETTER ALEF a
ػ U+063B ARABIC LETTER KEHEH WITH TWO DOTS ABOVE k
ؼ U+063C ARABIC LETTER KEHEH WITH THREE DOTS BELOW k
ؽ U+063D ARABIC LETTER FARSI YEH WITH INVERTED V y
ؾ U+063E ARABIC LETTER FARSI YEH WITH TWO DOTS ABOVE y
ؿ U+063F ARABIC LETTER FARSI YEH WITH THREE DOTS ABOVE y
ٖ U+0656 ARABIC SUBSCRIPT ALEF a
ٗ U+0657 ARABIC INVERTED DAMMA u
٘ U+0658 ARABIC MARK NOON GHUNNA n
ٝ U+065D ARABIC REVERSED DAMMA u
ٞ U+065E ARABIC FATHA WITH TWO DOTS a
ٟ U+065F ARABIC WAVY HAMZA BELOW '
ٮ U+066E ARABIC LETTER DOTLESS BEH b
ٯ U+066F ARABIC LETTER DOTLESS QAF q
ٰ U+0670 ARABIC LETTER SUPERSCRIPT ALEF a
ٴ U+0674 ARABIC LETTER HIGH HAMZA '
ې U+06D0 ARABIC LETTER E e
...8 more

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
ّ U+0651 ARABIC SHADDA W
۞ U+06DE ARABIC START OF RUB EL HIZB #
۩ U+06E9 ARABIC PLACE OF SAJDAH ^
۾ U+06FE ARABIC SIGN SINDHI POSTPOSITION MEN +m

Output differences:

Char Codepoint Name translit Unidecode anyascii
أ U+0623 ARABIC LETTER ALEF WITH HAMZA ABOVE a ' '
ؤ U+0624 ARABIC LETTER WAW WITH HAMZA ABOVE ' w' u'
ئ U+0626 ARABIC LETTER YEH WITH HAMZA ABOVE ' y' i'
ة U+0629 ARABIC LETTER TEH MARBUTA h @ h
ح U+062D ARABIC LETTER HAH h H h
ص U+0635 ARABIC LETTER SAD s S s
ض U+0636 ARABIC LETTER DAD d D d
ط U+0637 ARABIC LETTER TAH t T t
ظ U+0638 ARABIC LETTER ZAH z Z dh
ع U+0639 ARABIC LETTER AIN ' |
غ U+063A ARABIC LETTER GHAIN gh G gh
ى U+0649 ARABIC LETTER ALEF MAKSURA a ~ a
ٱ U+0671 ARABIC LETTER ALEF WASLA a ' '
ٲ U+0672 ARABIC LETTER ALEF WITH WAVY HAMZA ABOVE a ' a
ٳ U+0673 ARABIC LETTER ALEF WITH WAVY HAMZA BELOW a ' u'
ٵ U+0675 ARABIC LETTER HIGH HAMZA ALEF a ' a
ٶ U+0676 ARABIC LETTER HIGH HAMZA WAW w 'w o
ٷ U+0677 ARABIC LETTER U WITH HAMZA ABOVE u' 'u u
ٸ U+0678 ARABIC LETTER HIGH HAMZA YEH y 'y i
ٹ U+0679 ARABIC LETTER TTEH t tt t
ٺ U+067A ARABIC LETTER TTEHEH t tth th
ٽ U+067D ARABIC LETTER TEH WITH THREE DOTS ABOVE DOWNWARDS t T t
ٿ U+067F ARABIC LETTER TEHEH t th th
ڀ U+0680 ARABIC LETTER BEHEH b bh bh
ځ U+0681 ARABIC LETTER HAH WITH HAMZA ABOVE h 'h dz
ڂ U+0682 ARABIC LETTER HAH WITH TWO DOTS VERTICAL ABOVE h H dz
څ U+0685 ARABIC LETTER HAH WITH THREE DOTS ABOVE h H ts
ڇ U+0687 ARABIC LETTER TCHEHEH ch cch ch
ڈ U+0688 ARABIC LETTER DDAL d dd d
ډ U+0689 ARABIC LETTER DAL WITH RING d D d
ڊ U+068A ARABIC LETTER DAL WITH DOT BELOW d D d
ڋ U+068B ARABIC LETTER DAL WITH DOT BELOW AND SMALL TAH d Dt dd
ڌ U+068C ARABIC LETTER DAHAL d dh dh
ڍ U+068D ARABIC LETTER DDAHAL d ddh dh
ڏ U+068F ARABIC LETTER DAL WITH THREE DOTS ABOVE DOWNWARDS d D d
ڐ U+0690 ARABIC LETTER DAL WITH FOUR DOTS ABOVE d D d
ڑ U+0691 ARABIC LETTER RREH r rr r
ڒ U+0692 ARABIC LETTER REH WITH SMALL V r R r
ړ U+0693 ARABIC LETTER REH WITH RING r R r
ڔ U+0694 ARABIC LETTER REH WITH DOT BELOW r R r
ڕ U+0695 ARABIC LETTER REH WITH SMALL V BELOW r R r
ږ U+0696 ARABIC LETTER REH WITH DOT BELOW AND DOT ABOVE r R zh
ڗ U+0697 ARABIC LETTER REH WITH TWO DOTS ABOVE r R d
ژ U+0698 ARABIC LETTER JEH zh j zh
ڙ U+0699 ARABIC LETTER REH WITH FOUR DOTS ABOVE r R r
ښ U+069A ARABIC LETTER SEEN WITH DOT BELOW AND DOT ABOVE s S sh
ڛ U+069B ARABIC LETTER SEEN WITH THREE DOTS BELOW s S s
ڜ U+069C ARABIC LETTER SEEN WITH THREE DOTS BELOW AND THREE DOTS ABOVE s S ch
ڝ U+069D ARABIC LETTER SAD WITH TWO DOTS BELOW s S ts
ڞ U+069E ARABIC LETTER SAD WITH THREE DOTS ABOVE s S ch
...42 more differences

fa — Persian

Block: 391 assigned codepoints, 331 mapped by at least one library.

Coverage: translit maps 207/331, Unidecode maps 173/331. 38 mapped only by translit, 4 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
؉ U+0609 ARABIC-INDIC PER MILLE SIGN %o
؊ U+060A ARABIC-INDIC PER TEN THOUSAND SIGN %oo
؋ U+060B AFGHANI SIGN Af
؍ U+060D ARABIC DATE SEPARATOR /
ؖ U+0616 ARABIC SMALL HIGH LIGATURE ALEF WITH LAM WITH YEH aly
ؘ U+0618 ARABIC SMALL FATHA a
ؙ U+0619 ARABIC SMALL DAMMA u
ؚ U+061A ARABIC SMALL KASRA i
؝ U+061D ARABIC END OF TEXT MARK .
؞ U+061E ARABIC TRIPLE DOT PUNCTUATION MARK ...
ؠ U+0620 ARABIC LETTER KASHMIRI YEH y
ء U+0621 ARABIC LETTER HAMZA '
إ U+0625 ARABIC LETTER ALEF WITH HAMZA BELOW e
ا U+0627 ARABIC LETTER ALEF a
ػ U+063B ARABIC LETTER KEHEH WITH TWO DOTS ABOVE k
ؼ U+063C ARABIC LETTER KEHEH WITH THREE DOTS BELOW k
ؽ U+063D ARABIC LETTER FARSI YEH WITH INVERTED V y
ؾ U+063E ARABIC LETTER FARSI YEH WITH TWO DOTS ABOVE y
ؿ U+063F ARABIC LETTER FARSI YEH WITH THREE DOTS ABOVE y
ٖ U+0656 ARABIC SUBSCRIPT ALEF a
ٗ U+0657 ARABIC INVERTED DAMMA u
٘ U+0658 ARABIC MARK NOON GHUNNA n
ٝ U+065D ARABIC REVERSED DAMMA u
ٞ U+065E ARABIC FATHA WITH TWO DOTS a
ٟ U+065F ARABIC WAVY HAMZA BELOW '
ٮ U+066E ARABIC LETTER DOTLESS BEH b
ٯ U+066F ARABIC LETTER DOTLESS QAF q
ٰ U+0670 ARABIC LETTER SUPERSCRIPT ALEF a
ٴ U+0674 ARABIC LETTER HIGH HAMZA '
ې U+06D0 ARABIC LETTER E e
...8 more

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
ّ U+0651 ARABIC SHADDA W
۞ U+06DE ARABIC START OF RUB EL HIZB #
۩ U+06E9 ARABIC PLACE OF SAJDAH ^
۾ U+06FE ARABIC SIGN SINDHI POSTPOSITION MEN +m

Output differences:

Char Codepoint Name translit Unidecode anyascii
أ U+0623 ARABIC LETTER ALEF WITH HAMZA ABOVE a ' '
ؤ U+0624 ARABIC LETTER WAW WITH HAMZA ABOVE ' w' u'
ئ U+0626 ARABIC LETTER YEH WITH HAMZA ABOVE ' y' i'
ة U+0629 ARABIC LETTER TEH MARBUTA e @ h
ث U+062B ARABIC LETTER THEH s th th
ح U+062D ARABIC LETTER HAH h H h
ذ U+0630 ARABIC LETTER THAL z dh dh
ص U+0635 ARABIC LETTER SAD s S s
ض U+0636 ARABIC LETTER DAD z D d
ط U+0637 ARABIC LETTER TAH t T t
ظ U+0638 ARABIC LETTER ZAH z Z dh
ع U+0639 ARABIC LETTER AIN ' |
غ U+063A ARABIC LETTER GHAIN gh G gh
و U+0648 ARABIC LETTER WAW v w w
ى U+0649 ARABIC LETTER ALEF MAKSURA a ~ a
ُ U+064F ARABIC DAMMA o u u
ِ U+0650 ARABIC KASRA e i i
ٱ U+0671 ARABIC LETTER ALEF WASLA a ' '
ٲ U+0672 ARABIC LETTER ALEF WITH WAVY HAMZA ABOVE a ' a
ٳ U+0673 ARABIC LETTER ALEF WITH WAVY HAMZA BELOW a ' u'
ٵ U+0675 ARABIC LETTER HIGH HAMZA ALEF a ' a
ٶ U+0676 ARABIC LETTER HIGH HAMZA WAW w 'w o
ٷ U+0677 ARABIC LETTER U WITH HAMZA ABOVE u' 'u u
ٸ U+0678 ARABIC LETTER HIGH HAMZA YEH y 'y i
ٹ U+0679 ARABIC LETTER TTEH t tt t
ٺ U+067A ARABIC LETTER TTEHEH t tth th
ٽ U+067D ARABIC LETTER TEH WITH THREE DOTS ABOVE DOWNWARDS t T t
ٿ U+067F ARABIC LETTER TEHEH t th th
ڀ U+0680 ARABIC LETTER BEHEH b bh bh
ځ U+0681 ARABIC LETTER HAH WITH HAMZA ABOVE h 'h dz
ڂ U+0682 ARABIC LETTER HAH WITH TWO DOTS VERTICAL ABOVE h H dz
څ U+0685 ARABIC LETTER HAH WITH THREE DOTS ABOVE h H ts
ڇ U+0687 ARABIC LETTER TCHEHEH ch cch ch
ڈ U+0688 ARABIC LETTER DDAL d dd d
ډ U+0689 ARABIC LETTER DAL WITH RING d D d
ڊ U+068A ARABIC LETTER DAL WITH DOT BELOW d D d
ڋ U+068B ARABIC LETTER DAL WITH DOT BELOW AND SMALL TAH d Dt dd
ڌ U+068C ARABIC LETTER DAHAL d dh dh
ڍ U+068D ARABIC LETTER DDAHAL d ddh dh
ڏ U+068F ARABIC LETTER DAL WITH THREE DOTS ABOVE DOWNWARDS d D d
ڐ U+0690 ARABIC LETTER DAL WITH FOUR DOTS ABOVE d D d
ڑ U+0691 ARABIC LETTER RREH r rr r
ڒ U+0692 ARABIC LETTER REH WITH SMALL V r R r
ړ U+0693 ARABIC LETTER REH WITH RING r R r
ڔ U+0694 ARABIC LETTER REH WITH DOT BELOW r R r
ڕ U+0695 ARABIC LETTER REH WITH SMALL V BELOW r R r
ږ U+0696 ARABIC LETTER REH WITH DOT BELOW AND DOT ABOVE r R zh
ڗ U+0697 ARABIC LETTER REH WITH TWO DOTS ABOVE r R d
ژ U+0698 ARABIC LETTER JEH zh j zh
ڙ U+0699 ARABIC LETTER REH WITH FOUR DOTS ABOVE r R r
...47 more differences

he — Hebrew

Block: 88 assigned codepoints, 53 mapped by at least one library.
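
The "assigned codepoints" figure can be approximated with Python's standard `unicodedata` module. This is a minimal sketch, assuming the Hebrew section covers the single block U+0590 to U+05FF and that unassigned, private-use, surrogate, and format characters are excluded by general category. The helper name `assigned_codepoints` is hypothetical. The exact count depends on the Unicode version bundled with the interpreter, so no expected number is asserted.

```python
import unicodedata

# General categories to skip: unassigned, private-use, surrogate, format.
SKIPPED = {"Cn", "Co", "Cs", "Cf"}


def assigned_codepoints(start: int, end: int) -> list:
    """Return the codepoints in [start, end] whose general category is not skipped."""
    return [
        cp for cp in range(start, end + 1)
        if unicodedata.category(chr(cp)) not in SKIPPED
    ]


# Assumed block range for Hebrew; the report's actual block list may differ.
hebrew = assigned_codepoints(0x0590, 0x05FF)
print(len(hebrew), "assigned codepoints under Unicode", unicodedata.unidata_version)
```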

Coverage: translit maps 46/53, Unidecode maps 49/53. 1 mapped only by translit, 4 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
ְ U+05B0 HEBREW POINT SHEVA e

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
׀ U+05C0 HEBREW PUNCTUATION PASEQ |
׆ U+05C6 HEBREW PUNCTUATION NUN HAFUKHA n
ע U+05E2 HEBREW LETTER AYIN `
ׯ U+05EF HEBREW YOD TRIANGLE YYY

Output differences:

Char Codepoint Name translit Unidecode anyascii
א U+05D0 HEBREW LETTER ALEF ' A '
ב U+05D1 HEBREW LETTER BET v b v
ח U+05D7 HEBREW LETTER HET ch H h
ט U+05D8 HEBREW LETTER TET t T t
ך U+05DA HEBREW LETTER FINAL KAF kh KH kh
כ U+05DB HEBREW LETTER KAF kh KH kh
ף U+05E3 HEBREW LETTER FINAL PE f p f
פ U+05E4 HEBREW LETTER PE f p f
ץ U+05E5 HEBREW LETTER FINAL TSADI ts TS ts
צ U+05E6 HEBREW LETTER TSADI ts TS ts
ק U+05E7 HEBREW LETTER QOF q k k
ש U+05E9 HEBREW LETTER SHIN sh SH s
װ U+05F0 HEBREW LIGATURE YIDDISH DOUBLE VAV v V v
ױ U+05F1 HEBREW LIGATURE YIDDISH VAV YOD vy OY oy
ײ U+05F2 HEBREW LIGATURE YIDDISH DOUBLE YOD y EY ey

hi — Hindi

Block: 128 assigned codepoints, 127 mapped by at least one library.

Coverage: translit maps 117/127, Unidecode maps 103/127. 19 mapped only by translit, 5 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0904 DEVANAGARI LETTER SHORT A a
U+0955 DEVANAGARI VOWEL SIGN CANDRA LONG E e
U+0956 DEVANAGARI VOWEL SIGN UE u
U+0957 DEVANAGARI VOWEL SIGN UUE u
U+0971 DEVANAGARI SIGN HIGH SPACING DOT .
U+0972 DEVANAGARI LETTER CANDRA A a
U+0973 DEVANAGARI LETTER OE oe
U+0974 DEVANAGARI LETTER OOE ooe
U+0975 DEVANAGARI LETTER AW aw
U+0976 DEVANAGARI LETTER UE ue
U+0977 DEVANAGARI LETTER UUE uue
U+0978 DEVANAGARI LETTER MARWARI DDA dda
U+0979 DEVANAGARI LETTER ZHA zha
U+097A DEVANAGARI LETTER HEAVY YA ya
U+097B DEVANAGARI LETTER GGA gga
U+097C DEVANAGARI LETTER JJA jja
U+097D DEVANAGARI LETTER GLOTTAL STOP '
U+097E DEVANAGARI LETTER DDDA ddda
ॿ U+097F DEVANAGARI LETTER BBA bba

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+093C DEVANAGARI SIGN NUKTA '
U+0951 DEVANAGARI STRESS SIGN UDATTA '
U+0952 DEVANAGARI STRESS SIGN ANUDATTA '
U+0953 DEVANAGARI GRAVE ACCENT `
U+0954 DEVANAGARI ACUTE ACCENT '

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0901 DEVANAGARI SIGN CANDRABINDU m N m
U+0902 DEVANAGARI SIGN ANUSVARA m N m
U+0903 DEVANAGARI SIGN VISARGA h H h
U+0908 DEVANAGARI LETTER II i ii i
U+090A DEVANAGARI LETTER UU u uu u
U+090B DEVANAGARI LETTER VOCALIC R r R r
U+090C DEVANAGARI LETTER VOCALIC L l L l
U+090D DEVANAGARI LETTER CANDRA E e eN e
U+0911 DEVANAGARI LETTER CANDRA O o oN o
U+0915 DEVANAGARI LETTER KA ka k k
U+0916 DEVANAGARI LETTER KHA kha kh kh
U+0917 DEVANAGARI LETTER GA ga g g
U+0918 DEVANAGARI LETTER GHA gha gh gh
U+0919 DEVANAGARI LETTER NGA nga ng n
U+091A DEVANAGARI LETTER CA cha c c
U+091B DEVANAGARI LETTER CHA chha ch ch
U+091C DEVANAGARI LETTER JA ja j j
U+091D DEVANAGARI LETTER JHA jha jh jh
U+091E DEVANAGARI LETTER NYA nya ny n
U+091F DEVANAGARI LETTER TTA ta tt t
U+0920 DEVANAGARI LETTER TTHA tha tth th
U+0921 DEVANAGARI LETTER DDA da dd d
U+0922 DEVANAGARI LETTER DDHA dha ddh dh
U+0923 DEVANAGARI LETTER NNA na nn n
U+0924 DEVANAGARI LETTER TA ta t t
U+0925 DEVANAGARI LETTER THA tha th th
U+0926 DEVANAGARI LETTER DA da d d
U+0927 DEVANAGARI LETTER DHA dha dh dh
U+0928 DEVANAGARI LETTER NA na n n
U+0929 DEVANAGARI LETTER NNNA na nnn n
U+092A DEVANAGARI LETTER PA pa p p
U+092B DEVANAGARI LETTER PHA pha ph ph
U+092C DEVANAGARI LETTER BA ba b b
U+092D DEVANAGARI LETTER BHA bha bh bh
U+092E DEVANAGARI LETTER MA ma m m
U+092F DEVANAGARI LETTER YA ya y y
U+0930 DEVANAGARI LETTER RA ra r r
U+0931 DEVANAGARI LETTER RRA ra rr r
U+0932 DEVANAGARI LETTER LA la l l
U+0933 DEVANAGARI LETTER LLA la l l
U+0934 DEVANAGARI LETTER LLLA la lll l
U+0935 DEVANAGARI LETTER VA va v v
U+0936 DEVANAGARI LETTER SHA sha sh s
U+0937 DEVANAGARI LETTER SSA sha ss s
U+0938 DEVANAGARI LETTER SA sa s s
U+0939 DEVANAGARI LETTER HA ha h h
U+093E DEVANAGARI VOWEL SIGN AA a aa a
U+0940 DEVANAGARI VOWEL SIGN II i ii i
U+0942 DEVANAGARI VOWEL SIGN UU u uu u
U+0943 DEVANAGARI VOWEL SIGN VOCALIC R r R r
...18 more differences

bn — Bengali

Block: 96 assigned codepoints, 95 mapped by at least one library.

Coverage: translit maps 90/95, Unidecode maps 87/95. 5 mapped only by translit, 2 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0980 BENGALI ANJI m
U+09BD BENGALI SIGN AVAGRAHA '
U+09CE BENGALI LETTER KHANDA TA t
U+09FC BENGALI LETTER VEDIC ANUSVARA m
U+09FD BENGALI ABBREVIATION SIGN .

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+09BC BENGALI SIGN NUKTA '
U+09D7 BENGALI AU LENGTH MARK +

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0981 BENGALI SIGN CANDRABINDU m N m
U+0982 BENGALI SIGN ANUSVARA m N m
U+0983 BENGALI SIGN VISARGA h H h
U+0988 BENGALI LETTER II i ii i
U+098A BENGALI LETTER UU u uu u
U+098B BENGALI LETTER VOCALIC R r R r
U+098C BENGALI LETTER VOCALIC L l RR l
U+0995 BENGALI LETTER KA ka k k
U+0996 BENGALI LETTER KHA kha kh kh
U+0997 BENGALI LETTER GA ga g g
U+0998 BENGALI LETTER GHA gha gh gh
U+0999 BENGALI LETTER NGA nga ng n
U+099A BENGALI LETTER CA cha c c
U+099B BENGALI LETTER CHA chha ch ch
U+099C BENGALI LETTER JA ja j j
U+099D BENGALI LETTER JHA jha jh jh
U+099E BENGALI LETTER NYA nya ny n
U+099F BENGALI LETTER TTA ta tt t
U+09A0 BENGALI LETTER TTHA tha tth th
U+09A1 BENGALI LETTER DDA da dd d
U+09A2 BENGALI LETTER DDHA dha ddh dh
U+09A3 BENGALI LETTER NNA na nn n
U+09A4 BENGALI LETTER TA ta t t
U+09A5 BENGALI LETTER THA tha th th
U+09A6 BENGALI LETTER DA da d d
U+09A7 BENGALI LETTER DHA dha dh dh
U+09A8 BENGALI LETTER NA na n n
U+09AA BENGALI LETTER PA pa p p
U+09AB BENGALI LETTER PHA pha ph ph
U+09AC BENGALI LETTER BA ba b b
U+09AD BENGALI LETTER BHA bha bh bh
U+09AE BENGALI LETTER MA ma m m
U+09AF BENGALI LETTER YA ya y y
U+09B0 BENGALI LETTER RA ra r r
U+09B2 BENGALI LETTER LA la l l
U+09B6 BENGALI LETTER SHA sha sh s
U+09B7 BENGALI LETTER SSA sha ss s
U+09B8 BENGALI LETTER SA sa s s
U+09B9 BENGALI LETTER HA ha h h
U+09BE BENGALI VOWEL SIGN AA a aa a
U+09C0 BENGALI VOWEL SIGN II i ii i
U+09C2 BENGALI VOWEL SIGN UU u uu u
U+09C3 BENGALI VOWEL SIGN VOCALIC R r R r
U+09C4 BENGALI VOWEL SIGN VOCALIC RR r RR r
U+09DC BENGALI LETTER RRA ra rr r
U+09DD BENGALI LETTER RHA rha rh rh
U+09DF BENGALI LETTER YYA ya yy y
U+09E0 BENGALI LETTER VOCALIC RR r RR r
U+09E1 BENGALI LETTER VOCALIC LL l LL l
U+09E2 BENGALI VOWEL SIGN VOCALIC L l L l
...9 more differences

ta — Tamil

Block: 72 assigned codepoints, 71 mapped by at least one library.

Coverage: translit maps 63/71, Unidecode maps 61/71. 3 mapped only by translit, 1 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0BB6 TAMIL LETTER SHA sha
U+0BD0 TAMIL OM om
U+0BF9 TAMIL RUPEE SIGN Rs

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+0BD7 TAMIL AU LENGTH MARK +

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0B82 TAMIL SIGN ANUSVARA m N m
U+0B83 TAMIL SIGN VISARGA h H k
U+0B88 TAMIL LETTER II i ii i
U+0B8A TAMIL LETTER UU u uu u
U+0B8F TAMIL LETTER EE e ee e
U+0B93 TAMIL LETTER OO o oo o
U+0B95 TAMIL LETTER KA ka k k
U+0B99 TAMIL LETTER NGA nga ng n
U+0B9A TAMIL LETTER CA cha c c
U+0B9C TAMIL LETTER JA ja j j
U+0B9E TAMIL LETTER NYA nya ny n
U+0B9F TAMIL LETTER TTA ta tt t
U+0BA3 TAMIL LETTER NNA na nn n
U+0BA4 TAMIL LETTER TA ta t t
U+0BA8 TAMIL LETTER NA na n n
U+0BA9 TAMIL LETTER NNNA na nnn n
U+0BAA TAMIL LETTER PA pa p p
U+0BAE TAMIL LETTER MA ma m m
U+0BAF TAMIL LETTER YA ya y y
U+0BB0 TAMIL LETTER RA ra r r
U+0BB1 TAMIL LETTER RRA ra rr r
U+0BB2 TAMIL LETTER LA la l l
U+0BB3 TAMIL LETTER LLA la ll l
U+0BB4 TAMIL LETTER LLLA zha lll l
U+0BB5 TAMIL LETTER VA va v v
U+0BB7 TAMIL LETTER SSA sha ss s
U+0BB8 TAMIL LETTER SA sa s s
U+0BB9 TAMIL LETTER HA ha h h
U+0BBE TAMIL VOWEL SIGN AA a aa a
U+0BC0 TAMIL VOWEL SIGN II i ii i
U+0BC2 TAMIL VOWEL SIGN UU u uu u
U+0BC7 TAMIL VOWEL SIGN EE e ee e
U+0BCB TAMIL VOWEL SIGN OO o oo o
U+0BF0 TAMIL NUMBER TEN 10 +10+ 10
U+0BF1 TAMIL NUMBER ONE HUNDRED 100 +100+ 100
U+0BF2 TAMIL NUMBER ONE THOUSAND 1000 +1000+ 1000

te — Telugu

Block: 100 assigned codepoints, 99 mapped by at least one library.

Coverage: translit maps 92/99, Unidecode maps 79/99. 15 mapped only by translit, 2 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0C34 TELUGU LETTER LLLA lla
U+0C3D TELUGU SIGN AVAGRAHA '
U+0C58 TELUGU LETTER TSA tsa
U+0C59 TELUGU LETTER DZA dza
U+0C5A TELUGU LETTER RRRA rra
U+0C5D TELUGU LETTER NAKAARA POLLU n
U+0C62 TELUGU VOWEL SIGN VOCALIC L l
U+0C63 TELUGU VOWEL SIGN VOCALIC LL l
U+0C78 TELUGU FRACTION DIGIT ZERO FOR ODD POWERS OF FOUR 0
U+0C79 TELUGU FRACTION DIGIT ONE FOR ODD POWERS OF FOUR 1
U+0C7A TELUGU FRACTION DIGIT TWO FOR ODD POWERS OF FOUR 2
U+0C7B TELUGU FRACTION DIGIT THREE FOR ODD POWERS OF FOUR 3
U+0C7C TELUGU FRACTION DIGIT ONE FOR EVEN POWERS OF FOUR 1
U+0C7D TELUGU FRACTION DIGIT TWO FOR EVEN POWERS OF FOUR 2
U+0C7E TELUGU FRACTION DIGIT THREE FOR EVEN POWERS OF FOUR 3

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+0C55 TELUGU LENGTH MARK +
U+0C56 TELUGU AI LENGTH MARK +

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0C01 TELUGU SIGN CANDRABINDU m N n
U+0C02 TELUGU SIGN ANUSVARA m N m
U+0C03 TELUGU SIGN VISARGA h H h
U+0C08 TELUGU LETTER II i ii i
U+0C0A TELUGU LETTER UU u uu u
U+0C0B TELUGU LETTER VOCALIC R r R r
U+0C0C TELUGU LETTER VOCALIC L l L l
U+0C0F TELUGU LETTER EE e ee e
U+0C13 TELUGU LETTER OO o oo o
U+0C15 TELUGU LETTER KA ka k k
U+0C16 TELUGU LETTER KHA kha kh kh
U+0C17 TELUGU LETTER GA ga g g
U+0C18 TELUGU LETTER GHA gha gh gh
U+0C19 TELUGU LETTER NGA nga ng n
U+0C1A TELUGU LETTER CA cha c c
U+0C1B TELUGU LETTER CHA chha ch ch
U+0C1C TELUGU LETTER JA ja j j
U+0C1D TELUGU LETTER JHA jha jh jh
U+0C1E TELUGU LETTER NYA nya ny n
U+0C1F TELUGU LETTER TTA ta tt t
U+0C20 TELUGU LETTER TTHA tha tth th
U+0C21 TELUGU LETTER DDA da dd d
U+0C22 TELUGU LETTER DDHA dha ddh dh
U+0C23 TELUGU LETTER NNA na nn n
U+0C24 TELUGU LETTER TA ta t t
U+0C25 TELUGU LETTER THA tha th th
U+0C26 TELUGU LETTER DA da d d
U+0C27 TELUGU LETTER DHA dha dh dh
U+0C28 TELUGU LETTER NA na n n
U+0C2A TELUGU LETTER PA pa p p
U+0C2B TELUGU LETTER PHA pha ph ph
U+0C2C TELUGU LETTER BA ba b b
U+0C2D TELUGU LETTER BHA bha bh bh
U+0C2E TELUGU LETTER MA ma m m
U+0C2F TELUGU LETTER YA ya y y
U+0C30 TELUGU LETTER RA ra r r
U+0C31 TELUGU LETTER RRA ra rr r
U+0C32 TELUGU LETTER LA la l l
U+0C33 TELUGU LETTER LLA la ll l
U+0C35 TELUGU LETTER VA va v v
U+0C36 TELUGU LETTER SHA sha sh s
U+0C37 TELUGU LETTER SSA sha ss s
U+0C38 TELUGU LETTER SA sa s s
U+0C39 TELUGU LETTER HA ha h h
U+0C3E TELUGU VOWEL SIGN AA a aa a
U+0C40 TELUGU VOWEL SIGN II i ii i
U+0C42 TELUGU VOWEL SIGN UU u uu u
U+0C43 TELUGU VOWEL SIGN VOCALIC R r R r
U+0C44 TELUGU VOWEL SIGN VOCALIC RR r RR r
U+0C47 TELUGU VOWEL SIGN EE e ee e
...3 more differences

gu — Gujarati

Block: 91 assigned codepoints, 87 mapped by at least one library.

Coverage: translit maps 83/87, Unidecode maps 77/87. 7 mapped only by translit, 1 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0A8C GUJARATI LETTER VOCALIC L l
U+0AE1 GUJARATI LETTER VOCALIC LL l
U+0AE2 GUJARATI VOWEL SIGN VOCALIC L l
U+0AE3 GUJARATI VOWEL SIGN VOCALIC LL l
U+0AF0 GUJARATI ABBREVIATION SIGN .
U+0AF1 GUJARATI RUPEE SIGN Rs
U+0AF9 GUJARATI LETTER ZHA zha

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+0ABC GUJARATI SIGN NUKTA '

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0A81 GUJARATI SIGN CANDRABINDU m N m
U+0A82 GUJARATI SIGN ANUSVARA m N m
U+0A83 GUJARATI SIGN VISARGA h H h
U+0A88 GUJARATI LETTER II i ii i
U+0A8A GUJARATI LETTER UU u uu u
U+0A8B GUJARATI LETTER VOCALIC R r R r
U+0A8D GUJARATI VOWEL CANDRA E e eN e
U+0A91 GUJARATI VOWEL CANDRA O o oN o
U+0A95 GUJARATI LETTER KA ka k k
U+0A96 GUJARATI LETTER KHA kha kh kh
U+0A97 GUJARATI LETTER GA ga g g
U+0A98 GUJARATI LETTER GHA gha gh gh
U+0A99 GUJARATI LETTER NGA nga ng n
U+0A9A GUJARATI LETTER CA cha c c
U+0A9B GUJARATI LETTER CHA chha ch ch
U+0A9C GUJARATI LETTER JA ja j j
U+0A9D GUJARATI LETTER JHA jha jh jh
U+0A9E GUJARATI LETTER NYA nya ny n
U+0A9F GUJARATI LETTER TTA ta tt t
U+0AA0 GUJARATI LETTER TTHA tha tth th
U+0AA1 GUJARATI LETTER DDA da dd d
U+0AA2 GUJARATI LETTER DDHA dha ddh dh
U+0AA3 GUJARATI LETTER NNA na nn n
U+0AA4 GUJARATI LETTER TA ta t t
U+0AA5 GUJARATI LETTER THA tha th th
U+0AA6 GUJARATI LETTER DA da d d
U+0AA7 GUJARATI LETTER DHA dha dh dh
U+0AA8 GUJARATI LETTER NA na n n
U+0AAA GUJARATI LETTER PA pa p p
U+0AAB GUJARATI LETTER PHA pha ph ph
U+0AAC GUJARATI LETTER BA ba b b
U+0AAD GUJARATI LETTER BHA bha bh bh
U+0AAE GUJARATI LETTER MA ma m m
U+0AB0 GUJARATI LETTER RA ra r r
U+0AB2 GUJARATI LETTER LA la l l
U+0AB3 GUJARATI LETTER LLA la ll l
U+0AB5 GUJARATI LETTER VA va v v
U+0AB6 GUJARATI LETTER SHA sha sh s
U+0AB7 GUJARATI LETTER SSA sha ss s
U+0AB8 GUJARATI LETTER SA sa s s
U+0AB9 GUJARATI LETTER HA ha h h
U+0ABE GUJARATI VOWEL SIGN AA a aa a
U+0AC0 GUJARATI VOWEL SIGN II i ii i
U+0AC2 GUJARATI VOWEL SIGN UU u uu u
U+0AC3 GUJARATI VOWEL SIGN VOCALIC R r R r
U+0AC4 GUJARATI VOWEL SIGN VOCALIC RR r RR r
U+0AC5 GUJARATI VOWEL SIGN CANDRA E e eN e
U+0AC9 GUJARATI VOWEL SIGN CANDRA O o oN o
U+0AD0 GUJARATI OM om AUM Om
U+0AE0 GUJARATI LETTER VOCALIC RR r RR r

kn — Kannada

Block: 91 assigned codepoints, 90 mapped by at least one library.

Coverage: translit maps 85/90, Unidecode maps 79/90. 8 mapped only by translit, 2 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0C80 KANNADA SIGN SPACING CANDRABINDU m
U+0C81 KANNADA SIGN CANDRABINDU m
U+0CBD KANNADA SIGN AVAGRAHA '
U+0CDD KANNADA LETTER NAKAARA POLLU n
U+0CE2 KANNADA VOWEL SIGN VOCALIC L l
U+0CE3 KANNADA VOWEL SIGN VOCALIC LL l
U+0CF1 KANNADA SIGN JIHVAMULIYA h
U+0CF2 KANNADA SIGN UPADHMANIYA h

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+0CD5 KANNADA LENGTH MARK +
U+0CD6 KANNADA AI LENGTH MARK +

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0C82 KANNADA SIGN ANUSVARA m N m
U+0C83 KANNADA SIGN VISARGA h H h
U+0C88 KANNADA LETTER II i ii i
U+0C8A KANNADA LETTER UU u uu u
U+0C8B KANNADA LETTER VOCALIC R r R r
U+0C8C KANNADA LETTER VOCALIC L l L l
U+0C8F KANNADA LETTER EE e ee e
U+0C93 KANNADA LETTER OO o oo o
U+0C95 KANNADA LETTER KA ka k k
U+0C96 KANNADA LETTER KHA kha kh kh
U+0C97 KANNADA LETTER GA ga g g
U+0C98 KANNADA LETTER GHA gha gh gh
U+0C99 KANNADA LETTER NGA nga ng n
U+0C9A KANNADA LETTER CA cha c c
U+0C9B KANNADA LETTER CHA chha ch ch
U+0C9C KANNADA LETTER JA ja j j
U+0C9D KANNADA LETTER JHA jha jh jh
U+0C9E KANNADA LETTER NYA nya ny n
U+0C9F KANNADA LETTER TTA ta tt t
U+0CA0 KANNADA LETTER TTHA tha tth th
U+0CA1 KANNADA LETTER DDA da dd d
U+0CA2 KANNADA LETTER DDHA dha ddh dh
U+0CA3 KANNADA LETTER NNA na nn n
U+0CA4 KANNADA LETTER TA ta t t
U+0CA5 KANNADA LETTER THA tha th th
U+0CA6 KANNADA LETTER DA da d d
U+0CA7 KANNADA LETTER DHA dha dh dh
U+0CA8 KANNADA LETTER NA na n n
U+0CAA KANNADA LETTER PA pa p p
U+0CAB KANNADA LETTER PHA pha ph ph
U+0CAC KANNADA LETTER BA ba b b
U+0CAD KANNADA LETTER BHA bha bh bh
U+0CAE KANNADA LETTER MA ma m m
U+0CAF KANNADA LETTER YA ya y y
U+0CB0 KANNADA LETTER RA ra r r
U+0CB1 KANNADA LETTER RRA ra rr r
U+0CB2 KANNADA LETTER LA la l l
U+0CB3 KANNADA LETTER LLA la ll l
U+0CB5 KANNADA LETTER VA va v v
U+0CB6 KANNADA LETTER SHA sha sh s
U+0CB7 KANNADA LETTER SSA sha ss s
U+0CB8 KANNADA LETTER SA sa s s
U+0CB9 KANNADA LETTER HA ha h h
U+0CBE KANNADA VOWEL SIGN AA a aa a
U+0CC0 KANNADA VOWEL SIGN II i ii i
U+0CC2 KANNADA VOWEL SIGN UU u uu u
U+0CC3 KANNADA VOWEL SIGN VOCALIC R r R r
U+0CC4 KANNADA VOWEL SIGN VOCALIC RR r RR r
U+0CC7 KANNADA VOWEL SIGN EE e ee e
U+0CCB KANNADA VOWEL SIGN OO o oo o
...3 more differences

ml — Malayalam

Block: 118 assigned codepoints, 115 mapped by at least one library.

Coverage: translit maps 111/115, Unidecode maps 77/115. 35 mapped only by translit, 1 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0D01 MALAYALAM SIGN CANDRABINDU m
U+0D04 MALAYALAM LETTER VEDIC ANUSVARA a
U+0D29 MALAYALAM LETTER NNNA nna
U+0D3A MALAYALAM LETTER TTTA tta
U+0D3D MALAYALAM SIGN AVAGRAHA '
U+0D44 MALAYALAM VOWEL SIGN VOCALIC RR r
U+0D4E MALAYALAM LETTER DOT REPH r
U+0D54 MALAYALAM LETTER CHILLU M m
U+0D55 MALAYALAM LETTER CHILLU Y y
U+0D56 MALAYALAM LETTER CHILLU LLL l
U+0D58 MALAYALAM FRACTION ONE ONE-HUNDRED-AND-SIXTIETH 1/160
U+0D59 MALAYALAM FRACTION ONE FORTIETH 1/40
U+0D5A MALAYALAM FRACTION THREE EIGHTIETHS 3/80
U+0D5B MALAYALAM FRACTION ONE TWENTIETH 1/20
U+0D5C MALAYALAM FRACTION ONE TENTH 1/10
U+0D5D MALAYALAM FRACTION THREE TWENTIETHS 3/20
U+0D5E MALAYALAM FRACTION ONE FIFTH 1/5
U+0D5F MALAYALAM LETTER ARCHAIC II ii
U+0D62 MALAYALAM VOWEL SIGN VOCALIC L l
U+0D63 MALAYALAM VOWEL SIGN VOCALIC LL l
U+0D70 MALAYALAM NUMBER TEN 10
U+0D71 MALAYALAM NUMBER ONE HUNDRED 100
U+0D72 MALAYALAM NUMBER ONE THOUSAND 1000
U+0D73 MALAYALAM FRACTION ONE QUARTER 1/4
U+0D74 MALAYALAM FRACTION ONE HALF 1/2
U+0D75 MALAYALAM FRACTION THREE QUARTERS 3/4
U+0D76 MALAYALAM FRACTION ONE SIXTEENTH 1/16
U+0D77 MALAYALAM FRACTION ONE EIGHTH 1/8
U+0D78 MALAYALAM FRACTION THREE SIXTEENTHS 3/16
U+0D7A MALAYALAM LETTER CHILLU NN n
...5 more

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+0D57 MALAYALAM AU LENGTH MARK +

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0D02 MALAYALAM SIGN ANUSVARA m N m
U+0D03 MALAYALAM SIGN VISARGA h H h
U+0D08 MALAYALAM LETTER II i ii i
U+0D0A MALAYALAM LETTER UU u uu u
U+0D0B MALAYALAM LETTER VOCALIC R r R r
U+0D0C MALAYALAM LETTER VOCALIC L l L l
U+0D0F MALAYALAM LETTER EE e ee e
U+0D13 MALAYALAM LETTER OO o oo o
U+0D15 MALAYALAM LETTER KA ka k k
U+0D16 MALAYALAM LETTER KHA kha kh kh
U+0D17 MALAYALAM LETTER GA ga g g
U+0D18 MALAYALAM LETTER GHA gha gh gh
U+0D19 MALAYALAM LETTER NGA nga ng n
U+0D1A MALAYALAM LETTER CA cha c c
U+0D1B MALAYALAM LETTER CHA chha ch ch
U+0D1C MALAYALAM LETTER JA ja j j
U+0D1D MALAYALAM LETTER JHA jha jh jh
U+0D1E MALAYALAM LETTER NYA nya ny n
U+0D1F MALAYALAM LETTER TTA ta tt t
U+0D20 MALAYALAM LETTER TTHA tha tth th
U+0D21 MALAYALAM LETTER DDA da dd d
U+0D22 MALAYALAM LETTER DDHA dha ddh dh
U+0D23 MALAYALAM LETTER NNA na nn n
U+0D24 MALAYALAM LETTER TA ta t t
U+0D25 MALAYALAM LETTER THA tha th th
U+0D26 MALAYALAM LETTER DA da d d
U+0D27 MALAYALAM LETTER DHA dha dh dh
U+0D28 MALAYALAM LETTER NA na n n
U+0D2A MALAYALAM LETTER PA pa p p
U+0D2B MALAYALAM LETTER PHA pha ph ph
U+0D2C MALAYALAM LETTER BA ba b b
U+0D2D MALAYALAM LETTER BHA bha bh bh
U+0D2E MALAYALAM LETTER MA ma m m
U+0D2F MALAYALAM LETTER YA ya y y
U+0D30 MALAYALAM LETTER RA ra r r
U+0D31 MALAYALAM LETTER RRA ra rr r
U+0D32 MALAYALAM LETTER LA la l l
U+0D33 MALAYALAM LETTER LLA la ll l
U+0D34 MALAYALAM LETTER LLLA zha lll l
U+0D35 MALAYALAM LETTER VA va v v
U+0D36 MALAYALAM LETTER SHA sha sh s
U+0D37 MALAYALAM LETTER SSA sha ss s
U+0D38 MALAYALAM LETTER SA sa s s
U+0D39 MALAYALAM LETTER HA ha h h
U+0D3E MALAYALAM VOWEL SIGN AA a aa a
U+0D40 MALAYALAM VOWEL SIGN II i ii i
U+0D42 MALAYALAM VOWEL SIGN UU u uu u
U+0D43 MALAYALAM VOWEL SIGN VOCALIC R r R r
U+0D47 MALAYALAM VOWEL SIGN EE e ee e
U+0D4B MALAYALAM VOWEL SIGN OO o oo o
...2 more differences

mr — Marathi

Block: 128 assigned codepoints, 127 mapped by at least one library.
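
The block count can be reproduced approximately with the Python standard library. The sketch below assumes the Devanagari block spans U+0900 to U+097F and applies the skip rules from the methodology through Unicode general categories. `assigned_codepoints` is a hypothetical helper, not part of any of the three libraries, and the result depends on the Unicode version bundled with the interpreter.

```python
import unicodedata

# General categories skipped by the methodology:
# unassigned (Cn), private use (Co), surrogate (Cs), format (Cf).
SKIPPED_CATEGORIES = {"Cn", "Co", "Cs", "Cf"}


def assigned_codepoints(start: int, end: int) -> list:
    """Return the codepoints in [start, end] that are not in a skipped category."""
    return [
        cp
        for cp in range(start, end + 1)
        if unicodedata.category(chr(cp)) not in SKIPPED_CATEGORIES
    ]


# Devanagari block (assumed range U+0900..U+097F).
devanagari = assigned_codepoints(0x0900, 0x097F)
print(len(devanagari))
```

Blocks that gained characters in recent Unicode versions may report a smaller count on an older interpreter.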

Coverage: translit maps 117/127, Unidecode maps 103/127. 19 mapped only by translit, 5 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0904 DEVANAGARI LETTER SHORT A a
U+0955 DEVANAGARI VOWEL SIGN CANDRA LONG E e
U+0956 DEVANAGARI VOWEL SIGN UE u
U+0957 DEVANAGARI VOWEL SIGN UUE u
U+0971 DEVANAGARI SIGN HIGH SPACING DOT .
U+0972 DEVANAGARI LETTER CANDRA A a
U+0973 DEVANAGARI LETTER OE oe
U+0974 DEVANAGARI LETTER OOE ooe
U+0975 DEVANAGARI LETTER AW aw
U+0976 DEVANAGARI LETTER UE ue
U+0977 DEVANAGARI LETTER UUE uue
U+0978 DEVANAGARI LETTER MARWARI DDA dda
U+0979 DEVANAGARI LETTER ZHA zha
U+097A DEVANAGARI LETTER HEAVY YA ya
U+097B DEVANAGARI LETTER GGA gga
U+097C DEVANAGARI LETTER JJA jja
U+097D DEVANAGARI LETTER GLOTTAL STOP '
U+097E DEVANAGARI LETTER DDDA ddda
ॿ U+097F DEVANAGARI LETTER BBA bba

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+093C DEVANAGARI SIGN NUKTA '
U+0951 DEVANAGARI STRESS SIGN UDATTA '
U+0952 DEVANAGARI STRESS SIGN ANUDATTA '
U+0953 DEVANAGARI GRAVE ACCENT `
U+0954 DEVANAGARI ACUTE ACCENT '

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0901 DEVANAGARI SIGN CANDRABINDU m N m
U+0902 DEVANAGARI SIGN ANUSVARA m N m
U+0903 DEVANAGARI SIGN VISARGA h H h
U+0908 DEVANAGARI LETTER II i ii i
U+090A DEVANAGARI LETTER UU u uu u
U+090B DEVANAGARI LETTER VOCALIC R r R r
U+090C DEVANAGARI LETTER VOCALIC L l L l
U+090D DEVANAGARI LETTER CANDRA E e eN e
U+0911 DEVANAGARI LETTER CANDRA O o oN o
U+0915 DEVANAGARI LETTER KA ka k k
U+0916 DEVANAGARI LETTER KHA kha kh kh
U+0917 DEVANAGARI LETTER GA ga g g
U+0918 DEVANAGARI LETTER GHA gha gh gh
U+0919 DEVANAGARI LETTER NGA nga ng n
U+091A DEVANAGARI LETTER CA cha c c
U+091B DEVANAGARI LETTER CHA chha ch ch
U+091C DEVANAGARI LETTER JA ja j j
U+091D DEVANAGARI LETTER JHA jha jh jh
U+091E DEVANAGARI LETTER NYA nya ny n
U+091F DEVANAGARI LETTER TTA ta tt t
U+0920 DEVANAGARI LETTER TTHA tha tth th
U+0921 DEVANAGARI LETTER DDA da dd d
U+0922 DEVANAGARI LETTER DDHA dha ddh dh
U+0923 DEVANAGARI LETTER NNA na nn n
U+0924 DEVANAGARI LETTER TA ta t t
U+0925 DEVANAGARI LETTER THA tha th th
U+0926 DEVANAGARI LETTER DA da d d
U+0927 DEVANAGARI LETTER DHA dha dh dh
U+0928 DEVANAGARI LETTER NA na n n
U+0929 DEVANAGARI LETTER NNNA na nnn n
U+092A DEVANAGARI LETTER PA pa p p
U+092B DEVANAGARI LETTER PHA pha ph ph
U+092C DEVANAGARI LETTER BA ba b b
U+092D DEVANAGARI LETTER BHA bha bh bh
U+092E DEVANAGARI LETTER MA ma m m
U+092F DEVANAGARI LETTER YA ya y y
U+0930 DEVANAGARI LETTER RA ra r r
U+0931 DEVANAGARI LETTER RRA ra rr r
U+0932 DEVANAGARI LETTER LA la l l
U+0933 DEVANAGARI LETTER LLA la l l
U+0934 DEVANAGARI LETTER LLLA la lll l
U+0935 DEVANAGARI LETTER VA va v v
U+0936 DEVANAGARI LETTER SHA sha sh s
U+0937 DEVANAGARI LETTER SSA sha ss s
U+0938 DEVANAGARI LETTER SA sa s s
U+0939 DEVANAGARI LETTER HA ha h h
U+093E DEVANAGARI VOWEL SIGN AA a aa a
U+0940 DEVANAGARI VOWEL SIGN II i ii i
U+0942 DEVANAGARI VOWEL SIGN UU u uu u
U+0943 DEVANAGARI VOWEL SIGN VOCALIC R r R r
...18 more differences

ne — Nepali

Block: 128 assigned codepoints, 127 mapped by at least one library.

Coverage: translit maps 117/127, Unidecode maps 103/127. 19 mapped only by translit, 5 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0904 DEVANAGARI LETTER SHORT A a
U+0955 DEVANAGARI VOWEL SIGN CANDRA LONG E e
U+0956 DEVANAGARI VOWEL SIGN UE u
U+0957 DEVANAGARI VOWEL SIGN UUE u
U+0971 DEVANAGARI SIGN HIGH SPACING DOT .
U+0972 DEVANAGARI LETTER CANDRA A a
U+0973 DEVANAGARI LETTER OE oe
U+0974 DEVANAGARI LETTER OOE ooe
U+0975 DEVANAGARI LETTER AW aw
U+0976 DEVANAGARI LETTER UE ue
U+0977 DEVANAGARI LETTER UUE uue
U+0978 DEVANAGARI LETTER MARWARI DDA dda
U+0979 DEVANAGARI LETTER ZHA zha
U+097A DEVANAGARI LETTER HEAVY YA ya
U+097B DEVANAGARI LETTER GGA gga
U+097C DEVANAGARI LETTER JJA jja
U+097D DEVANAGARI LETTER GLOTTAL STOP '
U+097E DEVANAGARI LETTER DDDA ddda
ॿ U+097F DEVANAGARI LETTER BBA bba

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+093C DEVANAGARI SIGN NUKTA '
U+0951 DEVANAGARI STRESS SIGN UDATTA '
U+0952 DEVANAGARI STRESS SIGN ANUDATTA '
U+0953 DEVANAGARI GRAVE ACCENT `
U+0954 DEVANAGARI ACUTE ACCENT '

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0901 DEVANAGARI SIGN CANDRABINDU m N m
U+0902 DEVANAGARI SIGN ANUSVARA m N m
U+0903 DEVANAGARI SIGN VISARGA h H h
U+0908 DEVANAGARI LETTER II i ii i
U+090A DEVANAGARI LETTER UU u uu u
U+090B DEVANAGARI LETTER VOCALIC R r R r
U+090C DEVANAGARI LETTER VOCALIC L l L l
U+090D DEVANAGARI LETTER CANDRA E e eN e
U+0911 DEVANAGARI LETTER CANDRA O o oN o
U+0915 DEVANAGARI LETTER KA ka k k
U+0916 DEVANAGARI LETTER KHA kha kh kh
U+0917 DEVANAGARI LETTER GA ga g g
U+0918 DEVANAGARI LETTER GHA gha gh gh
U+0919 DEVANAGARI LETTER NGA nga ng n
U+091A DEVANAGARI LETTER CA cha c c
U+091B DEVANAGARI LETTER CHA chha ch ch
U+091C DEVANAGARI LETTER JA ja j j
U+091D DEVANAGARI LETTER JHA jha jh jh
U+091E DEVANAGARI LETTER NYA nya ny n
U+091F DEVANAGARI LETTER TTA ta tt t
U+0920 DEVANAGARI LETTER TTHA tha tth th
U+0921 DEVANAGARI LETTER DDA da dd d
U+0922 DEVANAGARI LETTER DDHA dha ddh dh
U+0923 DEVANAGARI LETTER NNA na nn n
U+0924 DEVANAGARI LETTER TA ta t t
U+0925 DEVANAGARI LETTER THA tha th th
U+0926 DEVANAGARI LETTER DA da d d
U+0927 DEVANAGARI LETTER DHA dha dh dh
U+0928 DEVANAGARI LETTER NA na n n
U+0929 DEVANAGARI LETTER NNNA na nnn n
U+092A DEVANAGARI LETTER PA pa p p
U+092B DEVANAGARI LETTER PHA pha ph ph
U+092C DEVANAGARI LETTER BA ba b b
U+092D DEVANAGARI LETTER BHA bha bh bh
U+092E DEVANAGARI LETTER MA ma m m
U+092F DEVANAGARI LETTER YA ya y y
U+0930 DEVANAGARI LETTER RA ra r r
U+0931 DEVANAGARI LETTER RRA ra rr r
U+0932 DEVANAGARI LETTER LA la l l
U+0933 DEVANAGARI LETTER LLA la l l
U+0934 DEVANAGARI LETTER LLLA la lll l
U+0935 DEVANAGARI LETTER VA va v v
U+0936 DEVANAGARI LETTER SHA sha sh s
U+0937 DEVANAGARI LETTER SSA sha ss s
U+0938 DEVANAGARI LETTER SA sa s s
U+0939 DEVANAGARI LETTER HA ha h h
U+093E DEVANAGARI VOWEL SIGN AA a aa a
U+0940 DEVANAGARI VOWEL SIGN II i ii i
U+0942 DEVANAGARI VOWEL SIGN UU u uu u
U+0943 DEVANAGARI VOWEL SIGN VOCALIC R r R r
...18 more differences

or — Odia

Block: 91 assigned codepoints, 90 mapped by at least one library.

Coverage: translit maps 86/90, Unidecode maps 77/90. 12 mapped only by translit, 3 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0B35 ORIYA LETTER VA va
U+0B44 ORIYA VOWEL SIGN VOCALIC RR r
U+0B55 ORIYA SIGN OVERLINE e
U+0B62 ORIYA VOWEL SIGN VOCALIC L l
U+0B63 ORIYA VOWEL SIGN VOCALIC LL l
U+0B71 ORIYA LETTER WA wa
U+0B72 ORIYA FRACTION ONE QUARTER 1/4
U+0B73 ORIYA FRACTION ONE HALF 1/2
U+0B74 ORIYA FRACTION THREE QUARTERS 3/4
U+0B75 ORIYA FRACTION ONE SIXTEENTH 1/16
U+0B76 ORIYA FRACTION ONE EIGHTH 1/8
U+0B77 ORIYA FRACTION THREE SIXTEENTHS 3/16
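
The fraction outputs listed above match the numeric values that the Unicode Character Database records for these characters. The sketch below shows one way such ASCII forms could be derived from the standard library. It is an illustration only and does not claim to be how translit produces them; `numeric_to_ascii` is a hypothetical helper.

```python
import unicodedata
from fractions import Fraction


def numeric_to_ascii(ch: str) -> str:
    """Render a character's Unicode numeric value as ASCII, or "" if it has none."""
    value = unicodedata.numeric(ch, None)
    if value is None:
        return ""
    # The numeric property comes back as a float; recover a small exact fraction.
    return str(Fraction(value).limit_denominator(1000))


# Odia fraction signs U+0B72..U+0B77.
for cp in range(0x0B72, 0x0B78):
    print(f"U+{cp:04X}", numeric_to_ascii(chr(cp)))
```

Characters without a numeric property, such as ordinary letters, yield an empty string here.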

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+0B3C ORIYA SIGN NUKTA '
U+0B56 ORIYA AI LENGTH MARK +
U+0B57 ORIYA AU LENGTH MARK +

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0B01 ORIYA SIGN CANDRABINDU m N m
U+0B02 ORIYA SIGN ANUSVARA m N m
U+0B03 ORIYA SIGN VISARGA h H h
U+0B08 ORIYA LETTER II i ii i
U+0B0A ORIYA LETTER UU u uu u
U+0B0B ORIYA LETTER VOCALIC R r R r
U+0B0C ORIYA LETTER VOCALIC L l L l
U+0B15 ORIYA LETTER KA ka k k
U+0B16 ORIYA LETTER KHA kha kh kh
U+0B17 ORIYA LETTER GA ga g g
U+0B18 ORIYA LETTER GHA gha gh gh
U+0B19 ORIYA LETTER NGA nga ng n
U+0B1A ORIYA LETTER CA cha c c
U+0B1B ORIYA LETTER CHA chha ch ch
U+0B1C ORIYA LETTER JA ja j j
U+0B1D ORIYA LETTER JHA jha jh jh
U+0B1E ORIYA LETTER NYA nya ny n
U+0B1F ORIYA LETTER TTA ta tt t
U+0B20 ORIYA LETTER TTHA tha tth th
U+0B21 ORIYA LETTER DDA da dd d
U+0B22 ORIYA LETTER DDHA dha ddh dh
U+0B23 ORIYA LETTER NNA na nn n
U+0B24 ORIYA LETTER TA ta t t
U+0B25 ORIYA LETTER THA tha th th
U+0B26 ORIYA LETTER DA da d d
U+0B27 ORIYA LETTER DHA dha dh dh
U+0B28 ORIYA LETTER NA na n n
U+0B2A ORIYA LETTER PA pa p p
U+0B2B ORIYA LETTER PHA pha ph ph
U+0B2C ORIYA LETTER BA ba b b
U+0B2D ORIYA LETTER BHA bha bh bh
U+0B2E ORIYA LETTER MA ma m m
U+0B2F ORIYA LETTER YA ya y y
U+0B30 ORIYA LETTER RA ra r r
U+0B32 ORIYA LETTER LA la l l
U+0B33 ORIYA LETTER LLA la ll l
U+0B36 ORIYA LETTER SHA sha sh s
U+0B37 ORIYA LETTER SSA sha ss s
U+0B38 ORIYA LETTER SA sa s s
U+0B39 ORIYA LETTER HA ha h h
U+0B3E ORIYA VOWEL SIGN AA a aa a
U+0B40 ORIYA VOWEL SIGN II i ii i
U+0B42 ORIYA VOWEL SIGN UU u uu u
U+0B43 ORIYA VOWEL SIGN VOCALIC R r R r
U+0B5C ORIYA LETTER RRA da rr r
U+0B5D ORIYA LETTER RHA dha rh rh
U+0B5F ORIYA LETTER YYA ya yy y
U+0B60 ORIYA LETTER VOCALIC RR r RR r
U+0B61 ORIYA LETTER VOCALIC LL l LL l

pa — Punjabi

Block: 80 assigned codepoints, 78 mapped by at least one library.

Coverage: translit maps 74/78, Unidecode maps 72/78. 5 mapped only by translit, 3 mapped only by Unidecode.
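
The four figures on a coverage line constrain each other: subtracting each library's "only" count from its mapped count should give the same number, namely the characters that both translit and Unidecode map. A minimal check using the Punjabi figures above (the helper name `shared_count` is illustrative):

```python
def shared_count(mapped: int, only: int) -> int:
    """Characters this library maps that the other library maps as well."""
    return mapped - only


# Figures from the Punjabi coverage line.
translit_mapped, translit_only = 74, 5
unidecode_mapped, unidecode_only = 72, 3

shared_by_translit = shared_count(translit_mapped, translit_only)
shared_by_unidecode = shared_count(unidecode_mapped, unidecode_only)

# Both subtractions describe the same set, so they must agree.
assert shared_by_translit == shared_by_unidecode
print(shared_by_translit)  # 69
```

The union of the two libraries is then 69 + 5 + 3 = 77, one short of the 78 characters mapped by at least one library. Presumably that remaining character is mapped by anyascii alone, though the coverage line does not say so directly.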

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0A01 GURMUKHI SIGN ADAK BINDI m
U+0A03 GURMUKHI SIGN VISARGA h
U+0A72 GURMUKHI IRI iri
U+0A73 GURMUKHI URA ura
U+0A76 GURMUKHI ABBREVIATION SIGN .

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+0A3C GURMUKHI SIGN NUKTA '
U+0A70 GURMUKHI TIPPI N
U+0A71 GURMUKHI ADDAK H

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0A02 GURMUKHI SIGN BINDI m N m
U+0A08 GURMUKHI LETTER II i ii i
U+0A0A GURMUKHI LETTER UU u uu u
U+0A0F GURMUKHI LETTER EE e ee e
U+0A13 GURMUKHI LETTER OO o oo o
U+0A15 GURMUKHI LETTER KA ka k k
U+0A16 GURMUKHI LETTER KHA kha kh kh
U+0A17 GURMUKHI LETTER GA ga g g
U+0A18 GURMUKHI LETTER GHA gha gh gh
U+0A19 GURMUKHI LETTER NGA nga ng n
U+0A1A GURMUKHI LETTER CA cha c c
U+0A1B GURMUKHI LETTER CHA chha ch ch
U+0A1C GURMUKHI LETTER JA ja j j
U+0A1D GURMUKHI LETTER JHA jha jh jh
U+0A1E GURMUKHI LETTER NYA nya ny n
U+0A1F GURMUKHI LETTER TTA ta tt t
U+0A20 GURMUKHI LETTER TTHA tha tth th
U+0A21 GURMUKHI LETTER DDA da dd d
U+0A22 GURMUKHI LETTER DDHA dha ddh dh
U+0A23 GURMUKHI LETTER NNA na nn n
U+0A24 GURMUKHI LETTER TA ta t t
U+0A25 GURMUKHI LETTER THA tha th th
U+0A26 GURMUKHI LETTER DA da d d
U+0A27 GURMUKHI LETTER DHA dha dh dh
U+0A28 GURMUKHI LETTER NA na n n
U+0A2A GURMUKHI LETTER PA pa p p
U+0A2B GURMUKHI LETTER PHA pha ph ph
U+0A2C GURMUKHI LETTER BA ba b b
U+0A2D GURMUKHI LETTER BHA bha bb bh
U+0A2E GURMUKHI LETTER MA ma m m
U+0A2F GURMUKHI LETTER YA ya y y
U+0A30 GURMUKHI LETTER RA ra r r
U+0A32 GURMUKHI LETTER LA la l l
U+0A33 GURMUKHI LETTER LLA la ll l
U+0A35 GURMUKHI LETTER VA va v v
U+0A36 GURMUKHI LETTER SHA sha sh s
U+0A38 GURMUKHI LETTER SA sa s s
U+0A39 GURMUKHI LETTER HA ha h h
U+0A3E GURMUKHI VOWEL SIGN AA a aa a
U+0A40 GURMUKHI VOWEL SIGN II i ii i
U+0A42 GURMUKHI VOWEL SIGN UU u uu u
U+0A47 GURMUKHI VOWEL SIGN EE e ee e
U+0A4B GURMUKHI VOWEL SIGN OO o oo o
U+0A59 GURMUKHI LETTER KHHA kha khh kh
U+0A5A GURMUKHI LETTER GHHA ga ghh g
U+0A5B GURMUKHI LETTER ZA za z z
U+0A5C GURMUKHI LETTER RRA ra rr r
U+0A5E GURMUKHI LETTER FA fa f ph
U+0A74 GURMUKHI EK ONKAR ek G.E.O. *

sa — Sanskrit

Block: 128 assigned codepoints, 127 mapped by at least one library.

Coverage: translit maps 117/127, Unidecode maps 103/127. 19 mapped only by translit, 5 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0904 DEVANAGARI LETTER SHORT A a
U+0955 DEVANAGARI VOWEL SIGN CANDRA LONG E e
U+0956 DEVANAGARI VOWEL SIGN UE u
U+0957 DEVANAGARI VOWEL SIGN UUE u
U+0971 DEVANAGARI SIGN HIGH SPACING DOT .
U+0972 DEVANAGARI LETTER CANDRA A a
U+0973 DEVANAGARI LETTER OE oe
U+0974 DEVANAGARI LETTER OOE ooe
U+0975 DEVANAGARI LETTER AW aw
U+0976 DEVANAGARI LETTER UE ue
U+0977 DEVANAGARI LETTER UUE uue
U+0978 DEVANAGARI LETTER MARWARI DDA dda
U+0979 DEVANAGARI LETTER ZHA zha
U+097A DEVANAGARI LETTER HEAVY YA ya
U+097B DEVANAGARI LETTER GGA gga
U+097C DEVANAGARI LETTER JJA jja
U+097D DEVANAGARI LETTER GLOTTAL STOP '
U+097E DEVANAGARI LETTER DDDA ddda
ॿ U+097F DEVANAGARI LETTER BBA bba

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+093C DEVANAGARI SIGN NUKTA '
U+0951 DEVANAGARI STRESS SIGN UDATTA '
U+0952 DEVANAGARI STRESS SIGN ANUDATTA '
U+0953 DEVANAGARI GRAVE ACCENT `
U+0954 DEVANAGARI ACUTE ACCENT '

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0901 DEVANAGARI SIGN CANDRABINDU m N m
U+0902 DEVANAGARI SIGN ANUSVARA m N m
U+0903 DEVANAGARI SIGN VISARGA h H h
U+0908 DEVANAGARI LETTER II i ii i
U+090A DEVANAGARI LETTER UU u uu u
U+090B DEVANAGARI LETTER VOCALIC R r R r
U+090C DEVANAGARI LETTER VOCALIC L l L l
U+090D DEVANAGARI LETTER CANDRA E e eN e
U+0911 DEVANAGARI LETTER CANDRA O o oN o
U+0915 DEVANAGARI LETTER KA ka k k
U+0916 DEVANAGARI LETTER KHA kha kh kh
U+0917 DEVANAGARI LETTER GA ga g g
U+0918 DEVANAGARI LETTER GHA gha gh gh
U+0919 DEVANAGARI LETTER NGA nga ng n
U+091A DEVANAGARI LETTER CA cha c c
U+091B DEVANAGARI LETTER CHA chha ch ch
U+091C DEVANAGARI LETTER JA ja j j
U+091D DEVANAGARI LETTER JHA jha jh jh
U+091E DEVANAGARI LETTER NYA nya ny n
U+091F DEVANAGARI LETTER TTA ta tt t
U+0920 DEVANAGARI LETTER TTHA tha tth th
U+0921 DEVANAGARI LETTER DDA da dd d
U+0922 DEVANAGARI LETTER DDHA dha ddh dh
U+0923 DEVANAGARI LETTER NNA na nn n
U+0924 DEVANAGARI LETTER TA ta t t
U+0925 DEVANAGARI LETTER THA tha th th
U+0926 DEVANAGARI LETTER DA da d d
U+0927 DEVANAGARI LETTER DHA dha dh dh
U+0928 DEVANAGARI LETTER NA na n n
U+0929 DEVANAGARI LETTER NNNA na nnn n
U+092A DEVANAGARI LETTER PA pa p p
U+092B DEVANAGARI LETTER PHA pha ph ph
U+092C DEVANAGARI LETTER BA ba b b
U+092D DEVANAGARI LETTER BHA bha bh bh
U+092E DEVANAGARI LETTER MA ma m m
U+092F DEVANAGARI LETTER YA ya y y
U+0930 DEVANAGARI LETTER RA ra r r
U+0931 DEVANAGARI LETTER RRA ra rr r
U+0932 DEVANAGARI LETTER LA la l l
U+0933 DEVANAGARI LETTER LLA la l l
U+0934 DEVANAGARI LETTER LLLA la lll l
U+0935 DEVANAGARI LETTER VA va v v
U+0936 DEVANAGARI LETTER SHA sha sh s
U+0937 DEVANAGARI LETTER SSA sha ss s
U+0938 DEVANAGARI LETTER SA sa s s
U+0939 DEVANAGARI LETTER HA ha h h
U+093E DEVANAGARI VOWEL SIGN AA a aa a
U+0940 DEVANAGARI VOWEL SIGN II i ii i
U+0942 DEVANAGARI VOWEL SIGN UU u uu u
U+0943 DEVANAGARI VOWEL SIGN VOCALIC R r R r
...18 more differences

as — Assamese

Block: 96 assigned codepoints, 95 mapped by at least one library.

Coverage: translit maps 90/95, Unidecode maps 87/95. 5 mapped only by translit, 2 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0980 BENGALI ANJI m
U+09BD BENGALI SIGN AVAGRAHA '
U+09CE BENGALI LETTER KHANDA TA t
U+09FC BENGALI LETTER VEDIC ANUSVARA m
U+09FD BENGALI ABBREVIATION SIGN .

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+09BC BENGALI SIGN NUKTA '
U+09D7 BENGALI AU LENGTH MARK +

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0981 BENGALI SIGN CANDRABINDU m N m
U+0982 BENGALI SIGN ANUSVARA m N m
U+0983 BENGALI SIGN VISARGA h H h
U+0988 BENGALI LETTER II i ii i
U+098A BENGALI LETTER UU u uu u
U+098B BENGALI LETTER VOCALIC R r R r
U+098C BENGALI LETTER VOCALIC L l RR l
U+0995 BENGALI LETTER KA ka k k
U+0996 BENGALI LETTER KHA kha kh kh
U+0997 BENGALI LETTER GA ga g g
U+0998 BENGALI LETTER GHA gha gh gh
U+0999 BENGALI LETTER NGA nga ng n
U+099A BENGALI LETTER CA cha c c
U+099B BENGALI LETTER CHA chha ch ch
U+099C BENGALI LETTER JA ja j j
U+099D BENGALI LETTER JHA jha jh jh
U+099E BENGALI LETTER NYA nya ny n
U+099F BENGALI LETTER TTA ta tt t
U+09A0 BENGALI LETTER TTHA tha tth th
U+09A1 BENGALI LETTER DDA da dd d
U+09A2 BENGALI LETTER DDHA dha ddh dh
U+09A3 BENGALI LETTER NNA na nn n
U+09A4 BENGALI LETTER TA ta t t
U+09A5 BENGALI LETTER THA tha th th
U+09A6 BENGALI LETTER DA da d d
U+09A7 BENGALI LETTER DHA dha dh dh
U+09A8 BENGALI LETTER NA na n n
U+09AA BENGALI LETTER PA pa p p
U+09AB BENGALI LETTER PHA pha ph ph
U+09AC BENGALI LETTER BA ba b b
U+09AD BENGALI LETTER BHA bha bh bh
U+09AE BENGALI LETTER MA ma m m
U+09AF BENGALI LETTER YA ya y y
U+09B0 BENGALI LETTER RA ra r r
U+09B2 BENGALI LETTER LA la l l
U+09B6 BENGALI LETTER SHA sha sh s
U+09B7 BENGALI LETTER SSA sha ss s
U+09B8 BENGALI LETTER SA sa s s
U+09B9 BENGALI LETTER HA ha h h
U+09BE BENGALI VOWEL SIGN AA a aa a
U+09C0 BENGALI VOWEL SIGN II i ii i
U+09C2 BENGALI VOWEL SIGN UU u uu u
U+09C3 BENGALI VOWEL SIGN VOCALIC R r R r
U+09C4 BENGALI VOWEL SIGN VOCALIC RR r RR r
U+09DC BENGALI LETTER RRA ra rr r
U+09DD BENGALI LETTER RHA rha rh rh
U+09DF BENGALI LETTER YYA ya yy y
U+09E0 BENGALI LETTER VOCALIC RR r RR r
U+09E1 BENGALI LETTER VOCALIC LL l LL l
U+09E2 BENGALI VOWEL SIGN VOCALIC L l L l
...9 more differences
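
Rows in these tables are space-separated, and character names themselves contain spaces, so splitting a row into fields needs a reference for where the name ends. The sketch below uses the standard library's `unicodedata.name` for that. `parse_row` is a hypothetical helper, and it assumes the name printed in the row matches the interpreter's Unicode data. Empty cells cannot be recovered this way, because their position is lost in the flattened text.

```python
import unicodedata


def parse_row(row: str):
    """Split a row such as 'U+0995 BENGALI LETTER KA ka k k' into its fields."""
    tokens = row.split()
    # An optional literal character may precede the codepoint token.
    idx = next(i for i, tok in enumerate(tokens) if tok.startswith("U+"))
    codepoint = int(tokens[idx][2:], 16)
    name_tokens = unicodedata.name(chr(codepoint)).split()
    rest = tokens[idx + 1:]
    if rest[:len(name_tokens)] != name_tokens:
        raise ValueError(f"name in row does not match Unicode data: {row!r}")
    # Whatever follows the name is the per-library output, in column order.
    outputs = rest[len(name_tokens):]
    return codepoint, " ".join(name_tokens), outputs


print(parse_row("U+0995 BENGALI LETTER KA ka k k"))
```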

hy — Armenian

Block: 91 assigned codepoints, 90 mapped by at least one library.

Coverage: translit maps 86/90, Unidecode maps 85/90. 3 mapped only by translit, 2 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
ՠ U+0560 ARMENIAN SMALL LETTER TURNED AYB a
ֈ U+0588 ARMENIAN SMALL LETTER YI WITH STROKE yi
֏ U+058F ARMENIAN DRAM SIGN AMD

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
՛ U+055B ARMENIAN EMPHASIS MARK /
՟ U+055F ARMENIAN ABBREVIATION MARK .

Output differences:

Char Codepoint Name translit Unidecode anyascii
Ը U+0538 ARMENIAN CAPITAL LETTER ET Y E Y
Թ U+0539 ARMENIAN CAPITAL LETTER TO T T` T'
Ո U+0548 ARMENIAN CAPITAL LETTER VO Vo O O
Չ U+0549 ARMENIAN CAPITAL LETTER CHA Ch Ch` Ch'
Ռ U+054C ARMENIAN CAPITAL LETTER RA R Rh Rr
Ց U+0551 ARMENIAN CAPITAL LETTER CO Ts Ts` Ts'
Ւ U+0552 ARMENIAN CAPITAL LETTER YIWN V W W
Փ U+0553 ARMENIAN CAPITAL LETTER PIWR P P` P'
Ք U+0554 ARMENIAN CAPITAL LETTER KEH K K` K'
ՙ U+0559 ARMENIAN MODIFIER LETTER LEFT HALF RING ' < `
ը U+0568 ARMENIAN SMALL LETTER ET y e y
թ U+0569 ARMENIAN SMALL LETTER TO t t` t'
ո U+0578 ARMENIAN SMALL LETTER VO vo o o
չ U+0579 ARMENIAN SMALL LETTER CHA ch ch` ch'
ռ U+057C ARMENIAN SMALL LETTER RA r rh rr
ց U+0581 ARMENIAN SMALL LETTER CO ts ts` ts'
ւ U+0582 ARMENIAN SMALL LETTER YIWN v w w
փ U+0583 ARMENIAN SMALL LETTER PIWR p p` p'
ք U+0584 ARMENIAN SMALL LETTER KEH k k` k'
և U+0587 ARMENIAN SMALL LIGATURE ECH YIWN yev ew ev
։ U+0589 ARMENIAN FULL STOP . : .

ka — Georgian

Block: 88 assigned codepoints, 88 mapped by at least one library.

Coverage: translit maps 87/88, Unidecode maps 78/88. 9 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+10C7 GEORGIAN CAPITAL LETTER YN Yn
U+10CD GEORGIAN CAPITAL LETTER AEN Ae
U+10F7 GEORGIAN LETTER YN yn
U+10F8 GEORGIAN LETTER ELIFI el
U+10F9 GEORGIAN LETTER TURNED GAN g
U+10FA GEORGIAN LETTER AIN '
U+10FC MODIFIER LETTER GEORGIAN NAR n
U+10FD GEORGIAN LETTER AEN ae
U+10FF GEORGIAN LETTER LABIAL SIGN w

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+10A7 GEORGIAN CAPITAL LETTER TAN T T` T
U+10B4 GEORGIAN CAPITAL LETTER PHAR P P` P
U+10B5 GEORGIAN CAPITAL LETTER KHAR K K` K
U+10B6 GEORGIAN CAPITAL LETTER GHAN Gh G' Gh
U+10B9 GEORGIAN CAPITAL LETTER CHIN Ch Ch` Ch
U+10BA GEORGIAN CAPITAL LETTER CAN Ts C` Ts
U+10BB GEORGIAN CAPITAL LETTER JIL Dz Z' Dz
U+10BC GEORGIAN CAPITAL LETTER CIL Ts C Ts'
U+10BE GEORGIAN CAPITAL LETTER XAN Kh X Kh
U+10C1 GEORGIAN CAPITAL LETTER HE He E E
U+10C2 GEORGIAN CAPITAL LETTER HIE Hi Y Y
U+10C4 GEORGIAN CAPITAL LETTER HAR Har Xh X
U+10C5 GEORGIAN CAPITAL LETTER HOE Ho OE O
U+10D7 GEORGIAN LETTER TAN t t` t
U+10E4 GEORGIAN LETTER PHAR p p` p
U+10E5 GEORGIAN LETTER KHAR k k` k
U+10E6 GEORGIAN LETTER GHAN gh g' gh
U+10E9 GEORGIAN LETTER CHIN ch ch` ch
U+10EA GEORGIAN LETTER CAN ts c` ts
U+10EB GEORGIAN LETTER JIL dz z' dz
U+10EC GEORGIAN LETTER CIL ts c ts'
U+10EE GEORGIAN LETTER XAN kh x kh
U+10F1 GEORGIAN LETTER HE he e e
U+10F2 GEORGIAN LETTER HIE hi y y
U+10F4 GEORGIAN LETTER HAR har xh x
U+10F5 GEORGIAN LETTER HOE ho oe o
U+10FB GEORGIAN PARAGRAPH SEPARATOR . // *

si — Sinhala

Block: 91 assigned codepoints, 90 mapped by at least one library.

Coverage: translit maps 90/90, Unidecode maps 79/90. 11 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0D81 SINHALA SIGN CANDRABINDU m
U+0DE6 SINHALA LITH DIGIT ZERO 0
U+0DE7 SINHALA LITH DIGIT ONE 1
U+0DE8 SINHALA LITH DIGIT TWO 2
U+0DE9 SINHALA LITH DIGIT THREE 3
U+0DEA SINHALA LITH DIGIT FOUR 4
U+0DEB SINHALA LITH DIGIT FIVE 5
U+0DEC SINHALA LITH DIGIT SIX 6
U+0DED SINHALA LITH DIGIT SEVEN 7
U+0DEE SINHALA LITH DIGIT EIGHT 8
U+0DEF SINHALA LITH DIGIT NINE 9

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0D82 SINHALA SIGN ANUSVARAYA m N m
U+0D83 SINHALA SIGN VISARGAYA h H h
U+0D8D SINHALA LETTER IRUYANNA ri R r
U+0D8E SINHALA LETTER IRUUYANNA r RR r
U+0D8F SINHALA LETTER ILUYANNA rr L l
U+0D90 SINHALA LETTER ILUUYANNA luu LL l
U+0D9A SINHALA LETTER ALPAPRAANA KAYANNA ka k k
U+0D9B SINHALA LETTER MAHAAPRAANA KAYANNA kha kh kh
U+0D9C SINHALA LETTER ALPAPRAANA GAYANNA ga g g
U+0D9D SINHALA LETTER MAHAAPRAANA GAYANNA gha gh gh
U+0D9E SINHALA LETTER KANTAJA NAASIKYAYA nga ng n
U+0D9F SINHALA LETTER SANYAKA GAYANNA nnga nng ng
U+0DA0 SINHALA LETTER ALPAPRAANA CAYANNA cha c c
U+0DA1 SINHALA LETTER MAHAAPRAANA CAYANNA chha ch ch
U+0DA2 SINHALA LETTER ALPAPRAANA JAYANNA ja j j
U+0DA3 SINHALA LETTER MAHAAPRAANA JAYANNA jha jh jh
U+0DA4 SINHALA LETTER TAALUJA NAASIKYAYA nya ny n
U+0DA5 SINHALA LETTER TAALUJA SANYOOGA NAAKSIKYAYA jnya jny jn
U+0DA6 SINHALA LETTER SANYAKA JAYANNA nyja nyj nj
U+0DA7 SINHALA LETTER ALPAPRAANA TTAYANNA tta tt t
U+0DA8 SINHALA LETTER MAHAAPRAANA TTAYANNA ttha tth th
U+0DA9 SINHALA LETTER ALPAPRAANA DDAYANNA dda dd d
U+0DAA SINHALA LETTER MAHAAPRAANA DDAYANNA ddha ddh dh
U+0DAB SINHALA LETTER MUURDHAJA NAYANNA nna nn n
U+0DAC SINHALA LETTER SANYAKA DDAYANNA nndda nndd nd
U+0DAD SINHALA LETTER ALPAPRAANA TAYANNA ta t t
U+0DAE SINHALA LETTER MAHAAPRAANA TAYANNA tha th th
U+0DAF SINHALA LETTER ALPAPRAANA DAYANNA da d d
U+0DB0 SINHALA LETTER MAHAAPRAANA DAYANNA dha dh dh
U+0DB1 SINHALA LETTER DANTAJA NAYANNA na n n
U+0DB3 SINHALA LETTER SANYAKA DAYANNA nda nd nd
U+0DB4 SINHALA LETTER ALPAPRAANA PAYANNA pa p p
U+0DB5 SINHALA LETTER MAHAAPRAANA PAYANNA pha ph ph
U+0DB6 SINHALA LETTER ALPAPRAANA BAYANNA ba b b
U+0DB7 SINHALA LETTER MAHAAPRAANA BAYANNA bha bh bh
U+0DB8 SINHALA LETTER MAYANNA ma m m
U+0DB9 SINHALA LETTER AMBA BAYANNA mba mb mb
U+0DBA SINHALA LETTER YAYANNA ya y y
U+0DBB SINHALA LETTER RAYANNA ra r r
U+0DBD SINHALA LETTER DANTAJA LAYANNA la l l
U+0DC0 SINHALA LETTER VAYANNA va v v
U+0DC1 SINHALA LETTER TAALUJA SAYANNA sha sh s
U+0DC2 SINHALA LETTER MUURDHAJA SAYANNA sha ss s
U+0DC3 SINHALA LETTER DANTAJA SAYANNA sa s s
U+0DC4 SINHALA LETTER HAYANNA ha h h
U+0DC5 SINHALA LETTER MUURDHAJA LAYANNA lla ll l
U+0DC6 SINHALA LETTER FAYANNA fa f f
U+0DCF SINHALA VOWEL SIGN AELA-PILLA a aa a
U+0DD0 SINHALA VOWEL SIGN KETTI AEDA-PILLA aa ae ae
U+0DD1 SINHALA VOWEL SIGN DIGA AEDA-PILLA ae aae ae
...5 more differences

th — Thai

Block: 87 assigned codepoints, 80 mapped by at least one library.

Coverage: translit maps 78/80, Unidecode maps 80/80. 0 mapped only by translit, 2 mapped only by Unidecode.

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+0E3A THAI CHARACTER PHINTHU '
U+0E46 THAI CHARACTER MAIYAMOK +

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0E08 THAI CHARACTER CHO CHAN ch cch ch
U+0E0B THAI CHARACTER SO SO s ch s
U+0E24 THAI CHARACTER RU rue R rue
U+0E26 THAI CHARACTER LU lue L lue
U+0E2D THAI CHARACTER O ANG o ` o
U+0E2F THAI CHARACTER PAIYANNOI . ~ .
U+0E32 THAI CHARACTER SARA AA a aa a
U+0E35 THAI CHARACTER SARA II i ii i
U+0E37 THAI CHARACTER SARA UEE ue uue ue
U+0E39 THAI CHARACTER SARA UU u uu u
฿ U+0E3F THAI CURRENCY SYMBOL BAHT B Bh. B
U+0E45 THAI CHARACTER LAKKHANGYAO a ao
U+0E4D THAI CHARACTER NIKHAHIT m M m
U+0E4F THAI CHARACTER FONGMAN * *
U+0E5A THAI CHARACTER ANGKHANKHU . // #
U+0E5B THAI CHARACTER KHOMUT . /// @

lo — Lao

Block: 83 assigned codepoints, 76 mapped by at least one library.

Coverage: translit maps 75/76, Unidecode maps 58/76. 18 mapped only by translit, 1 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0E86 LAO LETTER PALI GHA gha
U+0E89 LAO LETTER PALI CHA cha
U+0E8C LAO LETTER PALI JHA jha
U+0E8E LAO LETTER PALI NYA nya
U+0E8F LAO LETTER PALI TTA tta
U+0E90 LAO LETTER PALI TTHA ttha
U+0E91 LAO LETTER PALI DDA dda
U+0E92 LAO LETTER PALI DDHA ddha
U+0E93 LAO LETTER PALI NNA nna
U+0E98 LAO LETTER PALI DHA dha
U+0EA0 LAO LETTER PALI BHA bha
U+0EA8 LAO LETTER SANSKRIT SHA sha
U+0EA9 LAO LETTER SANSKRIT SSA ssa
U+0EAC LAO LETTER PALI LLA lla
U+0EAE LAO LETTER HO TAM h
U+0EB1 LAO VOWEL SIGN MAI KAN a
U+0EDE LAO LETTER KHMU GO go
U+0EDF LAO LETTER KHMU NYO nyo

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+0EC6 LAO KO LA +

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0E95 LAO LETTER TO t h t
U+0EAD LAO LETTER O o `
U+0EAF LAO ELLIPSIS ... ~ ...
U+0EB2 LAO VOWEL SIGN AA a aa a
U+0EB5 LAO VOWEL SIGN II i ii i
U+0EB6 LAO VOWEL SIGN Y ue y u
U+0EB7 LAO VOWEL SIGN YY ue yy u
U+0EB9 LAO VOWEL SIGN UU u uu ou
U+0EBD LAO SEMIVOWEL SIGN NYO y ny y
U+0EC1 LAO VOWEL SIGN EI ae ei e
U+0EC3 LAO VOWEL SIGN AY ai ay ai
U+0ECD LAO NIGGAHITA m M o

km — Khmer

Block: 114 assigned codepoints, 106 mapped by at least one library.

Coverage: translit maps 100/106, Unidecode maps 94/106. 10 mapped only by translit, 4 mapped only by Unidecode.
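
The categories on the coverage line follow from the methodology's definition of "mapped": output that is not empty, not `[?]`, and not the original character. The sketch below applies that rule to a pair of outputs, using example values taken from the Khmer tables in this section. The function names are illustrative, and only translit and Unidecode are compared here.

```python
def is_mapped(original: str, output: str) -> bool:
    """Meaningful output: not empty, not "[?]", and not the input character itself."""
    return output not in ("", "[?]", original)


def classify(original: str, translit_out: str, unidecode_out: str) -> str:
    """Place one character into a coverage category."""
    t = is_mapped(original, translit_out)
    u = is_mapped(original, unidecode_out)
    if t and not u:
        return "translit-only"
    if u and not t:
        return "Unidecode-only"
    if t and u:
        return "same" if translit_out == unidecode_out else "differs"
    return "unmapped"


print(classify("\u17F1", "1", ""))    # KHMER SYMBOL LEK ATTAK MUOY
print(classify("\u17CE", "", "!"))    # KHMER SIGN KAKABAT
print(classify("\u1780", "ka", "k"))  # KHMER LETTER KA
```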

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+17F0 KHMER SYMBOL LEK ATTAK SON 0
U+17F1 KHMER SYMBOL LEK ATTAK MUOY 1
U+17F2 KHMER SYMBOL LEK ATTAK PII 2
U+17F3 KHMER SYMBOL LEK ATTAK BEI 3
U+17F4 KHMER SYMBOL LEK ATTAK BUON 4
U+17F5 KHMER SYMBOL LEK ATTAK PRAM 5
U+17F6 KHMER SYMBOL LEK ATTAK PRAM-MUOY 6
U+17F7 KHMER SYMBOL LEK ATTAK PRAM-PII 7
U+17F8 KHMER SYMBOL LEK ATTAK PRAM-BEI 8
U+17F9 KHMER SYMBOL LEK ATTAK PRAM-BUON 9

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+17B4 KHMER VOWEL INHERENT AQ a
U+17B5 KHMER VOWEL INHERENT AA aa
U+17CE KHMER SIGN KAKABAT !
U+17D7 KHMER SIGN LEK TOO +

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+1780 KHMER LETTER KA ka k k
U+1781 KHMER LETTER KHA kha kh kh
U+1782 KHMER LETTER KO ka g k
U+1783 KHMER LETTER KHO kha gh kh
U+1784 KHMER LETTER NGO nga ng ng
U+1785 KHMER LETTER CA cha c ch
U+1786 KHMER LETTER CHA chha ch chh
U+1787 KHMER LETTER CO cha j ch
U+1788 KHMER LETTER CHO chha jh chh
U+1789 KHMER LETTER NYO nya ny nh
U+178A KHMER LETTER DA da t d
U+178B KHMER LETTER TTHA ttha tth th
U+178C KHMER LETTER DO da d d
U+178D KHMER LETTER TTHO ttha ddh th
U+178E KHMER LETTER NNO na nn n
U+178F KHMER LETTER TA ta t t
U+1790 KHMER LETTER THA tha th th
U+1791 KHMER LETTER TO ta d t
U+1792 KHMER LETTER THO tha dh th
U+1793 KHMER LETTER NO na n n
U+1794 KHMER LETTER BA ba p b
U+1795 KHMER LETTER PHA pha ph ph
U+1796 KHMER LETTER PO pa b p
U+1797 KHMER LETTER PHO pha bh ph
U+1798 KHMER LETTER MO ma m m
U+1799 KHMER LETTER YO ya y y
U+179A KHMER LETTER RO ra r r
U+179B KHMER LETTER LO la l l
U+179C KHMER LETTER VO va v v
U+179D KHMER LETTER SHA sha sh s
U+179E KHMER LETTER SSO sha ss s
U+179F KHMER LETTER SA sa s s
U+17A0 KHMER LETTER HA ha h h
U+17A1 KHMER LETTER LA la l l
U+17A2 KHMER LETTER QA a q '
U+17A4 KHMER INDEPENDENT VOWEL QAA a aa 'a
U+17A5 KHMER INDEPENDENT VOWEL QI e i e
U+17A6 KHMER INDEPENDENT VOWEL QII e ii ei
U+17A9 KHMER INDEPENDENT VOWEL QUU u uu ou
U+17AA KHMER INDEPENDENT VOWEL QUUV u uuv au
U+17AC KHMER INDEPENDENT VOWEL RYY ry ryy rueu
U+17AE KHMER INDEPENDENT VOWEL LYY ly lyy lueu
U+17B1 KHMER INDEPENDENT VOWEL QOO TYPE ONE o oo ao
U+17B2 KHMER INDEPENDENT VOWEL QOO TYPE TWO o oo ao
U+17B6 KHMER VOWEL SIGN AA a aa a
U+17B7 KHMER VOWEL SIGN I e i e
U+17B8 KHMER VOWEL SIGN II e ii ei
U+17B9 KHMER VOWEL SIGN Y o y oe
U+17BA KHMER VOWEL SIGN YY o yy eu
U+17BC KHMER VOWEL SIGN UU u uu ou
...12 more differences

my — Myanmar

Block: 160 assigned codepoints, 141 mapped by at least one library.

Coverage: translit maps 136/141, Unidecode maps 77/141. 64 mapped only by translit, 5 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+1022 MYANMAR LETTER SHAN A a
U+1028 MYANMAR LETTER MON E e
U+102B MYANMAR VOWEL SIGN TALL AA a
U+1033 MYANMAR VOWEL SIGN MON II o
U+1034 MYANMAR VOWEL SIGN MON O o
U+1035 MYANMAR VOWEL SIGN E ABOVE e
U+103B MYANMAR CONSONANT SIGN MEDIAL YA y
U+103C MYANMAR CONSONANT SIGN MEDIAL RA r
U+103D MYANMAR CONSONANT SIGN MEDIAL WA w
U+103E MYANMAR CONSONANT SIGN MEDIAL HA h
U+103F MYANMAR LETTER GREAT SA sa
U+105A MYANMAR LETTER MON NGA nga
U+105B MYANMAR LETTER MON JHA jha
U+105C MYANMAR LETTER MON BBA ba
U+105D MYANMAR LETTER MON BBE be
U+105E MYANMAR CONSONANT SIGN MON MEDIAL NA n
U+105F MYANMAR CONSONANT SIGN MON MEDIAL MA m
U+1060 MYANMAR CONSONANT SIGN MON MEDIAL LA l
U+1061 MYANMAR LETTER SGAW KAREN SHA sha
U+1062 MYANMAR VOWEL SIGN SGAW KAREN EU eu
U+1065 MYANMAR LETTER WESTERN PWO KAREN THA tha
U+1066 MYANMAR LETTER WESTERN PWO KAREN PWA pwa
U+1067 MYANMAR VOWEL SIGN WESTERN PWO KAREN EU eu
U+1068 MYANMAR VOWEL SIGN WESTERN PWO KAREN UE ue
U+106E MYANMAR LETTER EASTERN PWO KAREN NNA na
U+106F MYANMAR LETTER EASTERN PWO KAREN YWA ywa
U+1070 MYANMAR LETTER EASTERN PWO KAREN GHWA ghwa
U+1071 MYANMAR VOWEL SIGN GEBA KAREN I i
U+1072 MYANMAR VOWEL SIGN KAYAH OE oe
U+1073 MYANMAR VOWEL SIGN KAYAH U u
...34 more

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+1036 MYANMAR SIGN ANUSVARA N
U+1037 MYANMAR SIGN DOT BELOW '
U+1038 MYANMAR SIGN VISARGA :
U+104E MYANMAR SYMBOL AFOREMENTIONED l*
U+104F MYANMAR SYMBOL GENITIVE e*

Output differences:

Char Codepoint Name translit Unidecode anyascii
က U+1000 MYANMAR LETTER KA ka k k
U+1001 MYANMAR LETTER KHA kha kh kh
U+1002 MYANMAR LETTER GA ga g g
U+1003 MYANMAR LETTER GHA gha gh gh
U+1004 MYANMAR LETTER NGA nga ng n
U+1005 MYANMAR LETTER CA sa c c
U+1006 MYANMAR LETTER CHA hsa ch ch
U+1007 MYANMAR LETTER JA za j j
U+1008 MYANMAR LETTER JHA zha jh jh
U+1009 MYANMAR LETTER NYA nya ny n
U+100A MYANMAR LETTER NNYA nya nny nn
U+100B MYANMAR LETTER TTA ta tt t
U+100C MYANMAR LETTER TTHA tha tth th
U+100D MYANMAR LETTER DDA da dd d
U+100E MYANMAR LETTER DDHA dha ddh dh
U+100F MYANMAR LETTER NNA na nn n
U+1010 MYANMAR LETTER TA ta tt t
U+1011 MYANMAR LETTER THA tha th th
U+1012 MYANMAR LETTER DA da d d
U+1013 MYANMAR LETTER DHA dha dh dh
U+1014 MYANMAR LETTER NA na n n
U+1015 MYANMAR LETTER PA pa p p
U+1016 MYANMAR LETTER PHA pha ph ph
U+1017 MYANMAR LETTER BA ba b b
U+1018 MYANMAR LETTER BHA bha bh bh
U+1019 MYANMAR LETTER MA ma m m
U+101A MYANMAR LETTER YA ya y y
U+101B MYANMAR LETTER RA ra r r
U+101C MYANMAR LETTER LA la l l
U+101D MYANMAR LETTER WA wa w v
U+101E MYANMAR LETTER SA tha s s
U+101F MYANMAR LETTER HA ha h h
U+1020 MYANMAR LETTER LLA la ll l
U+1024 MYANMAR LETTER II i ii i
U+1026 MYANMAR LETTER UU u uu u
U+102A MYANMAR LETTER AU o au o
U+102C MYANMAR VOWEL SIGN AA a aa a
U+102E MYANMAR VOWEL SIGN II i ii i
U+1030 MYANMAR VOWEL SIGN UU u uu u
U+1032 MYANMAR VOWEL SIGN AI e ai ai
U+104A MYANMAR SIGN LITTLE SECTION , / ,
U+104B MYANMAR SIGN SECTION . // .
U+104C MYANMAR SYMBOL LOCATIVE , n* n*
U+104D MYANMAR SYMBOL COMPLETED . r* r*
U+1050 MYANMAR LETTER SHA sha sh s
U+1051 MYANMAR LETTER SSA ssa ss s
U+1052 MYANMAR LETTER VOCALIC R ri R r
U+1053 MYANMAR LETTER VOCALIC RR ri RR r
U+1054 MYANMAR LETTER VOCALIC L li L l
U+1055 MYANMAR LETTER VOCALIC LL li LL l
...4 more differences

bo — Tibetan

Block: 211 assigned codepoints, 201 mapped by at least one library.

Coverage: translit maps 155/201, Unidecode maps 147/201. 22 mapped only by translit, 14 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+0F01 TIBETAN MARK GTER YIG MGO TRUNCATED A .
U+0F02 TIBETAN MARK GTER YIG MGO -UM RNAM BCAD MA .
U+0F03 TIBETAN MARK GTER YIG MGO -UM GTER TSHEG MA .
U+0F04 TIBETAN MARK INITIAL YIG MGO MDUN MA @
U+0F05 TIBETAN MARK CLOSING YIG MGO SGAB MA #
U+0F06 TIBETAN MARK CARET YIG MGO PHUR SHAD MA .
U+0F07 TIBETAN MARK YIG MGO TSHEG SHAD MA .
U+0F0A TIBETAN MARK BKA- SHOG YIG MGO *
U+0F3A TIBETAN MARK GUG RTAGS GYON (
U+0F3B TIBETAN MARK GUG RTAGS GYAS )
U+0F3C TIBETAN MARK ANG KHANG GYON (
U+0F3D TIBETAN MARK ANG KHANG GYAS )
U+0F6B TIBETAN LETTER KKA kka
U+0F6C TIBETAN LETTER RRA rra
U+0F85 TIBETAN MARK PALUTA .
U+0FD0 TIBETAN MARK BSKA- SHOG GI MGO RGYAN |
U+0FD1 TIBETAN MARK MNYAM YIG GI MGO RGYAN |
U+0FD2 TIBETAN MARK NYIS TSHEG :
U+0FD3 TIBETAN MARK INITIAL BRDA RNYING YIG MGO MDUN MA |
U+0FD4 TIBETAN MARK CLOSING BRDA RNYING YIG MGO SGAB MA |
U+0FD9 TIBETAN MARK LEADING MCHAN RTAGS |
U+0FDA TIBETAN MARK TRAILING MCHAN RTAGS |

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+0F0C TIBETAN MARK DELIMITER TSHEG BSTAR /
U+0F34 TIBETAN MARK BSDUS RTAGS +
U+0F35 TIBETAN MARK NGAS BZUNG NYI ZLA *
U+0F36 TIBETAN MARK CARET -DZUD RTAGS BZHI MIG CAN ^
U+0F37 TIBETAN MARK NGAS BZUNG SGOR RTAGS _
U+0F39 TIBETAN MARK TSA -PHRU ~
U+0F7E TIBETAN SIGN RJES SU NGA RO M
ཿ U+0F7F TIBETAN SIGN RNAM BCAD H
U+0FBE TIBETAN KU RU KHA X
྿ U+0FBF TIBETAN KU RU KHA BZHI MIG CAN :X:
U+0FC0 TIBETAN CANTILLATION SIGN HEAVY BEAT /O/
U+0FC1 TIBETAN CANTILLATION SIGN LIGHT BEAT /o/
U+0FC2 TIBETAN CANTILLATION SIGN CANG TE-U \o\
U+0FC3 TIBETAN CANTILLATION SIGN SBUB -CHAL (O)

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+0F00 TIBETAN SYLLABLE OM om AUM Om
U+0F08 TIBETAN MARK SBRUL SHAD ; // !
U+0F09 TIBETAN MARK BSKUR YIG MGO * * *
U+0F0D TIBETAN MARK SHAD . / ,
U+0F0E TIBETAN MARK NYIS SHAD . // .
U+0F0F TIBETAN MARK TSHEG SHAD . -/ ;
U+0F10 TIBETAN MARK NYIS TSHEG SHAD . +/ |
U+0F11 TIBETAN MARK RIN CHEN SPUNGS SHAD . X/ |
U+0F12 TIBETAN MARK RGYA GRAM SHAD . /XX/ /
U+0F13 TIBETAN MARK CARET -DZUD RTAGS ME LONG CAN . /X/ *
U+0F14 TIBETAN MARK GTER TSHEG : , :
U+0F2A TIBETAN DIGIT HALF ONE 0.0 .5 1-
U+0F2B TIBETAN DIGIT HALF TWO 0.5 1.5 2-
U+0F2C TIBETAN DIGIT HALF THREE 1.0 2.5 3-
U+0F2D TIBETAN DIGIT HALF FOUR 1.5 3.5 4-
U+0F2E TIBETAN DIGIT HALF FIVE 2.0 4.5 5-
U+0F2F TIBETAN DIGIT HALF SIX 2.5 5.5 6-
U+0F30 TIBETAN DIGIT HALF SEVEN 3.0 6.5 7-
U+0F31 TIBETAN DIGIT HALF EIGHT 3.5 7.5 8-
U+0F32 TIBETAN DIGIT HALF NINE 4.0 8.5 9-
U+0F33 TIBETAN DIGIT HALF ZERO 4.5 -.5 0-
U+0F40 TIBETAN LETTER KA ka k k
U+0F41 TIBETAN LETTER KHA kha kh kh
U+0F42 TIBETAN LETTER GA ga g g
U+0F43 TIBETAN LETTER GHA ga gh gh
U+0F44 TIBETAN LETTER NGA nga ng ng
U+0F45 TIBETAN LETTER CA cha c c
U+0F46 TIBETAN LETTER CHA chha ch ch
U+0F47 TIBETAN LETTER JA ja j j
U+0F49 TIBETAN LETTER NYA nya ny ny
U+0F4A TIBETAN LETTER TTA ta tt t
U+0F4B TIBETAN LETTER TTHA tha tth th
U+0F4C TIBETAN LETTER DDA da dd d
U+0F4D TIBETAN LETTER DDHA da ddh dh
U+0F4E TIBETAN LETTER NNA na nn n
U+0F4F TIBETAN LETTER TA ta t t
U+0F50 TIBETAN LETTER THA tha th th
U+0F51 TIBETAN LETTER DA da d d
U+0F52 TIBETAN LETTER DHA da dh dh
U+0F53 TIBETAN LETTER NA na n n
U+0F54 TIBETAN LETTER PA pa p p
U+0F55 TIBETAN LETTER PHA pha ph ph
U+0F56 TIBETAN LETTER BA ba b b
U+0F57 TIBETAN LETTER BHA ba bh bh
U+0F58 TIBETAN LETTER MA ma m m
U+0F59 TIBETAN LETTER TSA tsa ts ts
U+0F5A TIBETAN LETTER TSHA tsha tsh tsh
U+0F5B TIBETAN LETTER DZA dza dz dz
U+0F5C TIBETAN LETTER DZHA dza dzh dzh
U+0F5D TIBETAN LETTER WA wa w w
...65 more differences

am — Amharic

Block: 384 assigned codepoints, 370 mapped by at least one library.

Coverage: translit maps 370/370, Unidecode maps 343/370. 27 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+1207 ETHIOPIC SYLLABLE HOA hwa
U+1247 ETHIOPIC SYLLABLE QOA qwa
U+1287 ETHIOPIC SYLLABLE XOA hwa
U+12A2 ETHIOPIC SYLLABLE GLOTTAL I i
U+12AF ETHIOPIC SYLLABLE KOA kwa
U+12CF ETHIOPIC SYLLABLE WOA wwa
U+12EF ETHIOPIC SYLLABLE YOA ywa
U+130F ETHIOPIC SYLLABLE GOA gwa
U+131F ETHIOPIC SYLLABLE GGWAA ggwa
U+1347 ETHIOPIC SYLLABLE TZOA swa
U+1360 ETHIOPIC SECTION MARK
U+1380 ETHIOPIC SYLLABLE SEBATBEIT MWA mwa
U+1381 ETHIOPIC SYLLABLE MWI mwi
U+1382 ETHIOPIC SYLLABLE MWEE mwe
U+1383 ETHIOPIC SYLLABLE MWE mwe
U+1384 ETHIOPIC SYLLABLE SEBATBEIT BWA bwa
U+1385 ETHIOPIC SYLLABLE BWI bwi
U+1386 ETHIOPIC SYLLABLE BWEE bwe
U+1387 ETHIOPIC SYLLABLE BWE bwe
U+1388 ETHIOPIC SYLLABLE SEBATBEIT FWA fwa
U+1389 ETHIOPIC SYLLABLE FWI fwi
U+138A ETHIOPIC SYLLABLE FWEE fwe
U+138B ETHIOPIC SYLLABLE FWE fwe
U+138C ETHIOPIC SYLLABLE SEBATBEIT PWA pwa
U+138D ETHIOPIC SYLLABLE PWI pwi
U+138E ETHIOPIC SYLLABLE PWEE pwe
U+138F ETHIOPIC SYLLABLE PWE pwe

Output differences:

Char Codepoint Name translit Unidecode anyascii
U+1200 ETHIOPIC SYLLABLE HA he ha ha
U+1203 ETHIOPIC SYLLABLE HAA ha haa ha
U+1204 ETHIOPIC SYLLABLE HEE he hee he
U+1205 ETHIOPIC SYLLABLE HE h he h
U+1208 ETHIOPIC SYLLABLE LA le la le
U+120B ETHIOPIC SYLLABLE LAA la laa la
U+120C ETHIOPIC SYLLABLE LEE le lee le
U+120D ETHIOPIC SYLLABLE LE l le l
U+1210 ETHIOPIC SYLLABLE HHA hhe hha ha
U+1213 ETHIOPIC SYLLABLE HHAA hha hhaa ha
U+1214 ETHIOPIC SYLLABLE HHEE hhe hhee he
U+1215 ETHIOPIC SYLLABLE HHE hh hhe h
U+1218 ETHIOPIC SYLLABLE MA me ma me
U+121B ETHIOPIC SYLLABLE MAA ma maa ma
U+121C ETHIOPIC SYLLABLE MEE me mee me
U+121D ETHIOPIC SYLLABLE ME m me m
U+1220 ETHIOPIC SYLLABLE SZA se sza se
U+1221 ETHIOPIC SYLLABLE SZU su szu su
U+1222 ETHIOPIC SYLLABLE SZI si szi si
U+1223 ETHIOPIC SYLLABLE SZAA sa szaa sa
U+1224 ETHIOPIC SYLLABLE SZEE se szee se
U+1225 ETHIOPIC SYLLABLE SZE s sze s
U+1226 ETHIOPIC SYLLABLE SZO so szo so
U+1227 ETHIOPIC SYLLABLE SZWA swa szwa swa
U+1228 ETHIOPIC SYLLABLE RA re ra re
U+122B ETHIOPIC SYLLABLE RAA ra raa ra
U+122C ETHIOPIC SYLLABLE REE re ree re
U+122D ETHIOPIC SYLLABLE RE r re r
U+1230 ETHIOPIC SYLLABLE SA se sa se
U+1233 ETHIOPIC SYLLABLE SAA sa saa sa
U+1234 ETHIOPIC SYLLABLE SEE se see se
U+1235 ETHIOPIC SYLLABLE SE s se s
U+1238 ETHIOPIC SYLLABLE SHA she sha she
U+123B ETHIOPIC SYLLABLE SHAA sha shaa sha
U+123C ETHIOPIC SYLLABLE SHEE she shee she
U+123D ETHIOPIC SYLLABLE SHE sh she sh
U+1240 ETHIOPIC SYLLABLE QA qe qa k'e
U+1243 ETHIOPIC SYLLABLE QAA qa qaa k'a
U+1244 ETHIOPIC SYLLABLE QEE qe qee k'e
U+1245 ETHIOPIC SYLLABLE QE q qe k'
U+124B ETHIOPIC SYLLABLE QWAA qwa qwaa k'wa
U+124C ETHIOPIC SYLLABLE QWEE qwe qwee k'we
U+124D ETHIOPIC SYLLABLE QWE qw qwe k'wi
U+1250 ETHIOPIC SYLLABLE QHA qhe qha k'e
U+1253 ETHIOPIC SYLLABLE QHAA qha qhaa k'a
U+1254 ETHIOPIC SYLLABLE QHEE qhe qhee k'e
U+1255 ETHIOPIC SYLLABLE QHE qh qhe k'
U+125B ETHIOPIC SYLLABLE QHWAA qhwa qhwaa k'wa
U+125C ETHIOPIC SYLLABLE QHWEE qhwe qhwee k'we
U+125D ETHIOPIC SYLLABLE QHWE qhw qhwe k'wi
...168 more differences

ru — Russian

Block: 304 assigned codepoints, 301 mapped by at least one library.

Coverage: translit maps 294/301, Unidecode maps 234/301. 65 mapped only by translit, 5 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
Ҋ U+048A CYRILLIC CAPITAL LETTER SHORT I WITH TAIL Y
ҋ U+048B CYRILLIC SMALL LETTER SHORT I WITH TAIL y
Ӆ U+04C5 CYRILLIC CAPITAL LETTER EL WITH TAIL L
ӆ U+04C6 CYRILLIC SMALL LETTER EL WITH TAIL l
Ӊ U+04C9 CYRILLIC CAPITAL LETTER EN WITH TAIL N
ӊ U+04CA CYRILLIC SMALL LETTER EN WITH TAIL n
Ӎ U+04CD CYRILLIC CAPITAL LETTER EM WITH TAIL M
ӎ U+04CE CYRILLIC SMALL LETTER EM WITH TAIL m
ӏ U+04CF CYRILLIC SMALL LETTER PALOCHKA i
Ӷ U+04F6 CYRILLIC CAPITAL LETTER GHE WITH DESCENDER G
ӷ U+04F7 CYRILLIC SMALL LETTER GHE WITH DESCENDER g
Ӻ U+04FA CYRILLIC CAPITAL LETTER GHE WITH STROKE AND HOOK G
ӻ U+04FB CYRILLIC SMALL LETTER GHE WITH STROKE AND HOOK g
Ӽ U+04FC CYRILLIC CAPITAL LETTER HA WITH HOOK Kh
ӽ U+04FD CYRILLIC SMALL LETTER HA WITH HOOK kh
Ӿ U+04FE CYRILLIC CAPITAL LETTER HA WITH STROKE Kh
ӿ U+04FF CYRILLIC SMALL LETTER HA WITH STROKE kh
Ԁ U+0500 CYRILLIC CAPITAL LETTER KOMI DE D
ԁ U+0501 CYRILLIC SMALL LETTER KOMI DE d
Ԃ U+0502 CYRILLIC CAPITAL LETTER KOMI DJE Dj
ԃ U+0503 CYRILLIC SMALL LETTER KOMI DJE dj
Ԅ U+0504 CYRILLIC CAPITAL LETTER KOMI ZJE Z
ԅ U+0505 CYRILLIC SMALL LETTER KOMI ZJE z
Ԇ U+0506 CYRILLIC CAPITAL LETTER KOMI DZJE Dz
ԇ U+0507 CYRILLIC SMALL LETTER KOMI DZJE dz
Ԉ U+0508 CYRILLIC CAPITAL LETTER KOMI LJE Lj
ԉ U+0509 CYRILLIC SMALL LETTER KOMI LJE lj
Ԋ U+050A CYRILLIC CAPITAL LETTER KOMI NJE Nj
ԋ U+050B CYRILLIC SMALL LETTER KOMI NJE nj
Ԍ U+050C CYRILLIC CAPITAL LETTER KOMI SJE Sj
...35 more

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
҂ U+0482 CYRILLIC THOUSANDS SIGN *1000*
҈ U+0488 COMBINING CYRILLIC HUNDRED THOUSANDS SIGN *100.000*
҉ U+0489 COMBINING CYRILLIC MILLIONS SIGN *1.000.000*
Ҍ U+048C CYRILLIC CAPITAL LETTER SEMISOFT SIGN "
ҍ U+048D CYRILLIC SMALL LETTER SEMISOFT SIGN "

Output differences:

Char Codepoint Name translit Unidecode anyascii
Ѐ U+0400 CYRILLIC CAPITAL LETTER IE WITH GRAVE E Ie E
Ё U+0401 CYRILLIC CAPITAL LETTER IO Yo Io E
Ѓ U+0403 CYRILLIC CAPITAL LETTER GJE G Gj G
Є U+0404 CYRILLIC CAPITAL LETTER UKRAINIAN IE Ye Ie Ie
Ќ U+040C CYRILLIC CAPITAL LETTER KJE K Kj K
Й U+0419 CYRILLIC CAPITAL LETTER SHORT I Y I Y
Ъ U+042A CYRILLIC CAPITAL LETTER HARD SIGN " ' '
Ю U+042E CYRILLIC CAPITAL LETTER YU Yu Iu Yu
Я U+042F CYRILLIC CAPITAL LETTER YA Ya Ia Ya
й U+0439 CYRILLIC SMALL LETTER SHORT I y i y
ъ U+044A CYRILLIC SMALL LETTER HARD SIGN " ' '
ю U+044E CYRILLIC SMALL LETTER YU yu iu yu
я U+044F CYRILLIC SMALL LETTER YA ya ia ya
ѐ U+0450 CYRILLIC SMALL LETTER IE WITH GRAVE e ie e
ё U+0451 CYRILLIC SMALL LETTER IO yo io e
ѓ U+0453 CYRILLIC SMALL LETTER GJE g gj g
є U+0454 CYRILLIC SMALL LETTER UKRAINIAN IE ye ie ie
ќ U+045C CYRILLIC SMALL LETTER KJE k kj k
Ѣ U+0462 CYRILLIC CAPITAL LETTER YAT Ye E E
ѣ U+0463 CYRILLIC SMALL LETTER YAT ye e e
Ѹ U+0478 CYRILLIC CAPITAL LETTER UK U u U
Ҁ U+0480 CYRILLIC CAPITAL LETTER KOPPA K Q Q
ҁ U+0481 CYRILLIC SMALL LETTER KOPPA k q q
Ҏ U+048E CYRILLIC CAPITAL LETTER ER WITH TICK R R' Rh
ҏ U+048F CYRILLIC SMALL LETTER ER WITH TICK r r' rh
Ґ U+0490 CYRILLIC CAPITAL LETTER GHE WITH UPTURN G G' G
ґ U+0491 CYRILLIC SMALL LETTER GHE WITH UPTURN g g' g
Ғ U+0492 CYRILLIC CAPITAL LETTER GHE WITH STROKE G G' Gh
ғ U+0493 CYRILLIC SMALL LETTER GHE WITH STROKE g g' gh
Ҕ U+0494 CYRILLIC CAPITAL LETTER GHE WITH MIDDLE HOOK G G' Gh
ҕ U+0495 CYRILLIC SMALL LETTER GHE WITH MIDDLE HOOK g g' gh
Җ U+0496 CYRILLIC CAPITAL LETTER ZHE WITH DESCENDER Zh Zh' J
җ U+0497 CYRILLIC SMALL LETTER ZHE WITH DESCENDER zh zh' j
Ҙ U+0498 CYRILLIC CAPITAL LETTER ZE WITH DESCENDER Z Z' Z
ҙ U+0499 CYRILLIC SMALL LETTER ZE WITH DESCENDER z z' z
Қ U+049A CYRILLIC CAPITAL LETTER KA WITH DESCENDER K K' Q
қ U+049B CYRILLIC SMALL LETTER KA WITH DESCENDER k k' q
Ҝ U+049C CYRILLIC CAPITAL LETTER KA WITH VERTICAL STROKE K K' G
ҝ U+049D CYRILLIC SMALL LETTER KA WITH VERTICAL STROKE k k' g
Ҟ U+049E CYRILLIC CAPITAL LETTER KA WITH STROKE K K' Q
ҟ U+049F CYRILLIC SMALL LETTER KA WITH STROKE k k' q
Ҡ U+04A0 CYRILLIC CAPITAL LETTER BASHKIR KA K K' Q
ҡ U+04A1 CYRILLIC SMALL LETTER BASHKIR KA k k' q
Ң U+04A2 CYRILLIC CAPITAL LETTER EN WITH DESCENDER N N' Ng
ң U+04A3 CYRILLIC SMALL LETTER EN WITH DESCENDER n n' ng
Ҧ U+04A6 CYRILLIC CAPITAL LETTER PE WITH MIDDLE HOOK P P' Ph
ҧ U+04A7 CYRILLIC SMALL LETTER PE WITH MIDDLE HOOK p p' ph
Ҫ U+04AA CYRILLIC CAPITAL LETTER ES WITH DESCENDER S S' S
ҫ U+04AB CYRILLIC SMALL LETTER ES WITH DESCENDER s s' s
Ҭ U+04AC CYRILLIC CAPITAL LETTER TE WITH DESCENDER T T' Th
...26 more differences

dv — Dhivehi

Block: 50 assigned codepoints, 49 mapped by at least one library.

Output differences:

Char Codepoint Name translit Unidecode anyascii
ޅ U+0785 THAANA LETTER LHAVIYANI lh L lh
ޏ U+078F THAANA LETTER GNAVIYANI gn ny gn
ޢ U+07A2 THAANA LETTER AINU ' ` '

jv — Javanese

Block: 91 assigned codepoints, 90 mapped by at least one library.

Coverage: translit maps 75/90, Unidecode maps 0/90. 75 mapped only by translit, 0 mapped only by Unidecode.
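A count such as "Unidecode maps 0/90" depends on the "mapped" test from the methodology: the output must be non-empty, must not be `[?]`, and must differ from the original character. A minimal sketch of that predicate follows. The function name `is_mapped` is an assumption for illustration, and the report's own script may apply further checks, such as requiring pure-ASCII output.

```python
def is_mapped(original: str, output: str) -> bool:
    """Return True when `output` counts as a meaningful transliteration.

    Mirrors the three conditions in the methodology: the output is
    not empty, not the "[?]" placeholder, and not the unchanged input.
    """
    if output == "" or output == "[?]":
        return False
    return output != original


# Hypothetical outputs for U+A984 JAVANESE LETTER A:
javanese_a = "\uA984"
print(is_mapped(javanese_a, "a"))         # a Latin rendering counts as mapped
print(is_mapped(javanese_a, ""))          # empty output does not
print(is_mapped(javanese_a, "[?]"))       # neither does the placeholder
print(is_mapped(javanese_a, javanese_a))  # nor a pass-through of the input
```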

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+A984 JAVANESE LETTER A a
U+A985 JAVANESE LETTER I KAWI aa
U+A986 JAVANESE LETTER I i
U+A987 JAVANESE LETTER II ii
U+A988 JAVANESE LETTER U u
U+A989 JAVANESE LETTER PA CEREK uu
U+A98A JAVANESE LETTER NGA LELET e
U+A98B JAVANESE LETTER NGA LELET RASWADI ai
U+A98C JAVANESE LETTER E o
U+A98D JAVANESE LETTER AI au
U+A98E JAVANESE LETTER O e
U+A98F JAVANESE LETTER KA o
U+A990 JAVANESE LETTER KA SASAK ka
U+A991 JAVANESE LETTER KA MURDA kha
U+A992 JAVANESE LETTER GA ga
U+A993 JAVANESE LETTER GA MURDA gha
U+A994 JAVANESE LETTER NGA nga
U+A995 JAVANESE LETTER CA cha
U+A996 JAVANESE LETTER CA MURDA chha
U+A997 JAVANESE LETTER JA ja
U+A998 JAVANESE LETTER NYA MURDA jha
U+A999 JAVANESE LETTER JA MAHAPRANA nya
U+A99A JAVANESE LETTER NYA tta
U+A99B JAVANESE LETTER TTA ttha
U+A99C JAVANESE LETTER TTA MAHAPRANA dda
U+A99D JAVANESE LETTER DDA ddha
U+A99E JAVANESE LETTER DDA MAHAPRANA nna
U+A99F JAVANESE LETTER NA MURDA ta
U+A9A0 JAVANESE LETTER TA tha
U+A9A1 JAVANESE LETTER TA MURDA da
...45 more

mn — Mongolian

Block: 157 assigned codepoints, 153 mapped by at least one library.

Coverage: translit maps 149/153, Unidecode maps 148/153. 5 mapped only by translit, 4 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+1806 MONGOLIAN TODO SOFT HYPHEN -
U+180A MONGOLIAN NIRUGU -
U+1878 MONGOLIAN LETTER CHA WITH TWO DOTS ch
U+1880 MONGOLIAN LETTER ALI GALI ANUSVARA ONE m
U+18AA MONGOLIAN LETTER MANCHU ALI GALI LHA lha

Mapped only by Unidecode (translit returns empty):

Char Codepoint Name Unidecode
U+1843 MONGOLIAN LETTER TODO LONG VOWEL SIGN -
U+1885 MONGOLIAN LETTER ALI GALI BALUDA 3
U+1886 MONGOLIAN LETTER ALI GALI THREE BALUDA 333
U+18A9 MONGOLIAN LETTER ALI GALI DAGALGA '
Char Codepoint Name translit Unidecode anyascii

U+1800 MONGOLIAN BIRGA . @ @
U+1801 MONGOLIAN ELLIPSIS . ... ...
U+1802 MONGOLIAN COMMA , , ,
U+1803 MONGOLIAN FULL STOP : . .
U+1804 MONGOLIAN COLON ... : :
U+1805 MONGOLIAN FOUR DOTS : // *
U+1807 MONGOLIAN SIBE SYLLABLE BOUNDARY MARKER . - -
U+1808 MONGOLIAN MANCHU COMMA , , ,
U+1809 MONGOLIAN MANCHU FULL STOP . . .
U+1825 MONGOLIAN LETTER OE oe O o
U+1826 MONGOLIAN LETTER UE ue U u
U+183B MONGOLIAN LETTER KHA kh kha k
U+1848 MONGOLIAN LETTER TODO OE oe O o
U+1849 MONGOLIAN LETTER TODO UE ue U u
U+184A MONGOLIAN LETTER TODO ANG ang ng ng
U+185A MONGOLIAN LETTER TODO JIA j jy j
U+185B MONGOLIAN LETTER TODO NIA n ny ny
U+1860 MONGOLIAN LETTER SIBE UE ue U u
U+1862 MONGOLIAN LETTER SIBE ANG ang ng ng
U+1881 MONGOLIAN LETTER ALI GALI VISARGA ONE h H h
U+1882 MONGOLIAN LETTER ALI GALI DAMARU d X h
U+1883 MONGOLIAN LETTER ALI GALI UBADAMA u W h
U+1884 MONGOLIAN LETTER ALI GALI INVERTED UBADAMA u M h
U+1889 MONGOLIAN LETTER ALI GALI KA ka k k
U+188A MONGOLIAN LETTER ALI GALI NGA nga ng ng
U+188B MONGOLIAN LETTER ALI GALI CA ca c ts
U+188C MONGOLIAN LETTER ALI GALI TTA ta tt t
U+188D MONGOLIAN LETTER ALI GALI TTHA tha tth th
U+188E MONGOLIAN LETTER ALI GALI DDA da dd d
U+188F MONGOLIAN LETTER ALI GALI NNA na nn n
U+1890 MONGOLIAN LETTER ALI GALI TA ta t t
U+1891 MONGOLIAN LETTER ALI GALI DA da d d
U+1892 MONGOLIAN LETTER ALI GALI PA pa p p
U+1893 MONGOLIAN LETTER ALI GALI PHA pha ph ph
U+1894 MONGOLIAN LETTER ALI GALI SSA sha ss s
U+1895 MONGOLIAN LETTER ALI GALI ZHA zha zh zh
U+1896 MONGOLIAN LETTER ALI GALI ZA za z z
U+1897 MONGOLIAN LETTER ALI GALI AH ah a '
U+1898 MONGOLIAN LETTER TODO ALI GALI TA ta t t
U+1899 MONGOLIAN LETTER TODO ALI GALI ZHA zha zh zh
U+189A MONGOLIAN LETTER MANCHU ALI GALI GHA gha gh gh
U+189B MONGOLIAN LETTER MANCHU ALI GALI NGA nga ng ng
U+189C MONGOLIAN LETTER MANCHU ALI GALI CA ca c ts
U+189D MONGOLIAN LETTER MANCHU ALI GALI JHA jha jh dzh
U+189E MONGOLIAN LETTER MANCHU ALI GALI TTA ta tta t
U+189F MONGOLIAN LETTER MANCHU ALI GALI DDHA dha ddh dh
U+18A0 MONGOLIAN LETTER MANCHU ALI GALI TA ta t t
U+18A1 MONGOLIAN LETTER MANCHU ALI GALI DHA dha dh dh
U+18A2 MONGOLIAN LETTER MANCHU ALI GALI SSA sha ss s
U+18A3 MONGOLIAN LETTER MANCHU ALI GALI CYA cya cy c
...3 more differences

su — Sundanese

Block: 64 assigned codepoints, 63 mapped by at least one library.

Coverage: translit maps 48/63, Unidecode maps 0/63. 48 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+1B83 SUNDANESE LETTER A a
U+1B84 SUNDANESE LETTER I i
U+1B85 SUNDANESE LETTER U u
U+1B86 SUNDANESE LETTER AE ae
U+1B87 SUNDANESE LETTER O o
U+1B88 SUNDANESE LETTER E e
U+1B89 SUNDANESE LETTER EU eu
U+1B8A SUNDANESE LETTER KA ka
U+1B8B SUNDANESE LETTER QA qa
U+1B8C SUNDANESE LETTER GA ga
U+1B8D SUNDANESE LETTER NGA nga
U+1B8E SUNDANESE LETTER CA ca
U+1B8F SUNDANESE LETTER JA ja
U+1B90 SUNDANESE LETTER ZA za
U+1B91 SUNDANESE LETTER NYA nya
U+1B92 SUNDANESE LETTER TA ta
U+1B93 SUNDANESE LETTER DA da
U+1B94 SUNDANESE LETTER NA na
U+1B95 SUNDANESE LETTER PA pa
U+1B96 SUNDANESE LETTER FA fa
U+1B97 SUNDANESE LETTER VA ba
U+1B98 SUNDANESE LETTER BA ma
U+1B99 SUNDANESE LETTER MA ya
U+1B9A SUNDANESE LETTER YA ra
U+1B9B SUNDANESE LETTER RA la
U+1B9C SUNDANESE LETTER LA wa
U+1B9D SUNDANESE LETTER WA sa
U+1B9E SUNDANESE LETTER SA ha
U+1B9F SUNDANESE LETTER XA sa
U+1BA0 SUNDANESE LETTER HA xa
...18 more

nod — Tai Tham

Block: 127 assigned codepoints, 119 mapped by at least one library.

Coverage: translit maps 103/119, Unidecode maps 0/119. 103 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+1A20 TAI THAM LETTER HIGH KA ka
U+1A21 TAI THAM LETTER HIGH KHA kha
U+1A22 TAI THAM LETTER HIGH KXA kha
U+1A23 TAI THAM LETTER LOW KA ga
U+1A24 TAI THAM LETTER LOW KXA gha
U+1A25 TAI THAM LETTER LOW KHA nga
U+1A26 TAI THAM LETTER NGA ca
U+1A27 TAI THAM LETTER HIGH CA sa
U+1A28 TAI THAM LETTER HIGH CHA cha
U+1A29 TAI THAM LETTER LOW CA ja
U+1A2A TAI THAM LETTER LOW SA ha
U+1A2B TAI THAM LETTER LOW CHA nya
U+1A2C TAI THAM LETTER NYA da
U+1A2D TAI THAM LETTER RATA na
U+1A2E TAI THAM LETTER HIGH RATHA da
U+1A2F TAI THAM LETTER DA tha
U+1A30 TAI THAM LETTER LOW RATHA tha
U+1A31 TAI THAM LETTER RANA da
U+1A32 TAI THAM LETTER HIGH TA dha
U+1A33 TAI THAM LETTER HIGH THA na
U+1A34 TAI THAM LETTER LOW TA ba
U+1A35 TAI THAM LETTER LOW THA pa
U+1A36 TAI THAM LETTER NA pha
U+1A37 TAI THAM LETTER BA fa
U+1A38 TAI THAM LETTER HIGH PA pha
U+1A39 TAI THAM LETTER HIGH PHA ba
U+1A3A TAI THAM LETTER HIGH FA bha
U+1A3B TAI THAM LETTER LOW PA ma
U+1A3C TAI THAM LETTER LOW FA ya
U+1A3D TAI THAM LETTER LOW PHA ra
...73 more

cjm — Cham

Block: 83 assigned codepoints, 83 mapped by at least one library.

Coverage: translit maps 78/83, Unidecode maps 0/83. 78 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+AA00 CHAM LETTER A ka
U+AA01 CHAM LETTER I kha
U+AA02 CHAM LETTER U ga
U+AA03 CHAM LETTER E gha
U+AA04 CHAM LETTER AI ngha
U+AA05 CHAM LETTER O nga
U+AA06 CHAM LETTER KA cha
U+AA07 CHAM LETTER KHA chha
U+AA08 CHAM LETTER GA ja
U+AA09 CHAM LETTER GHA jha
U+AA0A CHAM LETTER NGUE nhja
U+AA0B CHAM LETTER NGA nha
U+AA0C CHAM LETTER CHA nhra
U+AA0D CHAM LETTER CHHA a
U+AA0E CHAM LETTER JA ta
U+AA0F CHAM LETTER JHA tha
U+AA10 CHAM LETTER NHUE da
U+AA11 CHAM LETTER NHA dha
U+AA12 CHAM LETTER NHJA nra
U+AA13 CHAM LETTER TA na
U+AA14 CHAM LETTER THA dda
U+AA15 CHAM LETTER DA pa
U+AA16 CHAM LETTER DHA ppa
U+AA17 CHAM LETTER NUE pha
U+AA18 CHAM LETTER NA ba
U+AA19 CHAM LETTER DDA bha
U+AA1A CHAM LETTER PA mba
U+AA1B CHAM LETTER PPA ma
U+AA1C CHAM LETTER PHA bba
U+AA1D CHAM LETTER BA ya
...48 more

btk — Batak

Block: 56 assigned codepoints, 54 mapped by at least one library.

Coverage: translit maps 50/54, Unidecode maps 0/54. 50 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+1BC0 BATAK LETTER A a
U+1BC1 BATAK LETTER SIMALUNGUN A ha
U+1BC2 BATAK LETTER HA ha
U+1BC3 BATAK LETTER SIMALUNGUN HA ba
U+1BC4 BATAK LETTER MANDAILING HA ba
U+1BC5 BATAK LETTER BA pa
U+1BC6 BATAK LETTER KARO BA pa
U+1BC7 BATAK LETTER PA na
U+1BC8 BATAK LETTER SIMALUNGUN PA na
U+1BC9 BATAK LETTER NA na
U+1BCA BATAK LETTER MANDAILING NA wa
U+1BCB BATAK LETTER WA wa
U+1BCC BATAK LETTER SIMALUNGUN WA ga
U+1BCD BATAK LETTER PAKPAK WA ga
U+1BCE BATAK LETTER GA ja
U+1BCF BATAK LETTER SIMALUNGUN GA da
U+1BD0 BATAK LETTER JA da
U+1BD1 BATAK LETTER DA ra
U+1BD2 BATAK LETTER RA ra
U+1BD3 BATAK LETTER SIMALUNGUN RA ma
U+1BD4 BATAK LETTER MA ma
U+1BD5 BATAK LETTER SIMALUNGUN MA ta
U+1BD6 BATAK LETTER SOUTHERN TA ta
U+1BD7 BATAK LETTER NORTHERN TA sa
U+1BD8 BATAK LETTER SA sa
U+1BD9 BATAK LETTER SIMALUNGUN SA sa
U+1BDA BATAK LETTER MANDAILING SA ya
U+1BDB BATAK LETTER YA ya
U+1BDC BATAK LETTER SIMALUNGUN YA nga
U+1BDD BATAK LETTER NGA nga
...20 more

bug — Buginese

Block: 30 assigned codepoints, 30 mapped by at least one library.
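The "assigned codepoints" figure can be approximated with Python's standard `unicodedata` module by skipping the categories the methodology excludes (unassigned, private-use, surrogate, and format). This is a sketch rather than the report's actual script: `assigned_codepoints` is a hypothetical helper, and the result depends on the Unicode version bundled with the Python interpreter in use.

```python
import unicodedata
from typing import List

# General categories skipped by the methodology:
# Cn = unassigned, Co = private use, Cs = surrogate, Cf = format.
SKIPPED_CATEGORIES = {"Cn", "Co", "Cs", "Cf"}


def assigned_codepoints(start: int, end: int) -> List[int]:
    """List codepoints in [start, end] that are not in a skipped category."""
    return [
        cp
        for cp in range(start, end + 1)
        if unicodedata.category(chr(cp)) not in SKIPPED_CATEGORIES
    ]


# Buginese block: U+1A00..U+1A1F
buginese = assigned_codepoints(0x1A00, 0x1A1F)
print(len(buginese))
print([f"U+{cp:04X}" for cp in buginese[:3]])
```

For the Buginese block, U+1A1C and U+1A1D are unassigned, so the list should skip them, in line with the listing in this section jumping from U+1A1B to U+1A1E.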

Coverage: translit maps 30/30, Unidecode maps 0/30. 30 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+1A00 BUGINESE LETTER KA ka
U+1A01 BUGINESE LETTER GA ga
U+1A02 BUGINESE LETTER NGA nga
U+1A03 BUGINESE LETTER NGKA ngka
U+1A04 BUGINESE LETTER PA pa
U+1A05 BUGINESE LETTER BA ba
U+1A06 BUGINESE LETTER MA ma
U+1A07 BUGINESE LETTER MPA mpa
U+1A08 BUGINESE LETTER TA ta
U+1A09 BUGINESE LETTER DA da
U+1A0A BUGINESE LETTER NA na
U+1A0B BUGINESE LETTER NRA nra
U+1A0C BUGINESE LETTER CA ca
U+1A0D BUGINESE LETTER JA ja
U+1A0E BUGINESE LETTER NYA nya
U+1A0F BUGINESE LETTER NYCA nyca
U+1A10 BUGINESE LETTER YA ya
U+1A11 BUGINESE LETTER RA ra
U+1A12 BUGINESE LETTER LA la
U+1A13 BUGINESE LETTER VA wa
U+1A14 BUGINESE LETTER SA sa
U+1A15 BUGINESE LETTER A a
U+1A16 BUGINESE LETTER HA ha
U+1A17 BUGINESE VOWEL SIGN I i
U+1A18 BUGINESE VOWEL SIGN U u
U+1A19 BUGINESE VOWEL SIGN E e
U+1A1A BUGINESE VOWEL SIGN O o
U+1A1B BUGINESE VOWEL SIGN AE e
U+1A1E BUGINESE PALLAWA .
U+1A1F BUGINESE END OF SECTION .

tl — Tagalog

Block: 23 assigned codepoints, 21 mapped by at least one library.

Coverage: translit maps 21/21, Unidecode maps 0/21. 21 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+1700 TAGALOG LETTER A a
U+1701 TAGALOG LETTER I i
U+1702 TAGALOG LETTER U u
U+1703 TAGALOG LETTER KA ka
U+1704 TAGALOG LETTER GA ga
U+1705 TAGALOG LETTER NGA nga
U+1706 TAGALOG LETTER TA ta
U+1707 TAGALOG LETTER DA da
U+1708 TAGALOG LETTER NA na
U+1709 TAGALOG LETTER PA pa
U+170A TAGALOG LETTER BA ba
U+170B TAGALOG LETTER MA ma
U+170C TAGALOG LETTER YA ya
U+170D TAGALOG LETTER RA ra
U+170E TAGALOG LETTER LA la
U+170F TAGALOG LETTER WA wa
U+1710 TAGALOG LETTER SA sa
U+1711 TAGALOG LETTER HA ha
U+1712 TAGALOG VOWEL SIGN I i
U+1713 TAGALOG VOWEL SIGN U u
U+171F TAGALOG LETTER ARCHAIC RA ra

hnn — Hanunoo

Block: 23 assigned codepoints, 22 mapped by at least one library.

Coverage: translit maps 22/22, Unidecode maps 0/22. 22 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+1720 HANUNOO LETTER A a
U+1721 HANUNOO LETTER I i
U+1722 HANUNOO LETTER U u
U+1723 HANUNOO LETTER KA ka
U+1724 HANUNOO LETTER GA ga
U+1725 HANUNOO LETTER NGA nga
U+1726 HANUNOO LETTER TA ta
U+1727 HANUNOO LETTER DA da
U+1728 HANUNOO LETTER NA na
U+1729 HANUNOO LETTER PA pa
U+172A HANUNOO LETTER BA ba
U+172B HANUNOO LETTER MA ma
U+172C HANUNOO LETTER YA ya
U+172D HANUNOO LETTER RA ra
U+172E HANUNOO LETTER LA la
U+172F HANUNOO LETTER WA wa
U+1730 HANUNOO LETTER SA sa
U+1731 HANUNOO LETTER HA ha
U+1732 HANUNOO VOWEL SIGN I i
U+1733 HANUNOO VOWEL SIGN U u
U+1735 PHILIPPINE SINGLE PUNCTUATION .
U+1736 PHILIPPINE DOUBLE PUNCTUATION .

bku — Buhid

Block: 20 assigned codepoints, 20 mapped by at least one library.

Coverage: translit maps 20/20, Unidecode maps 0/20. 20 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+1740 BUHID LETTER A a
U+1741 BUHID LETTER I i
U+1742 BUHID LETTER U u
U+1743 BUHID LETTER KA ka
U+1744 BUHID LETTER GA ga
U+1745 BUHID LETTER NGA nga
U+1746 BUHID LETTER TA ta
U+1747 BUHID LETTER DA da
U+1748 BUHID LETTER NA na
U+1749 BUHID LETTER PA pa
U+174A BUHID LETTER BA ba
U+174B BUHID LETTER MA ma
U+174C BUHID LETTER YA ya
U+174D BUHID LETTER RA ra
U+174E BUHID LETTER LA la
U+174F BUHID LETTER WA wa
U+1750 BUHID LETTER SA sa
U+1751 BUHID LETTER HA ha
U+1752 BUHID VOWEL SIGN I i
U+1753 BUHID VOWEL SIGN U u

tbw — Tagbanwa

Block: 18 assigned codepoints, 18 mapped by at least one library.

Coverage: translit maps 18/18, Unidecode maps 0/18. 18 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+1760 TAGBANWA LETTER A a
U+1761 TAGBANWA LETTER I i
U+1762 TAGBANWA LETTER U u
U+1763 TAGBANWA LETTER KA ka
U+1764 TAGBANWA LETTER GA ga
U+1765 TAGBANWA LETTER NGA nga
U+1766 TAGBANWA LETTER TA ta
U+1767 TAGBANWA LETTER DA da
U+1768 TAGBANWA LETTER NA na
U+1769 TAGBANWA LETTER PA pa
U+176A TAGBANWA LETTER BA ba
U+176B TAGBANWA LETTER MA ma
U+176C TAGBANWA LETTER YA ya
U+176E TAGBANWA LETTER LA la
U+176F TAGBANWA LETTER WA wa
U+1770 TAGBANWA LETTER SA sa
U+1772 TAGBANWA VOWEL SIGN I i
U+1773 TAGBANWA VOWEL SIGN U u

mni — Meetei Mayek

Block: 79 assigned codepoints, 76 mapped by at least one library.

Coverage: translit maps 73/76, Unidecode maps 0/76. 73 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+ABC0 MEETEI MAYEK LETTER KOK ka
U+ABC1 MEETEI MAYEK LETTER SAM kha
U+ABC2 MEETEI MAYEK LETTER LAI ga
U+ABC3 MEETEI MAYEK LETTER MIT gha
U+ABC4 MEETEI MAYEK LETTER PA nga
U+ABC5 MEETEI MAYEK LETTER NA cha
U+ABC6 MEETEI MAYEK LETTER CHIL chha
U+ABC7 MEETEI MAYEK LETTER TIL ja
U+ABC8 MEETEI MAYEK LETTER KHOU jha
U+ABC9 MEETEI MAYEK LETTER NGOU nya
U+ABCA MEETEI MAYEK LETTER THOU ta
U+ABCB MEETEI MAYEK LETTER WAI tha
U+ABCC MEETEI MAYEK LETTER YANG da
U+ABCD MEETEI MAYEK LETTER HUK dha
U+ABCE MEETEI MAYEK LETTER UN na
U+ABCF MEETEI MAYEK LETTER I ta
U+ABD0 MEETEI MAYEK LETTER PHAM tha
U+ABD1 MEETEI MAYEK LETTER ATIYA da
U+ABD2 MEETEI MAYEK LETTER GOK dha
U+ABD3 MEETEI MAYEK LETTER JHAM na
U+ABD4 MEETEI MAYEK LETTER RAI pa
U+ABD5 MEETEI MAYEK LETTER BA pha
U+ABD6 MEETEI MAYEK LETTER JIL ba
U+ABD7 MEETEI MAYEK LETTER DIL bha
U+ABD8 MEETEI MAYEK LETTER GHOU ma
U+ABD9 MEETEI MAYEK LETTER DHOU ya
U+ABDA MEETEI MAYEK LETTER BHAM ra
U+ABDB MEETEI MAYEK LETTER KOK LONSUM la
U+ABDC MEETEI MAYEK LETTER LAI LONSUM wa
U+ABDD MEETEI MAYEK LETTER MIT LONSUM sha
...43 more

ber — Tifinagh

Block: 59 assigned codepoints, 58 mapped by at least one library.

Coverage: translit maps 58/58, Unidecode maps 0/58. 58 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+2D30 TIFINAGH LETTER YA a
U+2D31 TIFINAGH LETTER YAB b
U+2D32 TIFINAGH LETTER YABH bh
U+2D33 TIFINAGH LETTER YAG g
U+2D34 TIFINAGH LETTER YAGHH ghh
U+2D35 TIFINAGH LETTER BERBER ACADEMY YAJ j
U+2D36 TIFINAGH LETTER YAJ j
U+2D37 TIFINAGH LETTER YAD d
U+2D38 TIFINAGH LETTER YADH dh
U+2D39 TIFINAGH LETTER YADD dd
U+2D3A TIFINAGH LETTER YADDH ddh
U+2D3B TIFINAGH LETTER YEY ey
U+2D3C TIFINAGH LETTER YAF f
U+2D3D TIFINAGH LETTER YAK k
U+2D3E TIFINAGH LETTER TUAREG YAK k
ⴿ U+2D3F TIFINAGH LETTER YAKHH khh
U+2D40 TIFINAGH LETTER YAH h
U+2D41 TIFINAGH LETTER BERBER ACADEMY YAH h
U+2D42 TIFINAGH LETTER TUAREG YAH h
U+2D43 TIFINAGH LETTER YAHH hh
U+2D44 TIFINAGH LETTER YAA a
U+2D45 TIFINAGH LETTER YAKH kh
U+2D46 TIFINAGH LETTER TUAREG YAKH kh
U+2D47 TIFINAGH LETTER YAQ q
U+2D48 TIFINAGH LETTER TUAREG YAQ q
U+2D49 TIFINAGH LETTER YI i
U+2D4A TIFINAGH LETTER YAZH zh
U+2D4B TIFINAGH LETTER AHAGGAR YAZH zh
U+2D4C TIFINAGH LETTER TUAREG YAZH zh
U+2D4D TIFINAGH LETTER YAL l
...28 more

lis — Lisu

Block: 48 assigned codepoints, 48 mapped by at least one library.

Coverage: translit maps 48/48, Unidecode maps 0/48. 48 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+A4D0 LISU LETTER BA ba
U+A4D1 LISU LETTER PA pa
U+A4D2 LISU LETTER PHA pha
U+A4D3 LISU LETTER DA da
U+A4D4 LISU LETTER TA ta
U+A4D5 LISU LETTER THA tha
U+A4D6 LISU LETTER GA ga
U+A4D7 LISU LETTER KA ka
U+A4D8 LISU LETTER KHA kha
U+A4D9 LISU LETTER JA ja
U+A4DA LISU LETTER CA ca
U+A4DB LISU LETTER CHA cha
U+A4DC LISU LETTER DZA dza
U+A4DD LISU LETTER TSA tsa
U+A4DE LISU LETTER TSHA tsha
U+A4DF LISU LETTER MA ma
U+A4E0 LISU LETTER NA na
U+A4E1 LISU LETTER LA la
U+A4E2 LISU LETTER SA sa
U+A4E3 LISU LETTER ZHA zha
U+A4E4 LISU LETTER ZA za
U+A4E5 LISU LETTER NGA nga
U+A4E6 LISU LETTER HA ha
U+A4E7 LISU LETTER XA xa
U+A4E8 LISU LETTER HHA hha
U+A4E9 LISU LETTER FA fa
U+A4EA LISU LETTER WA wa
U+A4EB LISU LETTER SHA sha
U+A4EC LISU LETTER YA ya
U+A4ED LISU LETTER GHA gha
...18 more

sat — Ol Chiki

Block: 48 assigned codepoints, 45 mapped by at least one library.

Coverage: translit maps 43/45, Unidecode maps 0/45. 43 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+1C50 OL CHIKI DIGIT ZERO 0
U+1C51 OL CHIKI DIGIT ONE 1
U+1C52 OL CHIKI DIGIT TWO 2
U+1C53 OL CHIKI DIGIT THREE 3
U+1C54 OL CHIKI DIGIT FOUR 4
U+1C55 OL CHIKI DIGIT FIVE 5
U+1C56 OL CHIKI DIGIT SIX 6
U+1C57 OL CHIKI DIGIT SEVEN 7
U+1C58 OL CHIKI DIGIT EIGHT 8
U+1C59 OL CHIKI DIGIT NINE 9
U+1C5A OL CHIKI LETTER LA la
U+1C5B OL CHIKI LETTER AT at
U+1C5C OL CHIKI LETTER AG ag
U+1C5D OL CHIKI LETTER ANG ang
U+1C5E OL CHIKI LETTER AL al
U+1C5F OL CHIKI LETTER LAA laa
U+1C60 OL CHIKI LETTER AAK aak
U+1C61 OL CHIKI LETTER AAJ aaj
U+1C62 OL CHIKI LETTER AAM aam
U+1C63 OL CHIKI LETTER AAW aaw
U+1C64 OL CHIKI LETTER LI li
U+1C65 OL CHIKI LETTER IS is
U+1C66 OL CHIKI LETTER IH ih
U+1C67 OL CHIKI LETTER INY iny
U+1C68 OL CHIKI LETTER IR ir
U+1C69 OL CHIKI LETTER LU lu
U+1C6A OL CHIKI LETTER UC uc
U+1C6B OL CHIKI LETTER UD ud
U+1C6C OL CHIKI LETTER UNN unn
U+1C6D OL CHIKI LETTER UY unny
...13 more

bax — Bamum

Block: 88 assigned codepoints, 87 mapped by at least one library.

Coverage: translit maps 83/87, Unidecode maps 0/87. 83 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+A6A0 BAMUM LETTER A a
U+A6A1 BAMUM LETTER KA ka
U+A6A2 BAMUM LETTER U u
U+A6A3 BAMUM LETTER KU ku
U+A6A4 BAMUM LETTER EE ee
U+A6A5 BAMUM LETTER REE ree
U+A6A6 BAMUM LETTER TAE tae
U+A6A7 BAMUM LETTER O o
U+A6A8 BAMUM LETTER NYI nyi
U+A6A9 BAMUM LETTER I i
U+A6AA BAMUM LETTER LA la
U+A6AB BAMUM LETTER PA pa
U+A6AC BAMUM LETTER RII rii
U+A6AD BAMUM LETTER RIEE riee
U+A6AE BAMUM LETTER LEEEE leeee
U+A6AF BAMUM LETTER MEEEE meeee
U+A6B0 BAMUM LETTER TAA taa
U+A6B1 BAMUM LETTER NDAA ndaa
U+A6B2 BAMUM LETTER NJAEM njaem
U+A6B3 BAMUM LETTER M m
U+A6B4 BAMUM LETTER SUU suu
U+A6B5 BAMUM LETTER MU mu
U+A6B6 BAMUM LETTER SHII shii
U+A6B7 BAMUM LETTER SI si
U+A6B8 BAMUM LETTER SHEUX sheux
U+A6B9 BAMUM LETTER SEUX seux
U+A6BA BAMUM LETTER KYEE kyee
U+A6BB BAMUM LETTER KET ket
U+A6BC BAMUM LETTER NUAE nuae
U+A6BD BAMUM LETTER NU nu
...53 more

bal — Balinese

Block: 124 assigned codepoints, 114 mapped by at least one library.

Coverage: translit maps 93/114, Unidecode maps 0/114. 93 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
U+1B05 BALINESE LETTER AKARA a
U+1B06 BALINESE LETTER AKARA TEDUNG aa
U+1B07 BALINESE LETTER IKARA i
U+1B08 BALINESE LETTER IKARA TEDUNG ii
U+1B09 BALINESE LETTER UKARA u
U+1B0A BALINESE LETTER UKARA TEDUNG uu
U+1B0B BALINESE LETTER RA REPA r
U+1B0C BALINESE LETTER RA REPA TEDUNG r
U+1B0D BALINESE LETTER LA LENGA l
U+1B0E BALINESE LETTER LA LENGA TEDUNG l
U+1B0F BALINESE LETTER EKARA e
U+1B10 BALINESE LETTER AIKARA ai
U+1B11 BALINESE LETTER OKARA o
U+1B12 BALINESE LETTER OKARA TEDUNG au
U+1B13 BALINESE LETTER KA ka
U+1B14 BALINESE LETTER KA MAHAPRANA kha
U+1B15 BALINESE LETTER GA ga
U+1B16 BALINESE LETTER GA GORA gha
U+1B17 BALINESE LETTER NGA nga
U+1B18 BALINESE LETTER CA cha
U+1B19 BALINESE LETTER CA LACA chha
U+1B1A BALINESE LETTER JA ja
U+1B1B BALINESE LETTER JA JERA jha
U+1B1C BALINESE LETTER NYA nya
U+1B1D BALINESE LETTER TA LATIK tta
U+1B1E BALINESE LETTER TA MURDA MAHAPRANA ttha
U+1B1F BALINESE LETTER DA MURDA ALPAPRANA dda
U+1B20 BALINESE LETTER DA MURDA MAHAPRANA ddha
U+1B21 BALINESE LETTER NA RAMBAT nna
U+1B22 BALINESE LETTER TA ta
...63 more

nko — N'Ko

Block: 62 assigned codepoints, 54 mapped by at least one library.

Coverage: translit maps 50/54, Unidecode maps 0/54. 50 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Char Codepoint Name translit
߀ U+07C0 NKO DIGIT ZERO 0
߁ U+07C1 NKO DIGIT ONE 1
߂ U+07C2 NKO DIGIT TWO 2
߃ U+07C3 NKO DIGIT THREE 3
߄ U+07C4 NKO DIGIT FOUR 4
߅ U+07C5 NKO DIGIT FIVE 5
߆ U+07C6 NKO DIGIT SIX 6
߇ U+07C7 NKO DIGIT SEVEN 7
߈ U+07C8 NKO DIGIT EIGHT 8
߉ U+07C9 NKO DIGIT NINE 9
ߊ U+07CA NKO LETTER A a
ߋ U+07CB NKO LETTER EE ee
ߌ U+07CC NKO LETTER I i
ߍ U+07CD NKO LETTER E e
ߎ U+07CE NKO LETTER U u
ߏ U+07CF NKO LETTER OO oo
ߐ U+07D0 NKO LETTER O o
ߑ U+07D1 NKO LETTER DAGBASINNA da
ߒ U+07D2 NKO LETTER N ba
ߓ U+07D3 NKO LETTER BA ka
ߔ U+07D4 NKO LETTER PA ja
ߕ U+07D5 NKO LETTER TA cha
ߖ U+07D6 NKO LETTER JA ta
ߗ U+07D7 NKO LETTER CHA nya
ߘ U+07D8 NKO LETTER DA na
ߙ U+07D9 NKO LETTER RA ra
ߚ U+07DA NKO LETTER RRA rra
ߛ U+07DB NKO LETTER SA sa
ߜ U+07DC NKO LETTER GBA gba
ߝ U+07DD NKO LETTER FA fa
...20 more

vai — Vai

Block: 300 assigned codepoints, 299 mapped by at least one library.
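
The "assigned codepoints" count can be checked independently with Python's standard `unicodedata` module. The sketch below is an assumption about how the enumeration step might look, not the benchmark's actual code: `assigned_codepoints` is a hypothetical helper, and the Vai block range U+A500..U+A63F is taken from the Unicode block list. The result depends on the Unicode database version bundled with the interpreter.

```python
import unicodedata

# General categories skipped by the methodology: unassigned (Cn),
# private use (Co), surrogate (Cs), and format (Cf).
SKIPPED = {"Cn", "Co", "Cs", "Cf"}


def assigned_codepoints(start, end):
    """Return codepoints in [start, end] whose category is not skipped."""
    return [
        cp for cp in range(start, end + 1)
        if unicodedata.category(chr(cp)) not in SKIPPED
    ]


# Vai block: U+A500..U+A63F (range is an assumption from the Unicode block list).
vai = assigned_codepoints(0xA500, 0xA63F)
print(len(vai))

# Show the first few entries in the same "Codepoint Name" shape as the tables.
for cp in vai[:3]:
    print(f"U+{cp:04X} {unicodedata.name(chr(cp))}")
```

For blocks that have gained characters in recent Unicode versions, the count can differ from the figure reported here depending on the interpreter's Unicode data.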

Coverage: translit maps 286/299, Unidecode maps 0/299. 286 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Codepoint Name translit
U+A500 VAI SYLLABLE EE ee
U+A501 VAI SYLLABLE EEN een
U+A502 VAI SYLLABLE HEE hee
U+A503 VAI SYLLABLE WEE wee
U+A504 VAI SYLLABLE WEEN ween
U+A505 VAI SYLLABLE PEE pee
U+A506 VAI SYLLABLE BHEE bhee
U+A507 VAI SYLLABLE BEE bee
U+A508 VAI SYLLABLE MBEE mbee
U+A509 VAI SYLLABLE KPEE kpee
U+A50A VAI SYLLABLE MGBEE mgbee
U+A50B VAI SYLLABLE GBEE gbee
U+A50C VAI SYLLABLE FEE fee
U+A50D VAI SYLLABLE VEE vee
U+A50E VAI SYLLABLE TEE tee
U+A50F VAI SYLLABLE THEE thee
U+A510 VAI SYLLABLE DHEE dhee
U+A511 VAI SYLLABLE DHHEE dhhee
U+A512 VAI SYLLABLE LEE lee
U+A513 VAI SYLLABLE REE ree
U+A514 VAI SYLLABLE DEE dee
U+A515 VAI SYLLABLE NDEE ndee
U+A516 VAI SYLLABLE SEE see
U+A517 VAI SYLLABLE SHEE shee
U+A518 VAI SYLLABLE ZEE zee
U+A519 VAI SYLLABLE ZHEE zhee
U+A51A VAI SYLLABLE CEE cee
U+A51B VAI SYLLABLE JEE jee
U+A51C VAI SYLLABLE NJEE njee
U+A51D VAI SYLLABLE YEE yee
...256 more

cop — Coptic

Block: 123 assigned codepoints, 121 mapped by at least one library.

Coverage: translit maps 102/121, Unidecode maps 0/121. 102 mapped only by translit, 0 mapped only by Unidecode.

Mapped only by translit (Unidecode returns empty/[?]):

Codepoint Name translit
U+2C80 COPTIC CAPITAL LETTER ALFA a
U+2C81 COPTIC SMALL LETTER ALFA a
U+2C82 COPTIC CAPITAL LETTER VIDA b
U+2C83 COPTIC SMALL LETTER VIDA b
U+2C84 COPTIC CAPITAL LETTER GAMMA g
U+2C85 COPTIC SMALL LETTER GAMMA g
U+2C86 COPTIC CAPITAL LETTER DALDA d
U+2C87 COPTIC SMALL LETTER DALDA d
U+2C88 COPTIC CAPITAL LETTER EIE e
U+2C89 COPTIC SMALL LETTER EIE e
U+2C8A COPTIC CAPITAL LETTER SOU so
U+2C8B COPTIC SMALL LETTER SOU so
U+2C8C COPTIC CAPITAL LETTER ZATA z
U+2C8D COPTIC SMALL LETTER ZATA z
U+2C8E COPTIC CAPITAL LETTER HATE e
U+2C8F COPTIC SMALL LETTER HATE e
U+2C90 COPTIC CAPITAL LETTER THETHE th
U+2C91 COPTIC SMALL LETTER THETHE th
U+2C92 COPTIC CAPITAL LETTER IAUDA i
U+2C93 COPTIC SMALL LETTER IAUDA i
U+2C94 COPTIC CAPITAL LETTER KAPA k
U+2C95 COPTIC SMALL LETTER KAPA k
U+2C96 COPTIC CAPITAL LETTER LAULA l
U+2C97 COPTIC SMALL LETTER LAULA l
U+2C98 COPTIC CAPITAL LETTER MI m
U+2C99 COPTIC SMALL LETTER MI m
U+2C9A COPTIC CAPITAL LETTER NI n
U+2C9B COPTIC SMALL LETTER NI n
U+2C9C COPTIC CAPITAL LETTER KSI ks
U+2C9D COPTIC SMALL LETTER KSI ks
...72 more

Key Takeaways

  • Total assigned codepoints scanned: 50464
  • Mapped by at least one library: 50157
  • translit coverage: 49641/50157 (99.0%)
  • Unidecode coverage: 47408/50157 (94.5%)
  • anyascii coverage: 50085/50157 (99.9%)
  • Characters mapped by translit but not by Unidecode: 2362
  • Characters mapped by Unidecode but not by translit: 129
  • Characters mapped by both translit and Unidecode, with different output: 27034
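
The percentages above follow directly from the raw counts, and the two one-sided counts are pairwise between translit and Unidecode, so their difference should equal the gap between the two coverage figures. A minimal sketch that recomputes both (the variable and function names are illustrative, not taken from the benchmark script):

```python
# Raw counts copied from the Key Takeaways list.
mapped_total = 50157
covered = {"translit": 49641, "Unidecode": 47408, "anyascii": 50085}
translit_not_unidecode = 2362
unidecode_not_translit = 129


def coverage_pct(count, total):
    """Format count/total as a percentage with one decimal place."""
    return f"{100 * count / total:.1f}%"


for name, count in covered.items():
    print(f"{name}: {count}/{mapped_total} ({coverage_pct(count, mapped_total)})")

# Consistency check: the coverage gap between translit and Unidecode equals
# the difference between the two one-sided counts.
gap = covered["translit"] - covered["Unidecode"]
print(gap, translit_not_unidecode - unidecode_not_translit)
```

Both values printed on the last line are 2233, which confirms the one-sided counts compare translit against Unidecode only and do not involve anyascii.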

Generated by benchmarks/diff_vs_unidecode.py