-
Notifications
You must be signed in to change notification settings - Fork 10k
Data Files in different versions
Shreeshrii edited this page Jul 3, 2018
·
10 revisions
Lang Code | Language | 3.00 | 3.02 | 3.04 | 4.00alpha |
---|---|---|---|---|---|
afr | Afrikaans | x | |||
amh | Amharic | x | |||
ara | Arabic | x | |||
asm | Assamese | x | |||
aze | Azerbaijani | x | |||
aze_cyrl | Azerbaijani - Cyrilic | x | |||
bel | Belarusian | x | |||
ben | Bengali | x | |||
bod | Tibetan | x | |||
bos | Bosnian | x | |||
bre | Breton | x | |||
bul | Bulgarian | x | |||
cat | Catalan; Valencian | x | |||
ceb | Cebuano | x | |||
ces | Czech | x | |||
chi_sim | Chinese - Simplified | x | |||
chi_tra | Chinese - Traditional | x | |||
chr | Cherokee | x | |||
cym | Welsh | x | |||
dan | Danish | x | |||
deu | German | x | |||
dzo | Dzongkha | x | |||
ell | Greek, Modern (1453-) | x | |||
eng | English | x | x | x | x |
enm | English, Middle (1100-1500) | x | |||
epo | Esperanto | x | |||
equ | Math / equation detection module | x | |||
est | Estonian | x | |||
eus | Basque | x | |||
fas | Persian | x | |||
fin | Finnish | x | |||
fra | French | x | |||
frk | Frankish | x | |||
frm | French, Middle (ca.1400-1600) | x | |||
gle | Irish | x | |||
glg | Galician | x | |||
grc | Greek, Ancient (to 1453) | x | |||
guj | Gujarati | x | |||
hat | Haitian; Haitian Creole | x | |||
heb | Hebrew | x | |||
hin | Hindi | x | |||
hrv | Croatian | x | |||
hun | Hungarian | x | |||
iku | Inuktitut | x | |||
ind | Indonesian | x | |||
isl | Icelandic | x | |||
ita | Italian | x | |||
ita_old | Italian - Old | x | |||
jav | Javanese | x | |||
jpn | Japanese | x | |||
kan | Kannada | x | |||
kat | Georgian | x | |||
kat_old | Georgian - Old | x | |||
kaz | Kazakh | x | |||
khm | Central Khmer | x | |||
kir | Kirghiz; Kyrgyz | x | |||
kor | Korean | x | |||
kor_vert | Korean (vertical) | x | |||
kur | Kurdish | x | |||
kur_ara | Kurdish (Arabic) | x | |||
lao | Lao | x | |||
lat | Latin | x | |||
lav | Latvian | x | |||
lit | Lithuanian | x | |||
ltz | Luxembourgish | x | |||
mal | Malayalam | x | |||
mar | Marathi | x | |||
mkd | Macedonian | x | |||
mlt | Maltese | x | |||
mon | Mongolian | x | |||
mri | Maori | x | |||
msa | Malay | x | |||
mya | Burmese | x | |||
nep | Nepali | x | |||
nld | Dutch; Flemish | x | |||
nor | Norwegian | x | |||
oci | Occitan (post 1500) | x | |||
ori | Oriya | x | |||
osd | Orientation and script detection module | x | x | x | x |
pan | Panjabi; Punjabi | x | |||
pol | Polish | x | |||
por | Portuguese | x | |||
pus | Pushto; Pashto | x | |||
que | Quechua | x | |||
ron | Romanian; Moldavian; Moldovan | x | |||
rus | Russian | x | |||
san | Sanskrit | x | |||
sin | Sinhala; Sinhalese | x | |||
slk | Slovak | x | |||
slv | Slovenian | x | |||
snd | Sindhi | x | |||
spa | Spanish; Castilian | x | |||
spa_old | Spanish; Castilian - Old | x | |||
sqi | Albanian | x | |||
srp | Serbian | x | |||
srp_latn | Serbian - Latin | x | |||
sun | Sundanese | x | |||
swa | Swahili | x | |||
swe | Swedish | x | |||
syr | Syriac | x | |||
tam | Tamil | x | |||
tat | Tatar | x | |||
tel | Telugu | x | |||
tgk | Tajik | x | |||
tgl | Tagalog | x | |||
tha | Thai | x | |||
tir | Tigrinya | x | |||
ton | Tonga | x | |||
tur | Turkish | x | |||
uig | Uighur; Uyghur | x | |||
ukr | Ukrainian | x | |||
urd | Urdu | x | |||
uzb | Uzbek | x | |||
uzb_cyrl | Uzbek - Cyrilic | x | |||
vie | Vietnamese | x | |||
yid | Yiddish | x | |||
yor | Yoruba | x |
Old wiki - no longer maintained. The pages were moved, see the new documentation.
These wiki pages are no longer maintained.
All pages were moved to tesseract-ocr/tessdoc.
The latest documentation is available at https://tesseract-ocr.github.io/.