OCR Processing Step - Supported Languages

The OCR processing step available in Advanced Workflows supports the recognition of the following OCR languages:

  • Languages with dictionary support

    • Latin, Cyrillic, Greek or Armenian characters, for which the FineReader Engine provides dictionary support: Armenian (Eastern, Western, Grabar), Bashkir, Bulgarian, Catalan, Croatian, Czech, Danish, Dutch (Netherlands and Belgium), English, Estonian, Finnish, French, German (new and old spelling), Greek, Hungarian, Italian, Indonesian, Latvian, Lithuanian, Norwegian (Nynorsk and Bokmal), Polish, Portuguese (Portugal and Brazil), Romanian, Russian, Slovak, Slovenian, Spanish, Swedish, Tatar, Turkish, and Ukrainian.

    • Japanese, Korean and Hangul with dictionary support, Chinese (PRC and Taiwan).

    • Japanese Modern (Japanese texts which may include Latin or Greek letters)

    • Thai with dictionary support.

    • Hebrew with dictionary support, Yiddish.

    • Arabic with dictionary support, Farsi.

    • Latin, Azerbaijani (Latin), Russian (old spelling) with dictionary support.

  • Additional languages

    • Latin, Cyrillic, or Greek characters: Abkhaz, Adyghian, Afrikaans, Agul, Albanian, Altaic, Avar, Aymara, Azerbaijani (Cyrillic), Azerbaijani (Latin), Bangla, Basque, Belarusian, Bemba, Blackfoot, Breton, Bugotu, Burmese, Buryat, Cebuano, Chamorro, Chechen, Chukchee, Chuvash, Congo, Corsican, Crimean Tatar, Crow, Dakota, Dargwa, Dungan, Eskimo (Cyrillic), Eskimo (Latin), Even, Evenki, Faeroese, Fijian, Frisian, Friulian, Gagauz, Galician, Ganda, Georgian, German (Luxemburg), Guarani, Hani, Hausa, Hawaiian, Icelandic, Ingush, Irish, Jingpo, Kabardian, Kalmyk, Karachay-balkar, Karakalpak, Kasub, Kawa, Kazakh, Khakass, Khanty, Kikuyu, Kirghiz, Koryak, Kpelle, Kumyk, Kurdish, Lak, Latin, Latvian Gothic, Lezgi, Luba, Macedonian, Malagasy, Malay, Malinke, Maltese, Mansy, Maori, Mari, Maya, Miao, Minangkabau, Mohawk, Moldavian, Mongol, Mordvin, Nahuatl, Nenets, Nivkh, Nogay, Nyanja, Ojibway, Old Slavonic, Ossetian, Papiamento, Provencal, Quechua, Rhaeto-Romanic, Romany, Rundi, Russian (old spelling), Rwanda, Sami (Lappish), Samoan, Scottish Gaelic, Selkup, Serbian (Cyrillic), Serbian (Latin), Shona, Somali, Sorbian, Sotho, Sunda, Swahili, Swazi, Tabasaran, Tagalog, Tahitian, Tajik, Turkmen (Latin), Tok Pisin, Tongan, Tswana, Tun, Turkmen, Tuvinian, Udmurt, Uigur (Cyrillic), Uigur (Latin), Uzbek (Cyrillic), Uzbek (Latin), Vietnamese, Welsh, Wolof, Xhosa, Yakut, Zapotec, Zulu.

  • Artificial languages

    • Esperanto, Interlingua, Ido, and Occidental.

  • Programming languages

    • Basic, C/C++, COBOL, Fortran, JAVA, and Pascal.

  • Simple chemical formulas

  • Mathematical formulas

  • Digits