Operation Manual

Chapter 7 – Character Recognition
56
Language
In order to recognize documents, the document language must be
specified. Based on the language selection, the software knows
which symbol sets to recognize.
Select the language of your choice in the Language drop-down list.
IRISDocument supports up to 137 languages. IRISDocument can
optionally recognize four Asian languages (Traditional and Simplified
Chinese, Japanese and Korean), Arabic and Farsi, and Hebrew.
IRISDocument also recognizes barcodes and banking fonts. Refer to
the appendices to the User Guide for more information on barcodes
and banking fonts.
Note that the character recognition can also be limited to numeric
digits.
Secondary languages
Next to the primary language, IRISDocument allows you to select
up to 4 secondary languages.
This way, IRISDocument uses mixed character sets, enabling it to
recognize Western words that occur in Greek, Cyrillic and
optionally Asian, Arabic or Hebrew documents.
Select the required secondary languages in the list.
Note that if you select multiple secondary languages they must be of
the same language group. Languages that do not belong to the same
group will be disabled automatically.
Do not select languages that do not apply: the bigger the character set,
the slower the recognition and the higher the risk of OCR errors.
Character pitch
The character pitch is the number of characters per inch in a
typeface.