ABBYY FineReader 11 User’s Guide
User Languages and Language Groups
You can create and use your own recognition languages and language groups in addition to the
languages and groups included in the program.
Creating an OCR Language
Use this to process texts that contain many uncommon elements, such as code numbers.
Creating a Language Group
Use this to recognize documents written in several languages that are not covered by the
predefined language combinations.
Creating an OCR Language
When performing OCR on a document, ABBYY FineReader uses information about the
language of the document (you select this language from the Document Languages drop-down
list in the main window). If the text contains too many unusual abbreviations or words,
the program may fail to recognize them correctly. In this case, you may wish to create your
own recognition language for the document.
1. On the Tools menu, click Language Editor
2. In the Language Editor dialog box, click New…
3. In the New Language or Group dialog box, select Create a new language based on an
existing one, and then select the desired language from the drop-down list below. Click OK.
4. In the Language Properties dialog box, specify the properties of the new OCR language.
a. Language name: Type a name for your OCR language in this field.
b. Source language: The language on which your new OCR language will be based.
(Displays the language you selected in the New Language or Group dialog box. Click the
arrow to the right to select a different language.)
c. Alphabet: Lists the characters of the alphabet of the source language. Click the button
to add or remove characters.
d. Dictionary: The dictionary that ABBYY FineReader will use to perform OCR on your
document and to check the recognized text. The following options are available:
None
No dictionary will be used.
Built-in dictionary
The dictionary supplied with ABBYY FineReader will be used.
User dictionary
A user dictionary will be used. Click the Edit… button to add words to the dictionary
or to import an existing user dictionary or text file in Windows (ANSI) or Unicode
encoding. The words in the text file must be separated by spaces or other non-alphabetic
characters.
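To check a word list before importing it, the separation rule above can be sketched in Python. This is only an illustration of the stated rule (words are runs of letters; anything else is a separator), not ABBYY FineReader's own import logic. The function names and the default encoding are assumptions made for this example.

```python
from itertools import groupby


def split_word_list(text: str) -> list[str]:
    """Split text into words, treating every non-alphabetic character
    (spaces, digits, punctuation, line breaks) as a separator."""
    return [
        "".join(chars)
        for is_alpha, chars in groupby(text, key=str.isalpha)
        if is_alpha
    ]


def read_word_list(path: str, encoding: str = "utf-8") -> list[str]:
    """Read a word-list file and split it into words.

    The encoding is an assumption: a Windows (ANSI) file might need
    "cp1252", and a Unicode file might need "utf-16".
    """
    with open(path, encoding=encoding) as handle:
        return split_word_list(handle.read())
```

Under this rule a hyphenated entry such as "OCR-123" yields only the word "OCR", so it is worth previewing the list to see whether that is what you intend.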
Note: The words from the user dictionary may occur in texts in the following
capitalizations: 1) lowercase only, 2) uppercase only, 3) first letter
capitalized, 4) as spelt in the user dictionary. The four possibilities are
summed up in the table below.
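The four capitalizations listed in the note can also be sketched in Python. This is an illustration under assumptions, not the program's actual matching code: the function name is hypothetical, and "first letter capitalized" is assumed to mean that the remaining letters are lowercase.

```python
def possible_forms(word: str) -> list[str]:
    """Return the capitalizations in which a user-dictionary word may
    occur in texts, without duplicates."""
    candidates = [
        word.lower(),                         # 1) lowercase only
        word.upper(),                         # 2) uppercase only
        word[:1].upper() + word[1:].lower(),  # 3) first letter capitalized (assumed: rest lowercase)
        word,                                 # 4) as spelt in the user dictionary
    ]
    forms = []
    for candidate in candidates:
        if candidate not in forms:
            forms.append(candidate)
    return forms
```

With this sketch, a mixed-case entry such as "aBc" produces four distinct forms, while an all-lowercase entry such as "abc" produces three, because its dictionary spelling coincides with the lowercase form.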
Word as spelt in the user dictionary
Possible forms of the word in texts