16.0
Table Of Contents
- Introducing ABBYY FineReader
- The New Task window
- PDF Editor
- OCR Editor
- Launching the OCR Editor
- OCR Editor interface
- Obtaining documents
- Recognizing documents
- Improving OCR results
- If your document image has defects and OCR accuracy is low
- If areas are detected incorrectly
- If the complex structure of a paper document is not reproduced
- If you are processing a large number of documents with identical layouts
- If tables and pictures are not detected
- If a barcode is not detected
- If an incorrect font is used or some characters are replaced with "?" or "□"
- If your printed document contains non-standard fonts
- If your document contains many specialized terms
- If the program fails to recognize certain characters
- If vertical or inverted text was not recognized
- Checking and editing texts
- Copying content from documents
- Saving OCR results
- Integration with other applications
- Automating and scheduling OCR
- ABBYY Compare Documents
- ABBYY Screenshot Reader
- Reference
- How to set ABBYY FineReader PDF 16 as your default PDF viewer
- Types of PDF documents
- Scanning tips
- Taking photos of documents
- Options dialog box
- Format settings
- Supported OCR and document comparison languages
- Supported document formats
- Document features to consider prior to OCR
- Image processing options
- OCR options
- Working with complex-script languages
- Recognition of text written using a Gothic script
- Supported interface languages
- Current date and time on stamps and in headers and footers
- Fonts required for the correct display of texts in supported languages
- Regular expressions
- Using the command line
- Installing, activating, and registering ABBYY FineReader PDF 16
- Appendix
- Technical support
- Third-party software
349
ABBYY® FineReader PDF User’s Guide
Chukchee
Arial Unicode MS(*) , Lucida Sans Unicode
Yakut
Arial Unicode MS(*)
Japanese
Arial Unicode MS(*) , SimSun fonts
Example SimSun (Founder Extended), SimSun-18030,
NSimSun.
Simhei, YouYuan, PMingLiU, MingLiU, Ming(for-
ISO10646), STSong
Where to find/supplied with
(*) Microsoft Office 2000 or later
Regular expressions
The table below lists the regular expressions that can be used to create a dictionary for a custom
language .
Item name
Conventional
regular
expression
symbol
Usage examples and explanations
Any character
.
c.t— denotes "cat," "cot," etc.
Character from
group
[]
[b-d]ell— denotes "bell," "cell," "dell," etc.; [ty]ell— denotes
"tell" and "yell"
Character not from
group
[^]
[^y]ell— denotes "dell," "cell," "tell," but forbids "yell”; [^n-s]ell
— denotes "bell," "cell," but forbids "nell," "oell," "pell," "qell,"
"rell," and "sell"
Or
|
c(a|u)t— denotes "cat" and "cut"
0 or more matches
*
10*— denotes numbers 1, 10, 100, 1000, etc.
1 or more matches
+
10+— allows numbers 10, 100, 1000, etc.
Letter or digit
[0-9a-zA-Zа-
яА-Я]
[0-9a-zA-Zа-яА-Я]— allows any single character; [0-9a-zA-Zа-
яА-Я]+— allows any word
Capital Latin letter
[A-Z]
Small Latin letter
[a-z]
349
349
349
221










