11.0
Table Of Contents
- Introducing ABBYY FineReader
- The ABBYY FineReader 11 Interface
- Working with ABBYY FineReader
- ABBYY FineReader Tasks
- Managing Automated Tasks
- ABBYY FineReader Step–by–Step
- Splitting an ABBYY FineReader Document
- Taking Into Account Some of the Features of Your Paper Document
- Image Acquisition Tips
- Scanning Tips
- Taking Photos of Documents
- Camera Requirements
- Lighting
- Taking Photos
- When you need to take another photo
- Automatic Image Preprocessing
- Editing Images Manually
- OCR Options
- If the complex structure of a paper document is not reproduced in the electronic document
- Adjusting Area Shapes and Area Borders
- Picture Not Detected
- Barcode Not Detected
- Table Not Detected
- Table Cells Detected Incorrectly
- Adjusting Text Area Properties
- Vertical or Inverted Text Not Recognized Properly
- Paper Document Contains Decorative (Non–Standard) Fonts
- Incorrect Font in Recognized Text or Some Characters Are Replaced with "?" or "□"
- Checking and Editing the Recognized Text
- Working with Complex–Script Languages
- Recommended Fonts
- Saving the Results
- Advanced Features
- Appendix
- Font
- Language
- How to Buy an ABBYY Product
- Activating and Registering ABBYY FineReader
- Technical Support
ABBYY FineReader 11 User’s Guide
Ligature is a combination of two or more characters which are "glued together" (e.g. fi, fl, ffi).
These characters are difficult for ABBYY FineReader to separate. Treating them as a single
compound character improves OCR accuracy.
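As a rough illustration of why a ligature counts as one compound character, the sketch below uses the Unicode presentation-form ligatures (single code points for fi, fl, and ffi) and expands them into their component letters with Python's standard unicodedata module. This only illustrates the concept; it is not a description of how ABBYY FineReader processes ligatures internally, and the helper name expand_ligatures is made up for this example.

```python
import unicodedata

# Unicode presentation-form ligatures: each one is a single code point.
LIGATURES = ["\ufb01", "\ufb02", "\ufb03"]  # fi, fl, ffi


def expand_ligatures(text: str) -> str:
    """Replace compatibility ligatures with their component letters.

    NFKC normalization also folds other compatibility characters,
    so apply it only where that side effect is acceptable.
    """
    return unicodedata.normalize("NFKC", text)


if __name__ == "__main__":
    for lig in LIGATURES:
        expanded = expand_ligatures(lig)
        # One glyph in, several plain letters out.
        print(len(lig), "code point ->", len(expanded), "letters:", expanded)
```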
M
Monospaced font is a font (such as Courier New) in which all characters are equally spaced. For
better OCR results on monospaced fonts, select Tools>Options..., click the Document tab, and
select Typewriter under Document print type.
O
Omnifont system is a recognition system that recognizes characters set in any font and font size
without prior training.
Optional hyphen is a hyphen (¬) that indicates exactly where a word or word combination should
be split if it occurs at the end of a line (e.g. "autoformat" should be split into "auto–format").
ABBYY FineReader replaces all hyphens found in dictionary words with optional hyphens.
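The behavior of an optional hyphen can be sketched in a few lines of Python. The sketch assumes that the optional hyphen corresponds to the Unicode SOFT HYPHEN character (U+00AD); the guide does not state which character ABBYY FineReader writes to its output, and the function names here are hypothetical.

```python
from typing import Tuple

# Assumption: the optional hyphen is modeled as Unicode SOFT HYPHEN (U+00AD).
SOFT_HYPHEN = "\u00ad"


def add_optional_hyphen(word: str, position: int) -> str:
    """Mark the permitted split point inside a word."""
    return word[:position] + SOFT_HYPHEN + word[position:]


def break_at_line_end(word: str) -> Tuple[str, str]:
    """Split a word at its first optional hyphen, as at the end of a line."""
    head, sep, tail = word.partition(SOFT_HYPHEN)
    if not sep:
        return word, ""  # no split point marked, keep the word whole
    return head + "-", tail.replace(SOFT_HYPHEN, "")


def strip_optional_hyphens(word: str) -> str:
    """Recover the plain word, as shown when no line break is needed."""
    return word.replace(SOFT_HYPHEN, "")


if __name__ == "__main__":
    marked = add_optional_hyphen("autoformat", 4)
    print(break_at_line_end(marked))
    print(strip_optional_hyphens(marked))
```

The marked word stays invisible in running text and only shows a hyphen when the split point is actually used at a line end.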
P
Page layout is the arrangement of text, tables, pictures, paragraphs, and columns on a page. The
fonts, font sizes, font colors, text background, and text orientation are also part of the page layout.
Page layout analysis is the process of detecting areas on a page image. Areas can be of six
types: text, picture, table, barcode, background picture, and recognition area. Page layout analysis
is performed automatically when the user clicks the Read button, or it can be performed manually
by the user prior to OCR.
Paradigm is the set of all grammatical forms of a word.
Pattern is a set of pairs of type "character image – actual character."
PDF security settings are restrictions that prevent a PDF document from being opened, edited,
copied or printed. These settings include Document Open Passwords, Permissions Passwords, and
encryption levels.
Permissions Password is a password which prevents other users from printing and editing a PDF
document unless they type the password specified by the author. If some security settings are
selected for the document, other users will not be able to change these settings until they type the
password.
Picture area is an image area that contains a picture. This type of area may enclose an actual
picture or any other object that should be displayed as a picture (e.g. a section of text).
Primary form is the "dictionary" form of a word (headwords of dictionary entries are usually given
in their primary forms).
Print type is a parameter reflecting how the source text was printed (on a laser printer or a similar
device, on a typewriter, etc.). For laser–printed texts, select Auto; for typewritten texts, select
Typewriter; for faxes, select Fax.
Product ID is a parameter that is automatically generated on the basis of the hardware
configuration when activating ABBYY FineReader on a given computer.
Prohibited characters are characters that will never occur in a text to be recognized. They
may be included in a list of prohibited characters. Specifying prohibited characters increases the
speed and quality of OCR.
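One way to see why a list of prohibited characters can help is the toy sketch below, which removes prohibited characters from a set of candidate readings before choosing the best one. The candidate scores are invented for illustration, the function pick_character is hypothetical, and nothing here reflects how ABBYY FineReader actually ranks characters.

```python
from typing import Dict, Set


def pick_character(candidates: Dict[str, float], prohibited: Set[str]) -> str:
    """Return the highest-scoring candidate that is not prohibited."""
    allowed = {ch: score for ch, score in candidates.items()
               if ch not in prohibited}
    if not allowed:
        raise ValueError("every candidate character is prohibited")
    return max(allowed, key=allowed.get)


if __name__ == "__main__":
    # Hypothetical scores for one ambiguous glyph: digit zero vs. letter O.
    scores = {"0": 0.48, "O": 0.47, "o": 0.05}

    # Without restrictions the digit narrowly wins.
    print(pick_character(scores, prohibited=set()))

    # If digits can never occur in the text, prohibiting "0" removes the ambiguity.
    print(pick_character(scores, prohibited={"0"}))
```

Fewer candidates means fewer comparisons and fewer chances of picking a look-alike character, which matches the speed and quality benefit described above.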
R










