9.0
Table Of Contents
- Introducing ABBYY FineReader
- What's New in ABBYY FineReader 9.0
- Working with ABBYY FineReader 9.0
- Using ABBYY FineReader 9.0 Step–by–Step
- Converting Paper Documents into Microsoft Word Documents
- Converting Images or PDF Documents into Microsoft Word Documents
- Converting Paper Documents into Microsoft Excel Worksheets
- Scanning Paper Documents to Create PDF Documents
- Converting Digital Photos into Microsoft Word Documents
- Scanning and Saving Images
- Running ABBYY FineReader from Another Program
- Improving OCR Quality
- Taking Into Account Some of the Features of Your Paper Document
- Getting Images
- Tips for Improving OCR Quality
- OCR Options
- Incorrect Font in Recognized Text or Some Characters Are Replaced with"?" or "□"
- Paper Document Contains Decorative (Non–Standard) Fonts
- Complex Structure of Paper Document Not Reproduced in Electronic Document
- Table Not Detected
- Table Cells Detected Incorrectly
- Picture Not Detected
- Barcode Not Detected
- Vertical or Inverted Text Not Recognized Properly
- Adjusting Area Types and Area Borders
- Checking and Editing the Recognized Text
- Saving the Results
- Advanced Features
- Appendix
- How to Buy an ABBYY Product
- Technical Support
ABBYY FineReader 9.0 User’s Guide
code page A table that sets the interrelation between the character codes and the characters themselves. Users can select the
characters they need from the set available in the code page.
color mode A scanning parameter that determines whether an image must be scanned in black and white, grayscale, or color.
compound word A word made up of two or more stems (general meaning); a word not found in the dictionary, but potentially made
up of two or more terms found in the dictionary (ABBYY FineReader meaning).
D
despeckle Delete excess small black dots from an image.
Document Open Password A password which prevents users from opening a PDF document unless they type the password the
author specified.
document options The set of options that can be selected in the Options dialog box (Tools>Options). Options sets also include
user languages and patterns. Options sets can be saved and then used (loaded) in other ABBYY FineReader documents.
document print type A parameter reflecting how the source text was printed (on a laser printer or equivalent, on a matrix printer,
or on a typewriter). For laser–printed texts, the Auto mode should be set; for typewritten texts, the Typewriter mode should be set;
for texts printed on a dot matrix printer, the Dot Matrix Printer mode should be set.
dots per inch (dpi) Standard of measurement for the resolution of images.
driver A software program that controls a computer peripheral (e.g., a scanner, a monitor, etc).
F
font effects The appearance of a font (i.e. bold, italic, underlined, strikethrough, subscript, superscript, small caps).
I
ignored characters Any non–letter characters found in words (e.g. syllable characters or stress marks). These characters are ignored
during the spell check.
inverted image An image with white characters against a dark background.
L
License Manager A utility used for managing ABBYY FineReader licenses and activating ABBYY FineReader 9.0 Corporate Edition.
ligature A combination of two or more "glued" characters (such as fi, fl, ffi). These characters are difficult to separate because they are
usually "glued" in print. Treating them as a single compound character improves OCR accuracy.
M
monospaced font A font (such as Courier New) in which all characters are equally spaced. For better OCR results on monospaced
fonts, select Tools>Options..., click the Document tab, and select Typewriter under Document print type.
O
omnifont system A recognition system that recognizes characters set in any font and font size without prior training.
optional hyphen A hyphen (¬) that indicates exactly where a word or word combination should be split if it occurs at the end of a
line (e.g. "autoformat" should be split into "auto–format"). ABBYY FineReader replaces all hyphens found in dictionary words with
optional hyphens.
P
page layout The arrangement of text, tables, pictures, paragraphs, and columns on a page, as well as fonts, font sizes, font colors, text
background, and text orientation.
page layout analysis
The process of detecting areas on a page image. Areas can be of five types: text, picture, table, barcode, and
recognition area. Page layout analysis can be performed automatically when clicking the Read button, or manually by the user prior to
OCR.
paradigm The set of all grammatical forms of a word.
pattern A set of pairs (each pair contains a character image and the character itself) that is created during pattern training.
PDF security settings Restrictions that can prevent a PDF document from being opened, edited, copied or printed. These settings
include Document Open Passwords, Permissions Passwords, and encryption levels.
Permissions Password A password which prevents other users from printing and editing a PDF document unless they type the
password the author specified. If some security settings are selected for the document, other users will not be able to change these
settings until they type the password the author specified.
picture area An area that is used for image areas that contain pictures. This type of area may enclose an actual picture or any other
object that should be displayed as a picture (e.g. a section of text).
primary form The form of a headword in a dictionary entry.
Product ID The parameter that is automatically generated based on the hardware configuration when activating ABBYY FineReader
on a particular computer.
prohibited characters If certain characters will never be found in recognized text, they may be specified in a set of prohibited
characters in the language group properties. Specifying these characters increases the speed and quality of OCR.
R
resolution A scanning parameter that determines how many dpi to use during scanning. Resolution of 300 dpi should be used for
texts set in 10pt font size and larger, 400 to 600 dpi is preferable for texts of smaller font sizes (9pt and less).
S
scanner A device for inputting images into a computer.
separators Symbols that can separate words (e.g. /, \, dash) and that are separated by spaces from the words themselves.
T
table area An area that is used for table image areas or for areas of text that are structured as a table. When the application reads this
type of area, it draws vertical and horizontal separators inside the area to form a table. This area is the rendered as a table in the output
text.
57










