9.0
Table Of Contents
- Introducing ABBYY FineReader
- What's New in ABBYY FineReader 9.0
- Working with ABBYY FineReader 9.0
- Using ABBYY FineReader 9.0 Step–by–Step
- Converting Paper Documents into Microsoft Word Documents
- Converting Images or PDF Documents into Microsoft Word Documents
- Converting Paper Documents into Microsoft Excel Worksheets
- Scanning Paper Documents to Create PDF Documents
- Converting Digital Photos into Microsoft Word Documents
- Scanning and Saving Images
- Running ABBYY FineReader from Another Program
- Improving OCR Quality
- Taking Into Account Some of the Features of Your Paper Document
- Getting Images
- Tips for Improving OCR Quality
- OCR Options
- Incorrect Font in Recognized Text or Some Characters Are Replaced with"?" or "□"
- Paper Document Contains Decorative (Non–Standard) Fonts
- Complex Structure of Paper Document Not Reproduced in Electronic Document
- Table Not Detected
- Table Cells Detected Incorrectly
- Picture Not Detected
- Barcode Not Detected
- Vertical or Inverted Text Not Recognized Properly
- Adjusting Area Types and Area Borders
- Checking and Editing the Recognized Text
- Saving the Results
- Advanced Features
- Appendix
- How to Buy an ABBYY Product
- Technical Support
ABBYY FineReader 9.0 User’s Guide
Capital Cyrillic letter
[А–Я]
Small Cyrillic letter
[а–я]
Digit [0–9]
Space \s
@ Reserved.
Note:
1. To use a regular expression symbol as a normal character, precede it with a backslash. For example, [t–v]x+ stands for tx, txx, txx,
etc., ux, uxx, etc., but \[t–v\]x+ stands for [t–v]x, [t–v]xx, [t–v]xxx, etc.
2. To group regular expression elements, use brackets. For example, (a|b)+|c stands for c or any combinations like abbbaaabbb,
ababab, etc. (a word of any non–zero length in which there may be any number of a's and b's in any order), while a|b+|c stands for
a, c, and b, bb, bbb, etc.
Examples
Regular expression for dates:
The number denoting a day may consist of one digit (1, 2, etc.) or two digits (02, 12), but it cannot be zero (00 or 0). The regular
expression for the day should then look like this: ((|0)[1–9])|([1|2][0–9])|(30)|(31).
The regular expression for the month should look like this: ((|0)[1–9])|(10)|(11)|(12).
The regular expression for the year should look like this: ([19][0–9][0–9]|([0–9][0–9])|([20][0–9][0–9]|([0–9][0–9]).
What is left is to combine all this together and separate the numbers by period (like 1.03.1999). The period is a regular expression
symbol, so you must put a backslash (\) before it. The regular expression for the full date should then look like this:
((|0)[1–9])|([1|2][0–9])|(30)|(31)\.((|0)[1–9])|(10)|(11)|(12)\.((19)[0–9][0–9])|([0–9][0–9])|([20][0–9][0–9]|([0–9][0–9])
Regular expression for the e–mail addresses:
[a–zA–Z0–9_\–\.]+\@[a–z0–9\.\–]+
Glossary
A
ABBYY FineReader document A folder where document images and service files are stored. An ABBYY FineReader document may
contain up to 9,999 pages and groups together images of paper documents that are logically connected (e.g. pages from the same
book).
ABBYY Hot Folder & Scheduling A scheduling agent which allows you to select a folder with images and set the time for processing
images in this folder. The images from the selected folder will be processed automatically at the specified time.
ABBYY Screenshot Reader An application to create screenshots and recognize texts in them.
abbreviation A shortened form of a word or phrase used to represent the whole. For example, MS–DOS (for Microsoft Disk
Operating System), UN (for United Nations), etc.
activation The process of obtaining a special code from ABBYY which allows the user to use his/her copy of the software in full
mode on a given computer.
activation code A code that is issued by ABBYY to each user of ABBYY FineReader 9.0 Professional Edition during the activation
procedure. The activation code is required to activate ABBYY FineReader on the computer that generated the Product ID.
activation file A file issued by ABBYY to each user of ABBYY FineReader 9.0 Corporate Edition during the activation procedure. The
activation file contains information required to activate ABBYY FineReader on the server or on a standalone computer as the case may
be. From the server, the product will be activated on workstations.
active area a selected area on an image that can be deleted, moved or modified. To make an area active, click it. The frame enclosing
an active area is bold and has small squares that can be dragged to change the size of the area.
Automatic Document Feeder (ADF) A device that automatically feeds documents to a scanner. A scanner with an ADF can scan
multiple pages without manual intervention. ABBYY FineReader also supports scanning multi–page documents.
area A section on an image enclosed by a frame. Before performing OCR, ABBYY FineReader detects text, picture, table, and barcode
areas in order to determine which sections of the image should be recognized and in what order.
area template A template that contains information about the size and location of areas within a set of similar–looking documents.
Automation Manager A built–in manager which allows you to run an automated task, create and modify automated tasks, and
delete custom automated tasks which you no longer use.
B
barcode area An area that is used for barcode image areas.
brightness A scanning parameter that indicates the contrast between black and white image areas. Setting the correct brightness
increases recognition quality.
C
56










