2.0

Table Of Contents
ABBYY Recognition Server 2.0 System Administrator’s Guide
COMbased API
This API allows you to integrate Recognition Server with other applications in the local area network. The COMbased
API component should be installed on the computer on which the client application will run. It can also be distributed
together with the client application. See more information about the distribution of the COMbased API in the Open API
Help file.
Web Service
This component provides a Web Service API for integrating the Recognition Server with remote applications over the
HTTP protocol.
All the Recognition Server components can be installed on the same or on different computers in the local area network.
Workflows and Jobs
A workflow is a set of processing parameters that are to be applied to image files. Recognition Server can handle several
workflows with different parameters. Each workflow corresponds to a unique input source (a folder or a mailbox). All
the images that appear in the source folder/mailbox are processed using the parameters of the corresponding workflow.
A job is a processing unit. A job contains an image (or a set of images) and its processing parameters. A job is created
when Recognition Server imports images for processing. Usually a job contains one image file and inherits the processing
parameters of the workflow in which it was created. However, with the help of the API or XML tickets it is possible to
create a job that will contain several image files and will have parameters different from the workflow settings. (See the
Integration Tools section for more information.)
Interaction of Components and Load Balancing
When image files are submitted to ABBYY Recognition Server 2.0, the Server Manager creates jobs for them and queues
them for processing. The first job in the queue will be sent to the first available Processing Station for processing. If there
are several Processing Stations in the system, the Server Manager will evenly distribute the jobs from the queue among
these Processing Stations.
A Processing Station can run several OCR processes (their number can be adjusted in the Remote Administration
Console). For optimal performance, the recommended number of processes for a station is N+1, where N is the number
of CPU cores on the station. Usually each OCR process gets one file at a time. For example, if a Processing Station runs
two OCR processes, it will recognize two files in parallel (they can belong to the same job or to different jobs). However,
if the file has many pages (e.g. several dozen) and there are no more than 5 jobs waiting in the queue, the big file will be
split into several chunks, and the chunks will be sent to different OCR processes, in order to get the work done faster.
When the Processing Station has finished processing the file, it returns the recognized file to the Server Manager and is
assigned the next job from the queue.
If verification is turned on in the workflow settings, pages that require verification will be queued for verification after
recognition. If there are Verification Stations connected, the Server Manager will route the queued pages to those
stations. Each Verification Station gets one page at a time. If no Verification Stations are currently connected, or the users
logged on the stations are not permitted to verify pages from this workflow, the pages will wait in the queue with
“Queued for verification” status. They will not be exported until they are verified.
After all the pages from the document are recognized and (if necessary) verified, the Server Manager will deliver the
output document to the destination specified in the job settings.
If document separation is enabled in the workflow settings, the pages in the job queue will be rearranged into
documents before export according to the separation rule. In this case, each job shown in the Job Log will correspond to
one output document and will list all the files that form the output document.
If several workflows are set up, ABBYY Recognition Server will process jobs from all the workflows simultaneously,
within the single queue. The jobs will be arranged in the queue according to their creation time and priorities.
Handling Documents
When image documents get in the Input Folder or mailbox, they are imported by the Server Manager and transferred to
the Images subfolder of the ABBYY Recognition Server 2.0 temporary folder. The path to the Server Manager temporary
6