Operation Manual

Chapter 4 – Input Source
IRISDocument does not delete the images from the image folder
by default, but you can choose to delete them after recognition.
It is recommended to use a local hard disk as the image folder, rather
than a network volume: recognition is faster, and bandwidth and
network issues are avoided.
Options
By default, IRISDocument loads PDF and DjVu documents in
color. To load these documents in black-and-white and reduce
the processing time, clear this option.
When you process many PDF documents from different sources,
some of which already contain text, and you want to generate
PDF Image-Text output documents, it is useful to select the
option Don't modify PDF containing text. When this option is
selected, IRISDocument Server does not convert the PDF files
that already contain text into PDF Image-Text files. Instead,
it simply copies the original files to the output folder, which
avoids unnecessary processing. The option is therefore useful if
your batches of input documents contain PDF files whose content
(text, images, or both) you do not know. IRISDocument considers
a PDF file to contain text when there are more than 10 characters
per page.
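The decision described above can be illustrated with a minimal Python sketch. It is not IRISDocument's implementation: the function names, the return values and the idea of passing in per-page character counts are hypothetical. In particular, the sketch assumes the threshold must be exceeded on every page, which the manual does not specify.

```python
import shutil
from pathlib import Path

# Threshold stated in the manual: "more than 10 characters per page".
MIN_CHARS_PER_PAGE = 10


def contains_text(chars_per_page):
    """Return True if every page has more than MIN_CHARS_PER_PAGE characters.

    Assumption: the check is applied to each page individually. The manual
    does not say how documents mixing text pages and image pages are treated.
    """
    if not chars_per_page:
        return False
    return all(count > MIN_CHARS_PER_PAGE for count in chars_per_page)


def route_pdf(pdf_path, chars_per_page, output_dir):
    """Copy a PDF that already contains text; otherwise flag it for recognition.

    chars_per_page is a list of character counts, one per page, obtained
    with whatever PDF text extraction tool is available (hypothetical input).
    """
    if contains_text(chars_per_page):
        target = Path(output_dir) / Path(pdf_path).name
        shutil.copy2(pdf_path, target)  # original file is copied unchanged
        return "copied"
    return "needs_recognition"
```

For example, under these assumptions a two-page PDF with 120 and 80 characters would be copied to the output folder as is, while one with 0 and 3 characters would be sent through recognition.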
Important note: the option Don't modify PDF containing text is not
compatible with options that require modifying the input PDF (e.g.
password protection, document structuring). For this reason, the
following parameters must be configured:
The selected output format must be PDF Image-Text (in the
Output section, on the PDF tab)
In the Processing section, on the Barcode Reading tab:
Enable barcode reading must not be selected (not selected
by default).