Chapter 8 - Saving into External Applications and Formats
Note:
1. The Replace uncertain words with images option is available in the Text and pictures only
and Text over the page image modes. If you select this option, all uncertain words are
replaced with their images. Set this option on the PDF tab in the Formats Settings dialog.
2. If you wish to edit recognized text before exporting it to PDF format, we recommend that you
preserve the original line division (i.e. avoid deleting existing lines or adding new ones).
Otherwise the resulting PDF file may be displayed incorrectly (e.g. lines may overlap).
3. When you save texts that use a non-Latin code page (e.g. Cyrillic, Greek, Czech), ABBYY
FineReader saves them using ParaType company fonts (www.paratype.com/shop).
4. If a message appears during PDF export informing you that your text contains a number
of non-standard font characters, select the Type 1 working mode and the corresponding
Type 1 fonts. These fonts are supplied as part of Adobe Type Manager or in the Windows 2000
PostScript font installer. For more information on Type 1 fonts, see "Using Type 1 fonts
during export to PDF" in ABBYY FineReader Help.
5. Before you can edit PDF files that contain text in a non-Latin code page (e.g. Cyrillic, Greek,
Czech) in Adobe Acrobat, you must change the text font to one installed on your computer.
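The recommendation in note 2 can be checked mechanically before export. The following is a minimal Python sketch, not a FineReader feature: it assumes the recognized text is available as plain strings before and after editing, and the function name line_division_preserved is hypothetical. It only compares line counts, so it detects merged, deleted, or added lines but not text moved between lines.

```python
def line_division_preserved(original: str, edited: str) -> bool:
    """Return True if the edited text has the same number of lines as the original."""
    return len(original.splitlines()) == len(edited.splitlines())


# Hypothetical sample texts used only for illustration.
original = "First line\nSecond line\nThird line"
edited_ok = "First line\nSecond line (typo fixed)\nThird line"
edited_bad = "First line Second line\nThird line"  # two lines were merged

print(line_division_preserved(original, edited_ok))   # True
print(line_division_preserved(original, edited_bad))  # False
```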
Layout retention modes are set on the Formatting tab in the Options dialog (Tools>Options menu).
Note: When you save text in HTML format, the fonts used are either those set on the Formatting tab
in the Options dialog (Tools>Options menu) or those set during text editing in the Text window.
To retain pictures in an HTML file:
● Select the Keep pictures option on the Formatting tab in the Options dialog
(Tools>Options menu).
Note: Pictures are saved into separate *.jpg files. The resolution and quality of the images can be
set on the HTML tab of the Formats dialog (Tools>Formats).
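Because pictures are stored in separate *.jpg files, an exported HTML page must be moved or published together with those files. The following is a minimal Python sketch, using only the standard library, that lists the picture files an HTML page refers to. The class and function names and the sample file names are hypothetical; the actual names FineReader gives to picture files are not stated here.

```python
from html.parser import HTMLParser
from typing import List


class PictureCollector(HTMLParser):
    """Collect the src attribute of every <img> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.pictures: List[str] = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        if tag == "img":
            for name, value in attrs:
                if name == "src" and value:
                    self.pictures.append(value)


def referenced_pictures(html_text: str) -> List[str]:
    """Return the picture paths referenced by the given HTML text, in order."""
    parser = PictureCollector()
    parser.feed(html_text)
    parser.close()
    return parser.pictures


# Hypothetical exported page with two pictures.
sample = '<html><body><p>Text</p><img src="pic1.jpg"><IMG SRC="pic2.jpg"></body></html>'
print(referenced_pictures(sample))  # ['pic1.jpg', 'pic2.jpg']
```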
HTML formats available:
1. Full (uses CSS and requires Internet Explorer 4.0 or later) - HTML 4, the latest HTML
format, is used. HTML 4 supports all document layout retention types (the actual retention
type used depends on the options set in the Retain layout group on the Formatting tab).
The built-in style sheet is used.
2. Simple (compatible with all Internet browsers) - HTML 3 format is used. Only the
approximate document layout is retained: the first line indent is not retained, but the
approximate font size is (HTML 3 format supports only a limited number of font sizes;
FineReader will choose the HTML 3 format font size that corresponds to the actual font
size of your text). This HTML format is supported by all browsers (Netscape Navigator,
Internet Explorer 3.0 and later).
3. Auto (saves Full and Simple formats in a single file with autoselection depending on
browser type) - both formats (Simple and Full) are saved to the same file. The browser you
use determines which format is displayed.
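Two behaviours described in the list above can be illustrated with a short Python sketch. It is not FineReader's implementation, and both functions rest on assumptions: nearest_html3_size assumes the conventional browser mapping of the seven HTML font sizes to 8, 10, 12, 14, 18, 24 and 36 points, and choose_variant assumes that the browser is identified by its user-agent string, which the text above does not specify. All names are hypothetical.

```python
import re

# Assumption: conventional point sizes for <font size="1"> .. <font size="7">.
HTML3_SIZES_PT = {1: 8, 2: 10, 3: 12, 4: 14, 5: 18, 6: 24, 7: 36}


def nearest_html3_size(point_size: float) -> int:
    """Return the HTML font size (1-7) whose nominal point size is closest.

    On a tie, the smaller HTML size wins because min() keeps the first minimum.
    """
    return min(HTML3_SIZES_PT, key=lambda size: abs(HTML3_SIZES_PT[size] - point_size))


def choose_variant(user_agent: str) -> str:
    """Return "Full" for Internet Explorer 4.0 or later, otherwise "Simple"."""
    match = re.search(r"MSIE (\d+)", user_agent)
    if match and int(match.group(1)) >= 4:
        return "Full"
    return "Simple"


print(nearest_html3_size(9.5))  # 2
print(nearest_html3_size(20))   # 5
print(choose_variant("Mozilla/4.0 (compatible; MSIE 5.0; Windows 98)"))    # Full
print(choose_variant("Mozilla/2.0 (compatible; MSIE 3.02; Windows 95)"))   # Simple
```

Falling back to Simple for any browser that is not recognized matches the description of Simple as the format supported by all browsers.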
Saving Recognized Text in HTML Format










