User manual

Conversion options for HTML, XML, or plain text format
By default, images are converted to JPEG format.
Encoding
Refers to the binary values, based on international standards, used to represent the text
characters. UTF-8 is a Unicode representation of characters using one or more 8-bit bytes
per character. UTF-16 is a Unicode representation of characters using one or more 16-bit
bytes per character. ISO-Latin-1 is an 8-bit representation of characters that is a superset
of ASCII. UCS-4 is a Universal Character Set coded in 4 octets. HTML/ASCII is a 7-bit
representation of characters developed by ANSI.
Use Mapping Table Default uses the default character encoding defined in mapping
tables, which appear in the Plug-ins/SaveAsXML/MappingTables folder. These mapping
tables specify many characteristics of how the data is output, including the default
character encoding. These defaults are:
Save as XML: UTF-8
Save as Text: Host encoding, which is defined by the operating system, based on its locale
setting
Save as HTML 3.0: HTML/ASCII
Save as HTML 4.0.1: UTF-8
Generate Bookmarks
Generates bookmark links to content for HTML or XML documents. Links are placed at
the beginning of the resulting HTML or XML document.
Generate Tags For Untagged Files
Generates tags for files that are not already tagged, such as PDF files created using
Acrobat 4.0 or earlier. If this option is not selected, untagged files are not converted.
Note: Tags are applied only as part of the conversion process and are discarded after the
conversion. This is not a method for creating tagged PDF files from legacy files.
Generate Images
Controls how images are converted. Converted image files are referenced from within
XML and HTML documents.
Use Sub-Folder
Specify the name of the folder in which to store generated images. The default is Images.
Use Prefix
You can specify a prefix to be added to the image file names in case you have several
versions of the same image file. File names assigned to images have the format
filename_img_#.
Output Format
The default is JPG.
Downsample To
If you do not select this option, image files have the same resolution as in the source file.
Image files are never upsampled.