2019.2

• Ignore unparseable lines: Ignores any line that does not correspond to the settings
  above.
• Skip empty lines: Ignore any line that has no content. Note that spaces are considered
  content.
• Sort on: Select a field on which to sort the data, in ascending (A-Z) or descending
  (Z-A) order. Note that sorting is always textual. Even if the selected column contains
  numbers, it will be sorted as text.
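The effect of textual sorting can be illustrated with a short Python sketch. It is not part of the software, and the sample values are made up for illustration. It only shows how text comparison orders values that look like numbers.

```python
# Hypothetical values from a column that happens to contain numbers.
values = ["10", "9", "100", "2"]

# Textual sort: strings are compared character by character,
# so "100" comes before "2" because "1" sorts before "2".
textual = sorted(values)

# Numeric sort, shown only for contrast with the textual result.
numeric = sorted(values, key=int)

# Zero-padding to a fixed width makes textual order match numeric order
# (for non-negative whole numbers).
padded = sorted(v.zfill(3) for v in values)

print(textual)  # ['10', '100', '2', '9']
print(numeric)  # ['2', '9', '10', '100']
print(padded)   # ['002', '009', '010', '100']
```

If numeric order matters, padding the values with leading zeros in the source data is one way to make a textual sort produce the expected order.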
Excel file Input Data settings
There are no settings for field separation in an Excel file, only settings that apply to the file
as a whole.
• Lines to skip: Defines a number of lines in the Excel file that will be skipped and not
  used as records.
• First row contains field names: Check this option to use the first line of the Excel file as
  headers. This option automatically names all extracted fields.
• Sheet: Only one sheet can be selected as the data source.
• Skip empty lines: Ignore any line that has no content. Note that spaces are considered
  content.
• Sort on: Select a field on which to sort the data, in ascending (A-Z) or descending
  (Z-A) order. Note that sorting is always textual. Even if the selected column contains
  numbers, it will be sorted as text.
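The following Python sketch shows how the Lines to skip, First row contains field names and Skip empty lines settings could interact on the rows of a single sheet. It is an illustration only, not the software's implementation. The function name, the order in which the settings are applied, and the fallback field names (Field1, Field2, ...) are assumptions made for the example.

```python
def extract_records(rows, lines_to_skip=0, first_row_has_names=True, skip_empty=True):
    """Turn rows (lists of cell strings) from one sheet into records.

    Hypothetical helper: the order of the steps is an assumption.
    """
    # Lines to skip: drop the given number of leading rows.
    rows = rows[lines_to_skip:]

    if skip_empty:
        # A row is empty only if every cell is an empty string.
        # A cell that holds spaces counts as content, so that row is kept.
        rows = [row for row in rows if any(cell != "" for cell in row)]

    if not rows:
        return []

    if first_row_has_names:
        # First row contains field names: use it as headers.
        names, data = rows[0], rows[1:]
    else:
        # Assumed fallback names, for illustration only.
        names = [f"Field{i + 1}" for i in range(len(rows[0]))]
        data = rows

    return [dict(zip(names, row)) for row in data]


sheet = [
    ["Exported report", "", ""],  # skipped by lines_to_skip=1
    ["ID", "Name", "City"],       # used as field names
    ["1", "Ann", "Oslo"],
    ["", "", ""],                 # empty: skipped
    [" ", "", ""],                # contains a space: kept
    ["2", "Bob", "Rome"],
]

records = extract_records(sheet, lines_to_skip=1)
print(len(records))  # 3
```

The row that contains only a space is kept as a record, which matches the note that spaces are considered content.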
PDF file Input Data settings
PDF files have a natural, static delimiter in the form of pages, so the options here are
interpretation settings for text in the PDF file.
The Input Data settings for PDF files determine how words, lines and paragraphs are detected
in the PDF when creating data selections.
Each value represents a fraction of the average font size of text in a data selection, meaning
"0.3" represents 30% of the height or width.
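To make the idea of a fractional threshold concrete, the Python sketch below uses word spacing as an example. It groups positioned characters on one line into words by comparing each horizontal gap with a fraction of the average character width. This is a simplified illustration, not the software's actual algorithm. The function name, the glyph tuples and the use of "greater than or equal" for the comparison are assumptions.

```python
def split_words(glyphs, avg_char_width, word_spacing=0.3):
    """Group (char, x_start, x_end) glyphs from one line into words.

    Hypothetical helper: a gap of at least word_spacing * avg_char_width
    between two glyphs is treated as the start of a new word.
    """
    threshold = word_spacing * avg_char_width
    words = []
    current = ""
    prev_end = None
    for char, x_start, x_end in glyphs:
        # Start a new word when the blank area before this glyph is wide enough.
        if prev_end is not None and x_start - prev_end >= threshold:
            words.append(current)
            current = ""
        current += char
        prev_end = x_end
    if current:
        words.append(current)
    return words


# Made-up glyph positions; the average character width is assumed to be 5 units,
# so a value of 0.3 gives a threshold of 1.5 units.
line = [
    ("H", 0.0, 5.0),
    ("i", 5.5, 8.0),     # gap 0.5: same word
    ("y", 10.0, 15.0),   # gap 2.0: new word
    ("o", 15.2, 20.0),   # gap 0.2: same word
    ("u", 20.4, 25.0),   # gap 0.4: same word
]

print(split_words(line, avg_char_width=5.0))  # ['Hi', 'you']
```

With a much smaller value, such as 0.05, the same line would be split into more pieces, because small gaps between characters would then count as spaces.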
• Word spacing: Determines the spacing between words. Because PDF text spacing is done
  through positioning rather than actual space characters, text position is what is used to
  find new words. This option determines what fraction of the average width of a single
  character needs to be empty for a new word to be considered started. The default value
  is 0.3, meaning a space is assumed if there is a blank area of 30% of the width of the