2020.2

Table Of Contents
3.
Click Next.
4.
Click the Browse button and open the PDF/VT or AFP file you want to work with.
Click Next.
After selecting the file, select the following options in the Metadata page: Metadata record
levels: Use the drop-down to select what level in the metadata defines a record.Field List: This
list displays all fields on the chosen level and higher levels in the PDF/VT or AFP metadata.
The right column shows the field name. The left column displays the level on which it is
located. Check any field to add it to the extraction.
Click Finish to close the dialog and open the actual Data Mapping configuration.
On the Settings pane, you will see that the boundary trigger is set to On metadata. The
selected metadata fields are added to the Data Model.
Note
Extracting data from a PDF that comes from a Windows printer queue (a PDF converted
to PostScript, converted back to PDF by an Input task in Workflow) might not work (see
the Connect Knowledge Base.)
The rule of thumb is: if copy-paste from Acrobat works, so will data mapping; if not, the
DataMapper won't either.
Rotated pages in a PDF are supported (if rotated 0/90/180/270 degrees). The Extract
step will be able to extract data from horizontal and vertical lines of text on rotated pages.
Motion steps (such as the Repeat step and the Goto step) however, can only work as
expected if text on a page has the same orientation as the page, not when text has been
rotated after the page was rotated.
The page number and rotation of a page are shown in the status bar at the bottom, next to
the region selection information.
Using the wizard for XML files
The DataMapper wizard for XML and JSON files helps you create a data mapping configuration
for an XML file. The wizard lets you select the type of node and the trigger that delimit the start
of a new record. Next, the wizard extracts the data in one extraction step.
This wizard can also be used to extract data from a JSON file. JSON files are automatically
converted to XML.
Page 210