2020.2


Click Next.

Click the Browse button and open the PDF/VT or AFP file you want to work with.

Click Next.

After selecting the file, select the following options in the Metadata page:

Metadata record levels: Use the drop-down to select what level in the metadata defines a record.

Field List: This list displays all fields on the chosen level and higher levels in the PDF/VT or AFP metadata. The right column shows the field name. The left column displays the level on which it is located. Check any field to add it to the extraction.

Click Finish to close the dialog and open the actual Data Mapping configuration.

On the Settings pane, you will see that the boundary trigger is set to On metadata. The

selected metadata fields are added to the Data Model.
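As a rough illustration of what the Field List shows, the sketch below models metadata as one set of fields per level and lists the fields on the chosen record level and on all higher levels. The level names, the sample fields, and the function field_list are hypothetical; they are not taken from the DataMapper or from the PDF/VT or AFP specifications.

```python
# Hypothetical level names, ordered from highest (outermost) to lowest.
LEVELS = ["job", "group", "document", "page"]


def field_list(metadata, record_level):
    """Return (level, field name) pairs for the record level and all higher levels."""
    cutoff = LEVELS.index(record_level)
    rows = []
    for level in LEVELS[: cutoff + 1]:
        # A level without metadata simply contributes no fields.
        for name in metadata.get(level, {}):
            rows.append((level, name))
    return rows


# Made-up sample metadata: one dict of fields per level.
sample = {
    "job": {"JobName": "Invoices"},
    "document": {"CustomerID": "C-1001", "Total": "42.50"},
    "page": {"PageSide": "front"},
}

# Choosing "document" as the record level leaves out the lower "page" level.
fields = field_list(sample, "document")
```

Each pair mirrors one row of the list: the level on the left, the field name on the right.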

Note

Extracting data from a PDF that comes from a Windows printer queue (a PDF converted to PostScript, then converted back to PDF by an Input task in Workflow) might not work (see the Connect Knowledge Base).

The rule of thumb is: if copying and pasting text from Acrobat works, data mapping will work too; if it does not, the DataMapper will not be able to extract the data either.

Rotated pages in a PDF are supported (if rotated 0/90/180/270 degrees). The Extract

step will be able to extract data from horizontal and vertical lines of text on rotated pages.

Motion steps (such as the Repeat step and the Goto step), however, only work as expected if the text on a page has the same orientation as the page, not when the text has been rotated after the page was rotated.

The page number and rotation of a page are shown in the status bar at the bottom, next to

the region selection information.

Using the wizard for XML files

The DataMapper wizard for XML and JSON files helps you create a data mapping configuration

for an XML file. The wizard lets you select the type of node and the trigger that delimit the start

of a new record. Next, the wizard extracts the data in one extraction step.
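The idea of a node type marking the start of each record can be sketched outside the DataMapper with Python's standard library. This is only an illustration of the concept, not the wizard's implementation; the sample XML, the invoice element name, and the function extract_records are assumptions made for the example.

```python
import xml.etree.ElementTree as ET

# Made-up sample data: each <invoice> element is treated as one record.
SAMPLE_XML = """
<invoices>
  <invoice><customer>Ann</customer><total>10.00</total></invoice>
  <invoice><customer>Bob</customer><total>25.50</total></invoice>
</invoices>
""".strip()


def extract_records(xml_text, record_tag):
    """Start a new record at every element named record_tag and
    extract its direct child elements as fields in a single pass."""
    root = ET.fromstring(xml_text)
    records = []
    for node in root.iter(record_tag):
        records.append({child.tag: (child.text or "").strip() for child in node})
    return records


records = extract_records(SAMPLE_XML, "invoice")
```

Here every invoice element yields one record with the fields customer and total, comparable to a single extraction step that runs once per record.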

This wizard can also be used to extract data from a JSON file. JSON files are automatically

converted to XML.
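To give an impression of what converting JSON to XML involves, here is a minimal sketch using only Python's standard library. The mapping rules (object keys become element names, array entries become item elements, and the root element is called root) are assumptions chosen for the example; the conversion rules that Connect actually applies are not described here and may differ.

```python
import json
import xml.etree.ElementTree as ET


def to_element(tag, value):
    """Recursively turn a parsed JSON value into an XML element."""
    elem = ET.Element(tag)
    if isinstance(value, dict):
        # Assumed rule: each key becomes a child element.
        # Keys that are not valid XML names are not handled in this sketch.
        for key, child in value.items():
            elem.append(to_element(key, child))
    elif isinstance(value, list):
        # Assumed rule: each array entry becomes an <item> element.
        for item in value:
            elem.append(to_element("item", item))
    else:
        elem.text = "" if value is None else str(value)
    return elem


def json_to_xml(json_text, root_tag="root"):
    """Parse JSON text and serialize it as an XML string."""
    return ET.tostring(to_element(root_tag, json.loads(json_text)), encoding="unicode")


xml_text = json_to_xml('{"customer": {"name": "Ann", "orders": [1, 2]}}')
```

Once the data is in XML form, the same node-based selection of records applies as for a regular XML file.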
