Operation Manual

Creating PDFs

Last updated 4/7/2015

2 In the Recognize Text dialog box, click Add Files, and choose Add Files, Add Folders, or Add Open Files. Then

select the files or folder.

3 In the Output Options dialog box, specify a target folder for output files, and filename preferences.

4 In the Recognize Text - General Settings dialog box, specify the options, and then click OK.

Acrobat creates a layer of text in your PDF that can be searched — or copied and pasted into a new document.

Recognize Text - General Settings dialog box

Document Language Specifies the language for the OCR engine to use to identify the characters.

Output (PDF Output Style) Determines the type of PDF to produce. All options require an input resolution of 72 dpi or

higher (recommended). All formats apply OCR and font and page recognition to the text images and convert them to

normal text.

Searchable Image Ensures that text is searchable and selectable. This option keeps the original image, deskews it as

needed, and places an invisible text layer over it. The selection for Downsample Images in this same dialog box

determines whether the image is downsampled and to what extent.

Searchable Image (Exact) Ensures that text is searchable and selectable. This option keeps the original image and places

an invisible text layer over it. Recommended for cases requiring maximum fidelity to the original image.

Editable Text & Images Synthesizes a new custom font that closely approximates the original, and preserves the page

background using a low-resolution copy.

Downsample To Decreases the number of pixels in color, grayscale, and monochrome images after OCR is complete.

Choose the degree of downsampling to apply. Higher-numbered options do less downsampling, producing higher-

resolution PDFs.

Correct OCR text in PDFs

When you run OCR on a scanned output, Acrobat DC analyzes bitmaps of text and substitutes words and characters

for those bitmap areas. If the ideal substitution is uncertain, Acrobat DC marks the word as suspect. Suspects appear

in the PDF as the original bitmap of the word, but the text is included on an invisible layer behind the bitmap of the

word. This method makes the word searchable even though it is displayed as a bitmap.

Note: If you try to select text in a scanned PDF that does not have OCR applied, or try to perform a Read Out Loud

operation on an image file, Acrobat DC asks if you want to run OCR. If you click OK, the Text Recognition dialog box

opens and you can select options, which are described in detail under the previous topic.

1 Choose To o ls > Enhance Scans > Recognize Text > Correct Recognized Text.

Acrobat DC identifies suspected text errors and displays the image and text side by side in the Secondary toolbar.

(All suspect words on the page are enclosed in boxes.)

2 Click the highlighted object or box in the document, and then correct it in the Recognized As box in the Secondary

toolbar. Click Accept.

The next suspect is highlighted. Correct mistakes as needed. Click Accept for each correction.

3 Click Close in the Secondary toolbar when the task is complete.

Overview of PDF creation