What Is Optical Character Recognition (OCR)?

Optical character recognition (OCR) is the process of turning an image into computer-editable text. An image is an electronic picture of text such as a scanned paper document or an electronic fax file. Images do not have editable text characters; they have many tiny dots (pixels) that together form a picture of text.
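
To make the distinction concrete, the following minimal Python sketch contrasts an editable character with a picture of that character. The 5-by-5 grid and the names GLYPH_T and render are assumptions made up for illustration; they are not part of OmniPage Pro.

```python
# A picture of the letter "T" stored as a grid of pixels (1 = dark dot, 0 = blank).
# The 5x5 size is an arbitrary choice for illustration only.
GLYPH_T = [
    [1, 1, 1, 1, 1],
    [0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0],
]


def render(pixels):
    """Return the pixel grid as printable lines so the 'picture' can be viewed."""
    return "\n".join("".join("#" if p else "." for p in row) for row in pixels)


# Editable text: the computer stores the character itself, so it can be changed.
text = "T"
print(text.lower())

# Image: only dots are stored; nothing in the data says "this is the letter T".
print(render(GLYPH_T))
```

The string can be edited directly, while the grid is only a pattern of dots that happens to look like a letter to a human reader.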

During OCR, OmniPage Pro analyzes an image and identifies the characters in it to produce editable text. After OCR, you can save the resulting text to a variety of word-processing, page layout, and spreadsheet applications.
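
The general idea of turning pixels into characters can be sketched with a toy example. The Python code below uses simple template matching: each glyph is compared against stored reference patterns, and the closest one wins. This is only an illustration under stated assumptions; it is not OmniPage Pro's recognition method, and the 3-by-3 templates and the names TEMPLATES, recognize_glyph, and recognize are hypothetical.

```python
# Hypothetical 3x3 reference patterns for two characters (1 = dark pixel).
TEMPLATES = {
    "I": ((0, 1, 0), (0, 1, 0), (0, 1, 0)),
    "L": ((1, 0, 0), (1, 0, 0), (1, 1, 1)),
}


def recognize_glyph(glyph):
    """Return the template character whose pixels differ least from the glyph."""
    def distance(template):
        # Count the pixel positions where glyph and template disagree.
        return sum(
            a != b
            for row_g, row_t in zip(glyph, template)
            for a, b in zip(row_g, row_t)
        )

    return min(TEMPLATES, key=lambda ch: distance(TEMPLATES[ch]))


def recognize(glyphs):
    """Turn a sequence of pixel grids into an editable text string."""
    return "".join(recognize_glyph(g) for g in glyphs)


# A slightly noisy "L" (one stray dark pixel) followed by a clean "I".
noisy_l = ((1, 0, 0), (1, 0, 1), (1, 1, 1))
clean_i = ((0, 1, 0), (0, 1, 0), (0, 1, 0))
print(recognize([noisy_l, clean_i]))
```

Choosing the closest pattern instead of requiring an exact match lets the sketch tolerate a stray dot, which is the kind of imperfection a scanned page or fax is likely to contain.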

OmniPage Pro’s OCR Capabilities

In addition to text recognition, OmniPage Pro can retain the following elements of a document during OCR.

Graphics

Photos, logos, and drawings are examples of graphics.

Text formatting

Font types, font sizes, and font styles (such as bold or italic) are examples of text formatting.

Page formatting

Column structure, paragraph spacing, table formats, and placement of graphics are examples of page formatting.

The graphics, text formatting, and page formatting elements that OmniPage Pro retains are determined by the settings you select. Refer to the Settings Guidelines in the online Help for more information about selecting settings.

OmniPage Pro recognizes only machine-printed characters, such as laser-printed or typewritten text. However, it can retain handwritten text, such as a signature, as a graphic.