10.0

Table Of Contents
What Is Optical Character Recognition (OCR)?
8 Chapter 2
What Is Optical Character Recognition (OCR)?
Optical character recognition
(
OCR
) is the process of turning an
image
into
computer-editable text. An image is an electronic picture of text such as
a scanned paper document or an electronic fax file. Images do not have
editable text characters; they have many tiny dots (
pixels
) that together
form a picture of text.
During OCR, OmniPage Pro analyzes an image and defines characters
to produce editable text. After OCR, you can save the resulting text to a
variety of word-processing, page layout, and spreadsheet applications.
OmniPage Pro’s OCR Capabilities
In addition to text recognition, OmniPage Pro can retain the following
elements of a document during OCR.
Graphics
Photos, logos, and drawings are examples of graphics.
Text formatting
Font types, font sizes, and font styles (such as bold or
italic
) are examples
of text formatting.
Page formatting
Column structure, paragraph spacing, table formats, and placement of
graphics are examples of page formatting.
The graphics, text formatting, and page formatting elements that
OmniPage Pro retains are determined by the settings you select. Refer to
the Settings Guidelines in the online Help for more information about
selecting settings.
OmniPage Pro only recognizes machine-printed characters such as
laser-printed or typewritten text. However, it can retain handwritten
text, such as a signature, as a graphic.