18.0
Table Of Contents
- Welcome
- Installation and setup
- Using OmniPage
- Processing documents
- Proofing and editing
- Saving and exporting
- Workflows
- Technical information
- Index
Chapter 4 Proofing and editing 55
in different orientations. The program can handle these; in the output they appear right-
rotated.
Beside the language list the option Verify language choices invokes automatic language
detection that warns of differences between a detected language and the language setting. It
works at page-level and identifies four categories: Japanese, Chinese, Korean and non-Asian.
It cannot distinguish between Traditional and Simplified Chinese or between non-Asian
languages. The last category means Japanese, Chinese or Korean characters were not
detected. Verification takes place during image pre-processing, so the required recognition
language must be set before image loading.
Auto-layout and auto-zoning are recommended for Asian pages. This places all detected texts
into text zones; by choosing an Asian recognition language you set Asian OCR to run in these
zones and that can automatically detect and transmit the text direction, coping with mixed
areas of horizontal and vertical texts on a page.
However, the zoning tool lets you force vertical Asian recognition by manual zoning.
Please draw rectangular zones with this tool. To manually zone horizontal Asian text, use the
usual text zone type. Do not use the two other vertical-text tools on Asian texts. Drawing a
vertical Asian zone does not automatically enable an Asian language, nor influence the
language auto-detection.
Digital camera images are accepted for Asian languages. However, the automatic 3D deskew
algorithm is unlikely to be useful - certainly not for vertical texts. Preferably use the standard
image loading command and perform manual 3D deskewing with the relevant SET tool if
required. In general, SET tools can be used on Asian images.
Recognized Asian pages appear in the Text Editor, provided your system has support for East
Asian languages - always with horizontal text direction. There is no need to specify Asian
fonts under Options/OCR, a default font is automatically applied - typically Arial Unicode
MS. Other Asian-capable fonts on your system can be chosen in the Text Editor. Editor
support allows text viewing and verifying - Formatted Text is recommended as formatting
level. Large-scale editing and spell-checking are better done in the target application.
Proofing, training and dictionary support are not available for Asian texts. Therefore, prior to
performing Asian OCR, go to the Proofing panel under Options and disable dictionary word
marking, automatic proofreading and IntelliTrain and ensure that no training file is loaded.
Redaction can be applied to Asian texts, either by selection or searching. The workflow step
Form Data Extraction should not be applied to Asian pages.
Typical output converters for Asian texts are RTF, Microsoft Word, Searchable PDF or XPS.
The text direction will be as detected during pre-processing. Changes made in the Text Editor