LEGAL NOTICES Copyright © 2013 Nuance Communications, Inc. All rights reserved. No part of this publication may be transmitted, transcribed, reproduced, stored in any retrieval system or translated into any language or computer language in any form or by any means, mechanical, electronic, magnetic, optical, chemical, manual, or otherwise, without prior written consent from Nuance Communications, Inc., 1 Wayside Road, Burlington, Massachusetts 01803-4609.
C O N T E N T S WELCOME . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 New Features in OmniPage Ultimate ................................................................................................. 2 Key Features in OmniPage Ultimate .................................................................................................. 4 I NS TA LL AT ION AND S E TU P . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 System Requirements .........
S AV IN G AND E X PO R T I N G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58 Saving and Exporting ......................................................................................................................... 58 Saving Original Images ...................................................................................................................... 58 Saving Recognition Results ..............................................................................................
Welcome Welcome to this OmniPage® Ultimate text recognition program, and thank you for choosing our software! The following documentation has been provided to help you get started and give you an overview of the program. This User’s Guide This guide introduces you to using OmniPage Ultimate. It includes installation and setup instructions, a description of the program’s commands and working areas, task-oriented instructions, ways to customize and control processing, and technical information.
Electronic Help OmniPage Help contains information on features, settings, and procedures. It also has a comprehensive glossary, with its own alphabetical index and a table of contents. The HTML help system has been designed for quick and easy information retrieval. Help is available after you install OmniPage. Comprehensive context-sensitive help aims to provide just enough assistance to let you keep working without delay. It is available from dialog boxes.
• DocuDirect: This is a powerful workflow management tool – previously known as the Batch Manager. Technical improvements make large-scale processing more robust, with better reporting and separation of problematic documents and improved recovery from critical situations. Differing default settings for Workflow Assistant and DocuDirect are introduced to better match their purposes. See “At a later time” on page 23. • Make PDF file searchable: A new workflow step is available inside DocuDirect.
Key Features in OmniPage Ultimate Click the links for more information. • Customize Windows Explorer shortcut menus: The OmniPage items in the Windows Explorer shortcut menus of input files allow direct conversion to popular file formats, and the addition of user-defined workflows to the menu; the Convert Now Wizard makes it easy to customize the conversion process. • Handling multiple documents: Multiple document handling allows you to work on more than one document at a time.
• Linking workflows to scanner buttons: OmniPage functions and workflows can be associated with scanner buttons, so the whole pre-processing, recognition and storage of documents can be launched from the scanner. See “Scanning to OmniPage and workflows” on page 29. Features in OmniPage Ultimate only This icon is used throughout the guide to denote features that are available only in OmniPage Ultimate.
Installation and Setup This chapter provides information on installing and starting OmniPage.
• • • • 2-megapixel digital camera with auto-focus or higher for digital camera text capture. See Help for details. A compatible scanner with its own scanner driver software for scanning documents (WIA, TWAIN, or ISIS scanner driver). See the Scanner Guide at Nuance’s web site (www.nuance.
exclude or add modules. To exclude a module, click its down arrow and select ‘This feature will not be available’. 4. Follow the instructions on each screen to install the software. All files needed for scanning are copied automatically during installation. Unless deselected in the OmniPage Ultimate installation, Nuance PDF Create 8 installation starts as soon the installation of OmniPage is completed. Document-to-document conversions depend on PDF Create being present.
The wizard reports whether the chosen scanner model already has settings in the scanner database. If it does, you do not need to test it. If it does not, you should test it. Click on Next. • If you chose not to test, click Finish. If you chose testing, click Next to have the scanner connection tested. If the connection is in order, you see a menu of further tests. Choose which testing steps you want to run. The Basic test scan is recommended.
How to Start the Program OmniPage Ultimate features OmniPage Launchpad, a new clear-cut metro-style start page for simplified, faster conversions. Click Start in the Windows taskbar and choose All Programs > Nuance OmniPage Ultimate>OmniPage Launchpad for accessing it. The OmniPage Launchpad looks like this: 1. The Build panel column ‘Convert’ – choose a page type that best describes the layout of the input document. 2. The Build panel column ‘To’ – choose the output file type you desire. 3.
6. The currently selected ‘Save’ tile. These three form the Go-flow in the fourth slot. 7. The Go-flow slots. 8. The currently selected Go-flow, just compiled from the selected Build Panel tiles. 9. The last unfilled Go-flow slot. 10. Run the selected Go-flow. 11. The Settings bar – collection of eight buttons (six of them with two different states) for managing the prepared Go-flows.
• • Click the OmniPage Agent icon on the taskbar. Choose a workflow to start the program and run the workflow. Use OmniPage Ultimate with Nuance’s PaperPort document management product, to add OCR services. See “How to Use OmniPage with PaperPort” on page 21. Registering your Software Nuance’s online registration runs at the end of installation. Ensure web access is available. We provide an easy electronic form that can be completed in less than five minutes. When the form is filled, click Submit.
Uninstalling the Software Sometimes uninstalling and then reinstalling OmniPage will solve a problem. The OmniPage Uninstall program will not remove files containing recognition results or any of the following user-created files: Zone templates (*.zon) Image enhancement templates (*.ipp) Training files (*.otn) User dictionaries (*.ud) OmniPage Documents (*.opd) Job files (*.opj) Workflow files (*.xwf) To uninstall you must be logged into your computer with administrator privileges.
Using OmniPage OmniPage Ultimate uses optical character recognition (OCR) technology to transform text from scanned pages or image files into editable text for use in your favorite computer applications. In addition to text recognition, OmniPage can retain the following elements and attributes of a document through the OCR process.
The OmniPage Desktop and Views OmniPage comes with three different views to suit your task. • Classic View - This view has a similar look and feel to previous versions of OmniPage. • Flexible View - This view provides an alternate layout of the OmniPage function panels stacked in a tabbed view to give each panel more space. • Quick Convert View - This view is designed for quick and easy document conversion without having to learn a lot.
Standard Toolbar OmniPage Toolbox Formatting toolbar Thumbnails Image toolbar Document Manager Page Image Status bar Text Editor OmniPage toolbox: This Toolbox lets you drive the processing. Thumbnails panel: This displays page thumbnails. Document Manager: This provides an overview of your document with a table. Each row represents one page. Columns present statistical or status information for each page, and (where appropriate) document totals.
Flexible View Use this view to set up the OmniPage workspace so that it fits your task optimally. By default all panels appear. There are five tabs: Page Image (including Thumbnails), Text Editor, Easy Loader, Workflow Status and Help. The Document Manager appears in a horizontal panel at the base of the working area. You can undock, move, minimize, group or close panels as already described. Drag a tab onto the working area to convert it to a Classic-type tiled panel.
Verifying (dual-screen) Place the Page Image on one screen and the Text Editor on the other. This gives you more space for editing and proofing. The Page Image is always available for verifying recognition and for performing on-the-fly zoning and editing. The scenarios presented above are only examples to give you an idea of what you can do in Flexible View. Quick Convert View Use the Quick Convert View for fast recognition and saving.
The Easy Loader is by default on a tab that toggles with the Quick Convert Options panel. A Help panel can be added, but further panels are not available in this view. You can change tabs to separate panels and minimize them, as in other views. After loading a file, you should convert it before loading the next file. When an image conversion is finished, you do not need to explicitly close the image; just load a new file. The Easy Loader in Quick View provides an additional feature: ‘one-click’ processing.
Image toolbar: Performs image, zoning and table operations. Three of its tool groups can now be handled separately (mini-toolbars): Zones toolbar: Offers zoning tools. • Rotate toolbar: Provides rotating tools. • Table toolbar: Inserts, moves and removes row and column dividers. Formatting toolbar: Formats recognized text in the Text Editor. Verifier toolbar: Controls the location and appearance of the verifier. Reorder toolbar: Modifies the order of elements in recognized pages.
Using OmniPage, you can choose from the following processing methods: Automatic, Manual, Combined, or Workflow. You can start recognition from other applications, using Direct OCR and can also schedule processing to run at a later time. Processing methods are detailed in the next chapter and in the Help. Settings The Options dialog box is the central location for OmniPage settings. Access it from the Standard toolbar or the Tools menu. Context-sensitive help provides information on each setting.
Processing Documents This tutorial chapter describes different ways you can process a document and also provides information on key parts of this processing. Processing Methods Using OmniPage, you can choose from the following processing methods: Automatic A fast and easy way to process documents is to let OmniPage do it automatically for you. Select settings in the Options dialog box and in the OmniPage Toolbox dropdown lists and then click Start.
The default for manual processing is to have all entered pages automatically selected. This way you can have all new pages recognized by a single mouse click. You can remove this default in the Process panel of the Options dialog box. Combined You can process a document automatically and view results in the Text Editor. If most pages are in order, but a few have not turned out as expected, you can switch to manual processing to adjust settings and re-recognize just those problem pages.
Processing from other applications You can use the Direct OCR™ feature to call on the recognition services of OmniPage while you work in the following applications: Microsoft Office XP or higher, Corel WordPerfect 12 or X3. First you must check the Enable Direct OCR check box under Tools > Options > General. Then, two buttons in the Office 2010 or 2013 Nuance OCR tab, or in an OmniPage toolbar open the door to OCR facilities. How to set up Direct OCR Start the application you want connected to OmniPage.
6. If proofing was specified, this follows recognition. Then the recognized text is placed at the cursor position in your application, with the formatting level specified in the Output Format panel under Acquire Text Settings. Defining the Source of Page Images There are three possible image sources: from image files, from a digital camera and from a scanner. There are two main types of scanners: flatbed or sheetfed.
In OmniPage Ultimate, files can also be imported from Microsoft SharePoint 2003, 2007 and 2010, Hummingbird, iManage and ODMA-compliant Enterprise Content Management sources. Input from digital camera Digital camera files are auto-detected in OmniPage Ultimate, hence there is no need to use Load Digital Camera Files button. Auto-detection of camera files means that now they can be processed as camera files from any source, even from the cloud.
current document’s list with Delete All or Clear in the Process menu. Use Clear all to clear all files destined for all open documents. See a tutorial in Help on loading files for multiple documents. Easy Loader is available as a panel in Quick Convert View. The Process menu has two commands unique to Quick View. • Get and Convert offers 'one-button' processing - files are loaded, passed through recognition and saved to files using existing settings.
Scan black and white Select this to scan in black-and-white. Black-and-white images can be scanned and handled quicker than others and occupy less disk space. Scan grayscale Select this to use grayscale scanning. For best OCR accuracy, use this for pages with varying or low contrast (not much difference between light and dark) and with text on colored or shaded backgrounds. Scan color Select this to scan in color. This will function only with color scanners.
Scanning to OmniPage and workflows Go to Tools / Options / Scanners to choose an action to be performed when a button on your local scanner is pushed. This can be simple scanning resulting in images loaded into OmniPage. It is also possible to select a scanner-based workflow from those you have created or choose to be prompted to select a workflow whenever the button is pressed. Use the Control Panel button to associate OmniPage with a scanner event (a scanner button being pressed).
Single column, no table Choose this setting if your pages contain only one column of text and no table. Business letters or pages from a book are normally like this. Multiple columns, no table Choose this if some of your pages contain text in columns and you want this decolumnized or kept in separate columns, similar to the original layout. Single column with table Choose this if your page contains only one column of text and a table.
Preprocessing Images To improve OCR results, you can enhance your images before zoning and recognition using the Image Enhancement tools. Click the SET - Enhance Image button in the Image Toolbar to open the Image Enhancement window. This window has a starting image panel (1) on the left and a result panel (2) on the right. Choose a tool (see following topics), then move sliders and adjust controls (3). When the result is good, click Apply (4).
The input for Image Enhancement is the Primary image This tool lets you switch between the Primary and the OCR image. Some tools affect the Primary image, others the OCR image. Be sure you know which image you are editing. Good brightness and contrast settings play an important role in OCR accuracy. Set these in the Scanner panel of the Options dialog box or in your scanner’s interface. The diagram illustrates an optimum brightness setting. After loading an image, check its appearance.
P/O - affects both images. WH - applies to whole images only. AR - can be applied to selected image areas. Pointer (F5) - the Pointer is a neutral tool carrying out different operations under different circumstances (for example, to pick a color for the Fill operation, or to catch the deskew line.) PO. Zoom (F6) - click the tool then use the left mouse button to zoom in on your image or the right mouse button to zoom out. You can also use the mouse wheel for zooming in and out - even in the inactive view.
Despeckle - click this tool to remove stray dots from your image. Despeckle works on the OCR image at 4 levels of severity. You can also use this tool not to remove noise from the page but to strengthen letter outlines: to do this mark the checkbox Inverse despeckling. O. AR. OCR Brightness - use this tool the set Brightness and Contrast of your OCR image. See the diagram of optimum brightness under Preprocessing Images above. O. AR.
Auto-crop - automatically detects margin areas on the page and reduces this to a minimum. This is a way of unifying the margins on a set of pages with different sized text areas. P+O. WH > AR Clean borders - removes scanning shadows, spots and marginal notes from page edges P+O. WH but relates only to the border area. Punch-hole remover - replaces punch holes with the background page color. P+O. WH but relates only to the border area.
Here the 3D deskew is being applied, with the result on the right. The Enhance whiteboard photo tool’s slider is being used to improve the contrast of the image. On the left is the starting image; on the right is the result.
Some of these tools are also available for automatic pre-processing of all incoming images. These are shown on the Process panel of the Options dialog box. Using Image Enhancement history To commit or undo your image edits (one by one or all the steps), use the History panel in the Image Enhancement window. Once you have modified the starting image, the result window displays the changes. Click the Apply button next to the History list to commit the change.
The following options are available: Display images for manual enhancement - during the execution of a workflow, each loaded image will be displayed for manual editing. Apply enhancement template - an already saved enhancement template will be applied automatically to the image while being processed by the workflow. Apply enhancement template and display - the workflow will apply the selected image enhancement template, and will also display the image so that you can make further edits to it.
Vertical Asian text appears horizontally in the Text Editor, but can be exported as vertical - see Chapter 4, page 47. Auto-zoning detects vertical texts in non-Asian languages in table cells and anywhere on Normal PDF or XPS pages. Multi-line detection is possible in these cases.
Vertical Asian text zone Use this to draw text zones for vertical text in Japanese or Chinese. Zones should be rectangular. Vertical left-rotated text zone Use this to draw text zones for vertical text that is left rotated (non-Asian languages only). The zones should be rectangular. Vertical right-rotated text zone Use this to draw a text zone for vertical text that is right rotated (non-Asian languages only). The zones should be rectangular. Table zone Use this to have the zone contents treated as a table.
To resize a zone, select it by clicking in it, move the cursor to a side or corner, catch a handle and move it to the desired location. It cannot overlap another zone. To make an irregular zone by addition draw a partially overlapping zone of the same type. To join two zones of the same type draw an overlapping zone of the same type (drawn zones on the left, resulting zone on the right). To make an irregular zone by subtraction draw an overlapping zone of the same type as the background.
Table grids in the image After automatic processing you may see table zones placed on a page. They are denoted with a table zone icon in the top left corner of the zone. To change a rectangular zone to or from a table zone, use its shortcut menu. You can also draw table type zones, but they must remain rectangular. You draw or move table dividers to determine where gridlines will appear when the table is placed in the Text Editor.
Process zones or process background areas from a template may be replaced during recognition by a set of smaller zones; specific zone types will be assigned to these zones. How to save a zone template Select a background value and prepare zones on a page. Check their locations and properties. Click Zone Template... in the Tools menu. In the dialog box, select [zones on page] and click Save, then assign a name and optionally a different path. Choose a network location to share the template file. Click OK.
Proofing and Editing Recognition results are placed in the Text Editor. These can be recognized texts, tables, forms and embedded graphics. This WYSIWYG (What-You-See-Is-What-You-Get) editor is detailed in this chapter. Asian text handling is in some respects different from other languages. See “Asian language recognition” on page 48. The Editor Display and Formatting Levels The Text Editor displays recognized texts and can mark words that were suspected during recognition with red, wavy underlines.
Proofreading OCR Results After a page is recognized, the recognition results appear in the Text Editor. Proofreading starts automatically if that was requested in the Proofing panel of the Options dialog box. You can start proofing manually any time. Work as follows: 1. Click the Proofread OCR tool in the Standard toolbar, or choose Proofread OCR... in the Tools menu. 2. Proofing starts from the current page, but skips text already proofed.
How much context for dynamic verifier? • one word • three words (current + neighbors) • whole image line zoom in/out To turn the Verifier on, click the Verifier tool or press F9. To turn it off, click the Verifier tool again, press F9 again, or press Esc. A full list of verifier keyboard shortcuts is available in Help. The Character Map The Character Map is a dockable tool giving you aid in proofing.
User Dictionaries The program has built-in dictionaries for many languages. These assist during recognition and may offer suggestions during proofing. They can be supplemented by user dictionaries. You can save any number of user dictionaries, but only one can be loaded at a time. A dictionary called Custom is the default user dictionary for Microsoft Word.
language to the whole page. That means this feature is not suitable for pages containing multiple languages. The program chooses from the languages with dictionary support that use a Latin-based alphabet (meaning Russian and Greek are excluded) plus optionally Asian languages. Choose from three language groups: Latin-alphabet languages (choose it to see the enabled languages) • Asian languages (Japanese, Korean and Chinese – Traditional and Simplified) • Latin-alphabet and Asian languages.
languages. The last category means Japanese, Chinese or Korean characters were not detected. Verification takes place during image pre-processing, so the required recognition language must be set before image loading. Auto-layout and auto-zoning are recommended for Asian pages.
throughout a document. OmniPage offers two types of training: manual training and automatic training (IntelliTrain). Data coming from both types of training are combined and available for saving to a training file. When you leave a page on which training data was generated, you will be asked how to apply it to other existing pages in the document.
A training file can be also edited; its name appears in the title bar. If it has unsaved training added to it, an asterisk appears after its name. Both the unsaved and the modified training are saved when you close the dialog box. The Edit Training dialog box displays frames containing a character shape and an OCR solution assigned to that shape. Click a frame to select it. Then you can delete it with the Delete key, or change the assignation. Use arrow keys to move to the next or previous frame.
Graphics You can edit the contents of a selected graphic if you have an image editor in your computer. Click Edit Picture With in the Format menu. Here you can choose to use the image editor associated with BMP files in your Windows system, and load the graphic. Alternatively, you can use the Choose Program... item to select another program. This will replace the Default Image Editor item. Edit the graphic and then close the editor to have it re-embedded in the Text Editor.
On-the-Fly Editing This allows you to modify a recognized page through re-zoning, without having to re-process the whole page. When on-the-fly editing is enabled, zone changes (deleting, drawing, resizing, changing type) immediately make changes in the recognized page. Conversely, when you modify elements in the Text Editor’s True Page formatting level, this changes the zones on that page. Two linked tools on the Image toolbar control on-the-fly zoning.
To find and redact text by searching, select Find and Mark Text from the Edit menu to display the Find, Replace and Mark Text dialog box. Search for text to be marked for redaction. Step through all occurrences and decide for each case whether to redact immediately or mark for redaction. In the latter case, perform the redaction by choosing Close and Redact Document in the Mark Text dialog box or later click the Redact Document button.
example, male or female for a given language), a reading speed and the volume. You must ensure the language selection is appropriate for the text you want to hear. You also have the following keyboard controls: To do this: Use this: Pause/Resume Ctrl + Numpad 5 Set speed higher Ctrl + Numpad + Set speed lower Ctrl + Numpad – Restore speed Ctrl + Numpad * All speech systems will be installed with OmniPage if you choose a complete installation.
Line: The Line tool is mainly used in layout design: click it and draw lines to separate distinct sections in your form. Rectangle: Click this tool to create rectangles in your form for design purposes. Graphic: Use this tool to select areas of your form that are to be treated as graphics. Fill text: Click this tool to create fillable text fields. These are fields where you want people to enter text. Comb: Use this tool to create a text field consisting of boxes.
Editing Form object properties To edit a form object directly select it then right-click the given element to display its shortcut menu. You can edit the appearance or the properties of any form element here. Use the following commands: Form Object Appearance - use the tabs Borders, Shading and Shadow to design the look of your form elements in a similar way as you would do in a text-editing application.
Saving and Exporting Once you have acquired at least one image for a document, you can export the image to file. Once you have recognized at least one page, you can export recognition results. After further recognition you can save a single page, selected pages or the whole document by saving to file, copying to Clipboard or sending to a mailing application. Saving as an OmniPage Document is always possible. OmniPage provides comprehensive support for Office 2010 and 2013 applications and formats.
1. Choose Save to Files in the Export Results drop-down list. In the dialog box that appears, select Image under Save as. 2. Choose a folder location and a file type. Type in a file name. 3. Select to save the selected zone image(s) only, the current page image, selected page images or all images in the document. For multiple zones or multiple pages, you can have all images in a single multi-page image file, providing you set TIFF, MAX, DCX, JB2 or Image-only PDF or XPS as file type.
Selecting a formatting level The formatting level for export is defined at export time, in the saving dialog box (Save to Files, Copy to Clipboard, Send in Mail or other dialog box). Three of the levels correspond to the format views of the same name in the Text Editor. However, the level to be applied for saving is independent of the formatting view displayed in the Text Editor. When exporting to file or mail, first specify a file type. This determines which formatting levels are available.
worksheet with non-table parts placed in an index worksheet with hyperlinks to each relevant worksheet. Selecting converter options Click the Options... button in a saving dialog box to have precise control over the export. This brings up a dialog box with the name of the converter associated with the current file type. It presents a series of options tailored to this file type. First, confirm or change the formatting level, because this influences which other options are presented.
Saving different page ranges You cannot save different page ranges to different file types, because only one set of selected pages can exist at saving time. For the same reason, a single workflow cannot be used either. Perform two separate saves or use two workflows. Saving to PDF You have five choices when saving to Portable Document Format (PDF) files. The first four are presented as Text converters, the last one is listed among the Image converters.
Tagged PDF Create a tagged PDF file to preserve its structure. This will ensure logical reading order, correct table structure and more. PDF MRC Use this high compression technology for good quality and smaller file size; available for color and grayscale PDF Images or PDF Searchable Images. Linearized PDF Choose this to create PDF files optimized for fast loading and display when embedded in web pages.
Creating PDF files from other applications The Nuance PDF Create product supplied with OmniPage Ultimate provides the ability to create Normal PDF files from documents in any print-capable application on your system. Click File / Print and select the printer ScanSoft PDF Create! Adjust properties as desired and click OK and supply a file name and location. If View resulting PDF is selected, your default PDF viewer displays the result.
3. Type in a name for the new workflow. 4. Choose a document source: Scan, Load files or Load digital camera files. With file input, you will be prompted to choose input files when the workflow starts running. 5. Enter the e-mail address linked to your Kindle reader. 6. Provide a name for the output file. All recognition results enter a single file. 7. Choose Save to save the workflow for later use, or Save and Run to immediately run the workflow and transfer its results to your Kindle device.
• ePub simple: This removes most formatting, but allows text to flow, so it can be resized by the mobile device. Many smart devices analyze incoming text and apply their own formatting. • ePub for poems: This retains formatting but line breaks from the original are conserved. Two ePub sample workflows are shipped with OmniPage Ultimate: • ePub from PDF or Scanned Document: this retains formatting • ePub from PDF or Scanned poems: conserves line breaks The simplest way to prepare an ePub workflow: 1.
box. When you click OK you may be directed to log-in and invited to specify the required path. When using SharePoint, the server, login and password information must be provided only once per session, and it is offered in each subsequent session. If an ODMA-compliant Document Management System (DMS) is detected in your computing environment, it will be offered. If you have access to more than one DMS, the system default will apply.
Workflows A workflow contains a series of processing steps and their settings. It can be saved for repeated use whenever you have a task needing the same processing. Workflows usually begin with a scanning or loading step, but they can also start from the document currently open in OmniPage. After that, they do not have to conform to the traditional 1-2-3 processing pattern. Usually a workflow will include a recognition step, but this is not compulsory.
4. If run-time input selection is specified, the Load Files dialog box awaits your choice of 5. 6. 7. 8. files. If you requested a step requiring interaction (image enhancement, manual zoning, or proofing) the program presents pages for attention. When a page is enhanced, zoned or proofed, click the Page Ready button in the Toolbox or appropriate dialog box to move to the next page.
Workflow Assistant This allows you to create and modify workflows. The Job Wizard also uses this to create or modify workflows that jobs execute - see the next section. The Assistant offers one or more steps, each with a drop-down list. This left panel of the Workflow Assistant dialog box lets you build your workflow. This shows the steps you have chosen. This drop-down list shows the possible steps at any given workflow position. Use this to add a new step to your workflow.
The opening Assistant panel offers two starting points: Choose Fresh Start to begin with no steps in the workflow diagram on the right. Accept or change the default workflow name. Then click Next and choose your first step. Choose an image loading step that can take input from file, scanner or digital camera files. Specify settings on the right. Then move on to build your workflow: it can include a variety of different steps. When done, click Finish.
DocuDirect DocuDirect is a separate but integrated program to let you create jobs to be processed immediately, or at some time in the future. By choosing steps carefully, you can set up jobs that can run unattended. A job executes a workflow according to the job settings. Jobs are created in the Job Wizard.
Job types available in OmniPage Ultimate only: Barcode cover page job: This is a special type of folder watching job (see below). It monitors a folder for incoming barcode pages, then processes subsequently incoming images with the workflow identified by the barcode. For details, see Barcode processing later in this chapter. Folder watching job: Select this job type and browse to the folder(s) to be watched for incoming image files.
The General panel lets you limit the number of pages allowed in an output document, even if the file option Create one file for all pages is selected. When the limit is reached, a new file is started, distinguished by a numerical suffix. Click Finish to confirm job creation. Modifying jobs Jobs with an inactive status can be modified. Select the job in the left panel of DocuDirect and choose Modify from the Edit menu or click the Modify Job button. First, modify timing instructions as desired.
Activate Job in the File menu serves to activate any inactive job immediately. Deactivate Job in the File menu deactivates any active job. If the job is running, this will stop it before deactivating. Choose this to close a Watch type job immediately to save its result. Stop Job in the File menu stops a job with status Starting, Running, or Paused. Pause Job is available for jobs with status Running or Starting. To modify such a job’s timing instructions you must stop it.
Watched Folders In OmniPage Ultimate you can specify watched folders and e-mail inboxes (Outlook and Lotus Notes) as job input. These allow processing to be started automatically whenever image files are placed in pre-defined folders or arrive into inboxes as e-mail attachments. This is useful to have sets of files with predictable content arriving from remote locations processed automatically on arrival, even if no-one is in attendance.
Monday, Wednesday and Friday, or overnight in the last three days of each month, when you keep your computer running to collect and process monthly reports arriving from afar. When files enter a watched folder, the program waits for approximately the interval specified in DocuDirect Options for more files to arrive in order to process them together. When files cease to arrive, processing starts. To finish the watching early, choose Deactivate Job. Then you can modify the job freely.
2. Print a barcode page that identifies the workflow. 3. Start barcode processing from the scanner. To scan with a barcode page: 1. Place the barcode cover page on the top of the document in the ADF. 2. Press the Start button on the scanner. 3. Select “Barcode cover page workflow” as Scanner button default action on the Scanner tab of Options. You can also set it to Prompt for workflow.
File-it Assistant The File-it Assistant lets you create scanning workflows for repeated document conversion tasks. The Assistant is for scanning jobs that require no user interaction during the processing. In a typical scenario, operators at a scanning station prepare documents, applying the appropriate barcode cover page to each, without needing to know anything about the later processing or destination of the documents, because all that is pre-determined.
Convert to PDF Job This allows input from document files (typically MS office files plus txt, csv) provided their native applications are installed; output is one PDF file for each input file with the same name as the input file. The saving location can be specified. Typically, the resulting PDF files are both searchable and editable. Nuance PDF Create must be present for this job type. Make PDF Searchable This accepts input from image-only PDF files or PDF files which may contain image-only areas or pages.
Technical Information This chapter provides troubleshooting and other technical information about using OmniPage. Please also read the Readme file and other help topics, or visit the Nuance web pages. Troubleshooting Although OmniPage is designed to be easy to use, problems sometimes occur. Many of the error messages contain self-explanatory descriptions of what to do – check connections, close other applications to free up memory, and so on.
Testing OmniPage Restarting Windows in its safe mode allows you to test OmniPage on a simplified system. This is recommended when you cannot resolve crashing problems or if OmniPage has stopped running altogether. See Windows online Help for more information. To test OmniPage in safe mode: 1. Restart your computer in safe mode by pressing F8 immediately after you see the ‘Starting Windows’ message. 2. Launch OmniPage and try performing OCR on an image.
• • • • • • • Check the resolution of the original image. Hover the cursor over a page thumbnail for a popup display. If the resolution is significantly above or below 300 dpi, recognition is likely to suffer. Make sure the correct document languages are selected in the OCR panel of the Options dialog box. Only languages included in the document should be selected. In particular, setting an Asian language for non-Asian texts (and vice versa) is likely to produce unusable results.
Break complex page images (lots of text and graphics or elaborate formatting) into smaller jobs. Draw zones manually or modify automatically created zones and perform OCR on one page area at a time. See “Working with zones” on page 40. • Restart Windows in safe mode and test OmniPage by performing OCR on the included sample image files. If you are performing multiple tasks at once, such as recognizing and printing, OCR may take longer.
• • • • • • • • • • • • • • PDF with image substitutes (*.pdf) Text (*.txt) Text - Comma Separated (*.csv) Text - Formatted (*.txt) Text with line breaks (*.txt) Unicode Text (*.txt) Unicode Text - Comma Separated (*.csv) Unicode Text - Formatted (*.txt) Unicode Text with line breaks (*.txt) WordPad (*.rtf) WordPerfect 12, X3 (*.wpd) XML (*.xml) XPS (*.xps) XPS Searchable Image (*.
Index Click a page number to jump to the referenced item.
new workflows from existing ones 71 training data 50 workflows 71 Crookedly scanned pages 34 Crop (E) 33 Ctrl to avoid panel redocking 15 Custom Layout 30 Custom views 19 Customizing export converters 61 Duplex scanners 28 Dynamic verifier 45 E East Asian language support 7, 48 Easy Loader 15, 17, 26 Easy Loader in Quick View 19, 27 eDiscovery Assistant for searchable PDF 63 Editing character attributes 51 form objects 57 graphics 52 in True Page 52 on-the-fly 53 paragraph attributes 51 PDF output 62 reco
Extracting text from PDF files 64 Graphic zones 40 Graphics editing 52 in export 59 Grayscale images 59 scanning 28 Grouping elements 52 F Fast recognition and saving 18 Fax recognition 83 Features, new 2 File-it Assistant 79 Files as export target 58 as image source 25 retained on uninstall 13 separation options 59 types for export 60 Fill text tool (F) 56 Fill (E) 34 Financial dictionaries 48 Finding non-dictionary words 45 suspect words 45 Finishing proofing in a workflow 69 workflows 71 zoning in a wo
Links to web pages 52 Loading Image Enhancement templates 37 image files 25 images from Windows Explorer 27 images with Easy Loader 19, 26 training files 50 user dictionaries 47 zone templates 30, 42 Lotus Notes 72, 73, 77 Improving accuracy 27, 50, 82 Increasing memory 82 Input from digital camera 26 from image files 25 from PDF files 25 from scanners 27 via Easy Loader 26 Installing OmniPage 7 scanners 8 IntelliTrain 50, 83 Interactive job steps 73 Italic text 51 M Mail 64 Mailbox watching 77 Managing j
Numeric zones 39 P O Page Image panel 15 Page limit for jobs 74 Page Ready button 69 Pages deskewing 34 multi-page image files 59 navigation 15 sending as mail 64 sending to Clipboard 58 Panels 15 PaperPort 13, 21 Paragraph editing attributes 51 styles 51, 59 Passwords for PDF 63 Pausing workflows 69 PDF converting from/to 63 PDF Edited 62 PDF file input 25 PDF flavors 62 PDF linearized 63 PDF to MS Word 64 PDF-make fully searchable 63 Pending pages 53 Performance problems during OCR 83 Plain Text in Edi
Sample image files 82 Saturation / Hue (E) 33 Saving and launching 59 as OmniPage Document 58 documents 58 options 61 original images 59 PDF files 62 recognition results 59 text 59 to file 58 to mail 64 to multiple file types 61 training files 50 user dictionaries 47 zone templates 43 Saving and applying Image Enhancement templates 37 Scanners 83 drivers 9 duplex 28 setting up 8 Scanning 28 input from 28 pictures 28 to workflows 29, 79 Wizard 8 Scheduled processing 72 Searchable PDF 62, 63 Searching PDF out
Single-column pages with tables 30 Skipping interactive job steps 73 Slow recognition 83 Smart folders 76, 77 Solutions for poor performance 81 Specialized dictionaries 48 Speed zoning 41 Spreadsheet pages 30 Standard toolbar 15 Starting a user dictionary 47 Starting DocuDirect 72 Starting the program 8 Status of jobs 74, 75 Step-by-step processing 15 Steps for workflows 70 Stopping workflows 69 Storing zoning changes 53 Straightening pages 34 Strengthening letter outlines 34 Striking out text 53 Subtractiv
manual 38, 82, 84 modifying templates 42 numeric 39 process 40 prohibited shapes 41 properties 39 replacing templates 42 saving templates 42 table 40, 42 templates 30, 42, 82 types 39, 82 unloading templates 43 vertical Asian text 48 working with 40 Zoning in a workflow 69 Zoning on-the-fly 53 Zoom (E) 33 Zooming displays 15, 45 63 resetting 17 using Window menu 17 W Warning messages from jobs 75 Watched folders 76, 77 Watched mailboxes 77 Web access for activation 7 Web display with PDF files 63 Web page
T HIR D PA RTY L I CEN SE S / N OT I CES The word verification, spelling and hyphenation portions of this product are based in part on Proximity Linguistic Technology. The Proximity Hyphenation System © Copyright 1988. All Rights Reserved. Franklin Electronic Publishers, Inc. The Proximity/Merriam-Webster American English Linguibases. © Copyright 1982, 1983, 1987, 1988 Merriam-Webster Inc. © Copyright 1982, 1983, 1987, 1988 Franklin Electronic Publishers, Inc.