OmniPage Web® User’s Manual
Caere Corporation
100 Cooper Court
Los Gatos, California 95032-7603 USA
www.caere.com

Caere GmbH
Innere Wiener Strasse 5
81667 München, Germany

Caere UK Information Centre
3 Catherine Place
Westminster, London SW1E 6DX

Caere France
72, rue Baratte-Cholet
94100 Saint-Maur, France

Please Note
To use this program, you should know how to work in the Microsoft Windows environment.
Table of Contents

Welcome
  Using This Manual .... 2
  Getting Online Help .... 3
    Help Menu .... 3
    Context-Sensitive Help

    Creating Zones Automatically .... 27
  Performing OCR on a Document .... 28
  Proofreading OCR Results .... 29
    Modifying Words .... 30
  Outlining a Document

    Changing Zone Properties .... 69
  Creating User Dictionaries .... 72
Chapter 6 Technical Information
  General Troubleshooting Solutions .... 74
    Solutions to Try First
Welcome
Welcome to OmniPage Web, and thank you for using our software! The following documentation has been provided to help you learn about OmniPage Web.
This User’s Manual
This manual introduces you to the basics of using OmniPage Web. It includes installation and setup instructions, an introduction to OmniPage Web, task-oriented instructions, ways to customize processing, settings guidelines, and technical information.
Using This Manual
This manual is written with the assumption that you know how to work in the Microsoft Windows environment. Please refer to your Windows documentation if you have questions about how to use dialog boxes, menu commands, scroll bars, drag and drop functionality, shortcut menus, and so on. The following conventions are used in this manual.
Getting Online Help
In addition to using this manual, you can use OmniPage Web’s online Help to learn about features, settings, and procedures. Online Help is available after you install OmniPage Web. Choose How to Use Help... in OmniPage Web’s Help menu to get information on using Windows Help.
Help Menu
One way to open OmniPage Web’s online Help is to choose commands in the Help menu. OmniPage Web Help Topics is the first command in the Help menu.
Product Support
For the fastest and easiest way to get help, please look for solutions in this manual or in the online Help. See “General Troubleshooting Solutions” on page 74 for more information. If you need additional help, please use the following resources.
• Caere on the World Wide Web
To access Caere’s corporate Web site for general product and company information, choose Caere on the Web > Caere Web site in the Help menu. Caere’s Web site address is www.caere.com.
Chapter 1 Installation and Setup
This chapter provides information on installing and starting OmniPage Web.
Minimum System Requirements
You need the following setup, at minimum, to install and run OmniPage Web:
• Computer with a Pentium or higher processor
• Microsoft Windows 95, Windows 98, or Windows NT 4.0
• 16MB of memory (RAM) for Windows 95 and 98
  32MB of memory (RAM) for Windows NT 4.
Setting Up Your Scanner with OmniPage Web
To install OmniPage Web:
1 Insert OmniPage Web’s CD-ROM in the CD-ROM drive. OmniPage Web’s Setup program should start automatically. If it does not start, locate your CD-ROM drive in Windows Explorer and double-click the Setup.exe program at the top level of the CD-ROM.
2 Follow the instructions on the screen to finish installation.
Starting OmniPage Web
To start OmniPage Web, click Start in the Windows taskbar and choose Programs > Caere Applications > OmniPage Web 1.0. Or, double-click the OmniPage Web icon on your Windows desktop. OmniPage Web’s desktop appears when you open OmniPage Web. See “The OmniPage Web Desktop” on page 14 for an introduction to OmniPage Web’s user interface.
[Figure: the OmniPage Web desktop, with the Standard, Zone, and AutoWeb toolbars labeled. Callout: The thumbnail view displays the pages in an open document.]
Registering OmniPage Web
Register your copy of OmniPage Web with Caere Corporation to receive access to product support, notification of special offers, and the best prices on product upgrades.
To register OmniPage Web:
1 Click the Register menu to open the Register dialog box.
2 Click Register Now.
3 Fill out the information requested on the screen and then click Next.
4 Follow the instructions on the screen.
Chapter 2 Introduction to OmniPage Web
You probably have documents lying on your desk that you would like to share with the rest of your company, or, perhaps, the rest of the world. You could photocopy the information and mail it to anyone who might be interested, or you could retype it and hand-code it in HTML format. Neither of these is an appealing option. OmniPage Web offers a smart solution to increase your productivity and the visibility of your documents.
What Is Optical Character Recognition (OCR)?
Optical character recognition (OCR) is the process of turning an image into computer-editable text. An image is an electronic picture of text such as a scanned paper document or an electronic fax file. Images do not have editable text characters; they have many tiny dots (pixels) that together form a picture of text. During OCR, OmniPage Web analyzes an image and defines characters to produce editable text.
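The difference between an image of text and editable text can be illustrated with a short Python sketch. It is only an illustration and says nothing about how OmniPage Web's recognition engine works internally; the 5-by-5 grid and the helper names `render` and `count_dark_pixels` are invented for this example.

```python
# A tiny black-and-white "image" of the letter T.
# 1 = dark pixel, 0 = light pixel. The grid is invented for illustration.
IMAGE_OF_T = [
    [1, 1, 1, 1, 1],
    [0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0],
]

# The same letter as editable text: a single character code.
EDITABLE_TEXT = "T"


def render(pixels):
    """Draw the grid with '#' for dark pixels and '.' for light pixels."""
    return "\n".join(
        "".join("#" if p else "." for p in row) for row in pixels
    )


def count_dark_pixels(pixels):
    """Count the dark pixels that together form the picture."""
    return sum(sum(row) for row in pixels)


if __name__ == "__main__":
    print(render(IMAGE_OF_T))
    print("dark pixels:", count_dark_pixels(IMAGE_OF_T))
    print("character code of editable text:", ord(EDITABLE_TEXT))
```

A person reads the grid as a "T", but a program sees only 25 pixel values. Recognition is the step that maps such a picture to the character itself.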
Basic Steps of Creating a Web Page
These are the basic steps of OmniPage Web’s HTML-conversion process.
1 Bring a document image into OmniPage Web. You can scan a paper document or load an image file. The resulting image appears in OmniPage Web’s image view. See “Bringing Document Images into OmniPage Web” on page 24 for more information.
2 Create zones to identify areas you want to recognize as text or retain as graphics.
The OmniPage Web Desktop
Before a document is outlined, OmniPage Web’s desktop displays the pages of the open document in its thumbnail view, image view, and text view. You can use buttons in the Standard, AutoWeb, and Zone toolbars to perform various tasks on the document.
[Figure: the desktop before outlining, with the Standard, Zone, and AutoWeb toolbars labeled. Callouts: The image view displays the current page’s original image. The thumbnail view displays a picture of each page in the document.]
After a document is outlined, OmniPage Web’s desktop displays the document outline in outline view, the original image in image view, and a preview of the HTML document in HTML view.
[Figure: the desktop after outlining, with the Outline toolbar labeled. Callouts: The image view displays the current page’s original image. The outline view displays an outline of the original document objects. Drag this splitter up or down to resize a view. The HTML view displays a preview of the HTML-formatted document.]
AutoWeb Toolbar
The AutoWeb toolbar contains buttons that can activate each step of the HTML-conversion process.
[Figure: the AutoWeb toolbar, with the AUTO, Image, Zone, OCR, Outline, and Export buttons labeled. Callout: Click the down arrow to display the commands in a button’s drop-down list.]
You can set different commands in the AutoWeb toolbar buttons for the operations you want to perform. Choose a command using each button’s drop-down list.
Standard Toolbar
The Standard toolbar contains buttons and a drop-down list for performing standard tasks.
[Figure: the Standard toolbar. Button labels: New, Save, Open, Proofread OCR, Print, Undo, Copy, Image Editor, View HTML, Straighten, Option, Image Options, Zoom, Rotate Image, Help.]
Zone Toolbar
The Zone toolbar contains buttons that allow you to draw and define zones on a page image.
Outline Toolbar
The Outline toolbar contains buttons that allow you to filter which objects are visible in the outline, making the outline easier to read and edit. You can also use the Outline toolbar to change the object hierarchy by promoting, demoting, changing, or deleting objects.
[Figure: the Outline toolbar, with its buttons for promoting, demoting, changing, and filtering objects labeled.]
HTML Options Dialog Box
You can select settings for HTML components in the HTML Options dialog box. To open it, click the HTML Options button or choose HTML Options... in the Tools menu. Click the tabs in the HTML Options dialog box to view and select different settings. See Chapter 4, OmniPage Web Settings, for more information on settings.
Chapter 3 Processing Documents
This chapter describes how to work with documents in OmniPage Web, including each step of converting paper documents to HTML. There are different ways to accomplish the same tasks in OmniPage Web. You can use toolbar buttons or menu commands to start procedures. OmniPage Web can perform all steps automatically, or you can start each step individually. You can even do different tasks at the same time.
Ways to Process Documents
OmniPage Web instantly turns a paper document into an HTML file that you can publish as a Web page. The basic steps of OmniPage Web’s HTML-conversion process are explained on page 13. The following is a summary of those steps.
1 Bring a document image into OmniPage Web. See page 24 for more information.
2 Create zones to identify areas you want to recognize as text or retain as graphics. See page 27 for more information.
The first wizard screen appears.
3 Answer the question in the first screen and click Next.
4 Continue answering questions in the screens that follow.
Automatic Processing
Use the AUTO button to process a new document from start to finish or to finish processing an open document.
To process your document automatically:
1 Set AutoWeb as the command in the AUTO button’s drop-down list.
2 Set the desired Image, Zone, OCR, Outline, and Export commands.
Bringing Document Images into OmniPage Web
Performing Multiple Tasks at Once
OmniPage Web takes advantage of your computer’s ability to handle more than one process at a time. You can simultaneously scan, create zones, recognize, and edit documents before outlining. For example, if you scan a multiple-page document, you can draw zones on an image as soon as the first page is scanned and you can edit recognized text as soon as it appears in the text view.
4 Click the Image button or choose Scan Image in the Process menu. Pages are scanned in order and combined into one working document.
Loading Image Files
You can load image files into OmniPage Web. An image file is an electronic picture of text, such as a scanned paper document or an electronic fax, that is saved in an image file format such as PCX or TIFF. If a document is already open, loaded image files are inserted as new pages.
You can Shift-click or Ctrl-click to select multiple files in the same folder.
5 Click Advanced if you want to select files from more than one folder.
• Select a file and click Add to put it in the Selected Files list.
• Click Add All to add all files from the current folder.
6 Click Open when you have selected all the files you want to load. Image files are loaded in the order selected and combined into one working document.
Creating Zones for OCR
Page images are displayed in OmniPage Web’s image view where zones are created before OCR. Zones are borders that identify areas of an image that will be recognized as text or retained as graphics. Any part of an image not enclosed by a zone is ignored during OCR and outlining.
[Figure: a zoned page image. Callouts: This is a table zone. It will be kept in a row-and-column format during OCR. These are text zones. They will be converted to text during OCR. This is an unzoned area.]
2 Click the Zone button or choose Auto Zones in the Process menu. OmniPage Web automatically draws zones on the current page in the image view. Each zone has a number indicating its order and a picture indicating its zone type.
Performing OCR on a Document
Performing OCR converts an image to editable text. This is also referred to as recognizing text. OmniPage Web recognizes only machine-printed characters such as laser-printed or typewritten text.
Proofreading OCR Results
After you perform OCR, recognized text appears in the text view where you can proofread the results. Proofreading starts automatically if you chose OCR and Proof as the OCR process command. OmniPage Web marks suspected errors in green and inserts a red “reject” character for any character it cannot recognize. To turn off these color markers, choose Show Markers in the View menu so that it is deselected.
Outlining a Document
Modifying Words
After performing OCR, you can compare recognized text against the original image to verify that the text was recognized correctly. You can modify the recognized text in the Modify Word dialog box.
To modify recognized words:
1 Double-click any word in the text view. The Modify Word window opens and shows a picture of the original word and its surrounding area.
[Figure: the Modify Word dialog box. Callout: This window shows a picture of the original image. Click inside it to enlarge or reduce the picture.]
3 Highlight an object and click the appropriate button in the Outline toolbar to make changes to the outline. See the next section, “Editing Outline Results,” for more information.
Editing Outline Results
After outlining, the original document objects appear in outline format in the outline view where you can edit the results. Use the Outline toolbar to change the object hierarchy.
[Figure: the outline view. Callout: The document outline appears in this window.]
To filter which objects appear in the outline:
1 Click the Filter Objects button in the Outline toolbar to open the Filter Objects dialog box.
2 Select which objects you want to see in the outline. By default, body text is deselected to make the outline easier to read.
[Figure: the Filter Objects dialog box. Callouts: Select which objects you want to see in the outline and click this button. These are the objects that OmniPage Web looks for in your document during outlining.]
Selecting HTML Components
You can make your Web site even more usable by adding HTML components. Components are parts of an HTML document that make it interesting and functional, such as a hyperlinked table of contents, copyright notice, or navigation panel.
To select and format HTML components:
1 Click the HTML Options button in the Standard toolbar or choose HTML Options... in the Tools menu to open the HTML Options dialog box.
Working with Documents
OmniPage Web’s thumbnail, image, text, outline, and HTML views allow you to look at and work with pages in the current document. Once pages are recognized, the image, text, and thumbnail views are visible.
[Figure: the desktop after recognition, showing the image view, thumbnail view, and text view. Callouts: Drag this splitter up or down to resize a view. Drag this splitter to the left or right to resize a view.]
Once recognition is complete, OmniPage Web analyzes the structure of your document and creates an outline. After outlining, the thumbnail view is hidden behind the outline view, and the text view is replaced by the HTML view.
[Figure: the desktop after outlining, showing the image view, outline view, and HTML view. Callouts: Drag this splitter up or down to resize a view. Drag this splitter to the right to make the thumbnail view visible after outlining. Drag this splitter to the left or right to resize a view.]
You can also click your right mouse button in the view you want to resize and select a size option in the shortcut menu. (If you are resizing the image view, click outside of a zone.)
Changing Pages
Before outlining, the thumbnail view, image view, and text view all display the same page of a document. After outlining, the image and HTML views display the same section of the document that is currently selected in the outline view.
• Click the section of the outline that you want to display. The selected section of the outline is displayed in bold.
• Click the Next Page or Previous Page buttons at the lower-right corner of the OmniPage Web desktop.
• Choose Next Page, Previous Page, or Go to Page... in the Edit menu.
Reordering Pages
You can reorder pages in a document by dragging their thumbnails to different positions in the thumbnail view.
Deleting Pages
If you delete a page from a document in OmniPage Web, the thumbnail, original image, and recognized text for that page are all deleted.
To permanently delete pages:
• Choose Delete Current Page in the Edit menu to delete the currently displayed page.
• Select one or more thumbnails of pages you want to delete and press the Delete key.
• Right-click on a thumbnail and select Clear.
Saving a Document
To save your document:
1 Choose Save As... in the File menu. You can also click the Export button with Save As... selected in the drop-down list. The Save As dialog box appears.
2 Select a folder location and file type for your document. To use your document on the World Wide Web, save it as an HTML file type. View your document in as many browsers as possible to confirm that the formatting is supported.
To save original images:
1 Choose Save Image... in the File menu. The Save Image dialog box appears.
2 Select a folder location and file type for your document. See “Supported File-Format Types” on page 77 for a complete list of supported file types.
3 Type in a file name and select Save and Image options.
4 Click OK. The image is saved to disk as specified. (Zones and recognized text are not saved with the file.
Testing Your HTML Document
It is important to test your HTML document once you have finished processing and formatting.
To test your HTML document:
1 Set Save and Launch as the command in the Export button’s drop-down list.
2 Click the Export button to save your document as an HTML file and launch your Web browser.
• Check the image download speed.
Chapter 4 OmniPage Web Settings
This chapter describes the settings in the AutoWeb toolbar, the Options dialog box, and the HTML Options dialog box. Please also look in OmniPage Web’s online Help for more detailed information on settings. The settings you select for processing documents can greatly affect HTML results. You may have to experiment with different settings to get the results you want.
Setting AutoWeb Toolbar Commands
The AutoWeb toolbar buttons allow you to take a document through each step of the process. Every toolbar button has different process commands that can be set for the operations you want to perform. OmniPage Web can go through all steps automatically, or you can start each step individually.
AUTO Button Commands
Use the AUTO button to process a new document from start to finish or to finish processing an open document. The AUTO button’s drop-down list contains the AutoWeb and Web Wizard commands.
AutoWeb
Select AutoWeb to finish processing a new or open document according to the selected process commands. See “Automatic Processing” on page 23 for more information.
Image Button Commands
Use the Image button to bring a document image into OmniPage Web’s image view. The Image button’s drop-down list contains the Load Image and Scan Image commands.
Load Image
Select Load Image to load existing image files such as TIFF, DCX, BMP, JPG, or PCX files.
Scan Image
Select Scan Image to scan paper documents in your scanner.
Zone Button Commands
Use the Zone button to automatically create zones on document images. Zones are bordered areas that specify what will be recognized as text or retained as graphics on an image. The Zone button’s drop-down list contains the Single-Column Pages, Multiple-Column Pages, Spreadsheet Pages, and Mixed Pages commands and the names of any zone templates you have created. See “Creating Zones for OCR” on page 27 for more information.
OCR Button Commands
Use the OCR button to perform the selected OCR operation on document images. The OCR button’s drop-down list contains the Perform OCR and OCR and Proof commands.
Perform OCR
Select Perform OCR to recognize text on document images. During OCR, OmniPage Web analyzes the image and identifies characters to produce editable text. See “Performing OCR on a Document” on page 28 for more information.
Outline Button Commands
Use the Outline button to perform the selected Outline operation on document images. The Outline button’s drop-down list contains the Outline and Defer Outlining commands.
Outline
Select Outline to outline the recognized document structure. During outlining, OmniPage Web detects original objects such as headings, body text, and headers and footers. It also links cross-references, e-mail addresses, and URLs to their destinations.
Export Button Commands
Use the Export button to save recognized text and retained graphics and to launch your Web browser to view your HTML document. The Export button’s drop-down list contains the Save As, Save and Launch, and Defer Export commands.
Save As
Select Save As to save a recognized document to disk as an OmniPage Web document (*.wmt) or an HTML file.
Selecting Options
Click the Options button or choose Options... in the Tools menu to open the Options dialog box. Click each tab to view and select different settings. Click for a description of each setting. Default settings are shown in most examples that follow. However, documents require different settings depending on their input attributes and your output goals. To get the best results, learn how to identify document characteristics and make selections for them.
Accuracy Settings
Click the Accuracy tab to select settings that affect OCR accuracy.
[Figure: the Accuracy tab. Callouts: The Language Analyst evaluates and replaces unknown words with words most likely to be correct during OCR. Select the type of characters that are in your document. Usually, these settings should be selected for optimal accuracy.]
Scanner Settings
Click the Scanner tab to select settings for scanning pages. The Scanner tab appears only if you have installed Scan Manager. Depending on your particular scanner, you might also need to have your scanner connected and turned on for the Scanner tab to appear.
[Figure: the Scanner tab. Callouts: This is recommended for black and white pages. This is recommended for pages with colored backgrounds, colored text, or pages containing grayscale graphics.]
Page Format Settings
Click the Page Format tab to select settings that determine how the formatting of a page is handled during OCR and outlining.
[Figure: the Page Format tab. Callouts: Select a setting that best describes how your original page looks. The resolution is the number of dots, or pixels, that make up an image. A higher resolution will produce a better quality image. The resolution cannot be changed after an image has been loaded into OmniPage Web.]
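Scanner resolution is usually stated in dots per inch (dpi), so the number of pixels in an image depends on both the resolution and the page size. The Python sketch below works through the arithmetic. The 8.5 by 11 inch page size is an assumption made for the example, and the helper names `pixel_dimensions` and `uncompressed_bytes` are hypothetical; the size estimate ignores file headers, row padding, and compression.

```python
def pixel_dimensions(width_in, height_in, dpi):
    """Return (width, height) in pixels for a page scanned at the given dpi."""
    return round(width_in * dpi), round(height_in * dpi)


def uncompressed_bytes(width_px, height_px, bits_per_pixel):
    """Rough raw image size in bytes, ignoring headers, padding, compression."""
    total_bits = width_px * height_px * bits_per_pixel
    return (total_bits + 7) // 8  # round up to whole bytes


if __name__ == "__main__":
    # Assumed page size for illustration: 8.5 x 11 inches.
    for dpi in (300, 600):
        w, h = pixel_dimensions(8.5, 11, dpi)
        print(f"{dpi} dpi: {w} x {h} pixels")
        for bits in (1, 8, 24):
            size = uncompressed_bytes(w, h, bits)
            print(f"  {bits}-bit: about {size:,} bytes uncompressed")
```

Doubling the resolution quadruples the pixel count, which is one reason a higher setting costs more memory and processing time.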
Language Settings
Click the Language tab to select language settings for your document.
[Figure: the Language tab. Callouts: Select the language that appears most in your document. Select additional languages for a multilanguage document. You must have installed those languages during installation. This is the character used in place of unknown characters. You can enter your own choice.]
OmniPage Web is intended for English-only documents.
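Because the reject character stands in for every character that could not be recognized, counting it gives a quick estimate of how much proofreading a saved text needs. The Python sketch below shows the idea for text handled outside OmniPage Web. The function name `reject_report` is hypothetical, and the "~" used as the reject character is an assumption for the example; use whatever character is set in the Language tab.

```python
def reject_report(text, reject_char="~"):
    """Return (line_number, reject_count) for each line containing rejects.

    reject_char is an assumed placeholder character, not a documented default.
    """
    report = []
    for number, line in enumerate(text.splitlines(), start=1):
        count = line.count(reject_char)
        if count:
            report.append((number, count))
    return report


if __name__ == "__main__":
    sample = "Omni~age Web\nThis line is clean.\n~~ab"
    for number, count in reject_report(sample):
        print(f"line {number}: {count} unrecognized character(s)")
```

Choosing a reject character that never occurs in the document itself keeps such counts reliable.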
Process Settings
Click the Process tab to set commands and settings for each step of OCR.
[Figure: the Process tab. Callouts: The Web Wizard will guide you through the HTML-conversion process when you click the AUTO button on the AutoWeb toolbar. Specifies where newly loaded or scanned images are to be added to an open document. These specify the processing steps that you want. Click this to change the browser or editor that automatically launches when you select Save and Launch.]
Selecting HTML Options
Click the HTML Options button or choose HTML Options... in the Tools menu to open the HTML Options dialog box. This is the central location for HTML settings. Click each tab to view and select different settings. Click for a description of each setting.
General Settings
Click the General tab to set commands and settings for your HTML document.
[Figure: the General tab. Callouts: Select this if you do not want your HTML document formatted. Specifies what you want OmniPage Web to use as the title of your HTML document. Specifies where you want your HTML document divided into separate pages. Select this to have OmniPage Web create a link to the original page image in your HTML document. Select whether or not you want to use graphic navigation controls.]
Components Settings
Click the Components tab to select which components you want included in your HTML document, and where you want the components to appear on the final Web page.
[Figure: the Components tab. Callouts: Select the order in which you want the components to appear on your Web page by clicking the up and down arrow buttons. Select options for each component.]
Component Styles Settings
Click the Component Styles tab to select formatting options for each component in your HTML document.
[Figure: the Component Styles tab. Callouts: Select this for more formatting options if you know your visitors have browsers that support cascading style sheets. Select the components you want to edit. Available formatting options change for each component.]
Chapter 5 Customizing Your Web Page
OmniPage Web has many features that allow you to create customized Web pages. This chapter describes how to use these features. Please continue reading this chapter for information on these topics:
• Making Your Web Page More Effective
• Using Themes
• Adjusting Page Images Before OCR
• Customizing Zones
• Creating User Dictionaries
OmniPage Web’s user dictionaries are saved in the data folder in your installation folder.
Making Your Web Page More Effective
Organizing electronic documents is a challenge, but when it is done well, your Web page visitors can quickly navigate through large amounts of information and cross-reference other topics without having to dig through unnecessary text. Here are some suggestions for making your Web page easy and enjoyable to visit.
Organize your Web page to make it quicker and easier for visitors to skim the information.
If your background image is dark, change the text colors to light shades so that they show up, and also make the document background color dark. Otherwise, if the image fails to load (or takes a long time to load), the text will be unreadable.
Include pictures to illustrate your text.
• Add images.
• Add image maps.
• Add links to original images.
Using Themes
A theme has Web page design attributes including font, page layout, border style, and background. Themes allow you to instantly format your HTML document, and are useful for creating consistently formatted Web pages. OmniPage Web provides a selection of fun and professional themes for you to use, or you can create and save one of your own.
To select a theme:
1 Click the HTML Options button in the Standard toolbar, or choose HTML Options... in the Tools menu.
2 Click Load Themes...
To save a new theme:
1 Open the HTML Options dialog box and select one of the provided themes, or begin selecting your own settings.
2 Click Save Themes... to open the Save Themes dialog box.
3 Type in a file name for the new theme. All the current settings in the HTML Options dialog box are saved as a theme file with an .hfo extension.
4 Click OK.
Adjusting Page Images Before OCR
You can rotate and straighten page images in OmniPage Web’s image viewer before zoning and OCR take place. This is recommended to improve OCR accuracy on pages that are not oriented correctly. If you need to rotate or straighten a page, be sure to do so before you create zones because all zones are deleted during these operations.
To rotate a page image:
1 Click on the page image to make the image viewer active.
Customizing Zones
Zones are borders created around areas of a page image to identify what will be recognized as text or retained as a graphic during the HTML-conversion process. Zones play a big part in determining outline results. You can create zones automatically, manually, or with a template. See the online Help for more information.
Reordering Zones
The numbered order of zones determines the order in which text will be placed on a recognized page, objects will be placed in the outline, and components will appear in the HTML document. Make sure the zone order is acceptable before performing OCR and outlining your document.
To reorder zones:
1 Click the Reorder Zones button. The numbers in the zones disappear.
2 Click within the zone you want recognized first. The number 1 appears in the zone.
4 Hold down the mouse button and drag the handle in the direction that you want to enlarge or reduce the zone.
5 Release the mouse button when you are done. The zone border changes to display the modified zone area.
Deleting Zones
You can delete the current zones if you want to create new zones. You can also delete individual zones that you do not want to process during OCR. Any part of a page image not enclosed by a zone is ignored during OCR.
Zone Type
Every zone on a page has a zone-type setting.
2 Click the Zone Properties button to open the Zone Properties dialog box.
[Figure: the Zone Properties dialog box, with its Close button labeled.]
The settings in this dialog box will be blank if multiple zones with different settings are selected.
3 Select a zone type for the selected zones. If you change an irregular-shaped zone to a Table type zone, OmniPage Web substitutes the largest rectangle that fully encloses the irregular area.
4 Select a zone content for the selected zones.
Creating User Dictionaries
Two dictionaries are used when you perform OCR and check for errors: the dictionary for the language you are using, and a user dictionary where you can add special words manually. You can create multiple user dictionaries, but you can only use one at a time. You can select a user dictionary in the Language tab of the Options dialog box.
To customize a user dictionary:
1 Choose Edit User Dictionary... in the Tools menu.
Chapter 6 Technical Information
This chapter provides troubleshooting and other technical information about using OmniPage Web. Please also read the online Readme file and the Scanner Setup Notes. The Scanner Setup Notes list all supported scanners and any connection or software-driver issues. The Readme file contains last-minute information relating to OmniPage Web.
General Troubleshooting Solutions
Although OmniPage Web is designed to be easy to use, problems sometimes occur. Many of the onscreen error messages contain self-explanatory descriptions of what to do (check connections, close other applications to free up memory, and so on). Sometimes that is all the troubleshooting help you need. Please see your Windows documentation for information on optimizing your system and application performance.
Testing OmniPage Web

Restarting Windows 95 or 98 in safe mode or Windows NT in VGA mode allows you to test OmniPage Web on a simplified system. This is recommended when you cannot resolve crashing problems or if OmniPage Web has stopped running altogether. See Windows online help for more information. Your scanner will not run with OmniPage Web in safe mode or VGA mode, so do not test scanner problems in this configuration.
Low Memory Problems

OmniPage Web may run poorly under low-memory conditions. Signs of low memory include various error messages, or OmniPage Web working slowly and accessing the hard drive often. Try these solutions for low-memory conditions:
• Restart your computer.
• Close other open applications to release memory.
• Close unnecessary OmniPage Web windows.
• Defragment your hard disk to free up contiguous blocks of disk space. See Windows online help for instructions.
Supported File-Format Types

OmniPage Web can open these file-format types:
• BMP, Bitmap (*.bmp)
• DCX (*.dcx)
• GIF (*.gif)
• JPEG (*.jpg)
• OmniPage Web Document (*.wmt)
• PCX (*.pcx)
• TIFF uncompressed (*.tif)†
• TIFF Group 3 or 4, compressed (*.tif)†
• TIFF Packbits (*.tif)

†TIFF files can be single- or multiple-page; line art, grayscale, or color. They can be up to 600 dpi, but 300 dpi is recommended for optimal OCR accuracy.

Image files can be loaded at bit depths of 1, 8, or 24.
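If you prepare image files with a script before loading them, the limits above can be checked in advance. The following is a minimal sketch, not part of OmniPage Web: the function name check_image and its parameters are hypothetical, and it assumes you already know each file's bit depth and resolution from your scanning or imaging software.

```python
import os

# Values taken from the list above; the names are illustrative only.
OPENABLE_EXTENSIONS = {".bmp", ".dcx", ".gif", ".jpg", ".pcx", ".tif", ".wmt"}
SUPPORTED_BIT_DEPTHS = {1, 8, 24}
MAX_TIFF_DPI = 600


def check_image(filename, bit_depth, dpi):
    """Return a list of problems; an empty list means none were found."""
    problems = []
    extension = os.path.splitext(filename)[1].lower()
    if extension not in OPENABLE_EXTENSIONS:
        problems.append("unsupported file type: " + (extension or "(none)"))
    if bit_depth not in SUPPORTED_BIT_DEPTHS:
        problems.append("unsupported bit depth: " + str(bit_depth))
    # The 600 dpi ceiling is stated for TIFF files only.
    if extension == ".tif" and dpi > MAX_TIFF_DPI:
        problems.append("TIFF resolution above 600 dpi: " + str(dpi))
    return problems


print(check_image("invoice.tif", 8, 300))
print(check_image("photo.png", 16, 300))
```

The sketch does not warn about resolutions other than 300 dpi, because 300 dpi is a recommendation for OCR accuracy rather than a limit.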
OmniPage Web can save recognized text to these file formats:
• HTML (*.htm)†
• OmniPage Web Document (*.wmt)

†When OmniPage Web saves a document in HTML format, additional files are created. These files may include graphics files, image map files, or cascading style sheet files (*.css).

Scanner Setup Issues

This section contains information on setting up your scanner and solutions for scanning problems you may encounter.
Scanner Drivers Supplied by Caere

OmniPage Web is shipped with special scanner drivers that allow it to communicate with supported scanners. These scanner driver files are installed on your computer when you install Caere Scan Manager. These drivers often work in conjunction with the drivers from your scanner manufacturer. To use your scanner with OmniPage Web, you must select the appropriate scanner in Caere Scan Manager.
Problems Connecting OmniPage Web to Your Scanner

Try these solutions if OmniPage Web cannot communicate with your scanner or if you receive a scanner error message when you launch OmniPage Web.
• Make sure the scanner is supported by OmniPage Web with your version of Windows 95 or 98, or Windows NT. A list of tested scanners is provided in the Scanner Setup Notes.
Scanner Message on Launch

The first time you launch OmniPage Web after installing or changing your current scanner in the Caere Scan Manager, you may get this message:

This scanner’s configuration is set using the system-level driver.

If the dialog box asks for no further information, click OK. You may also have the option to select the following:
• SCSI ID or scanner configuration information
Consult your scanner documentation for the correct information.
Scanning Tips

OCR results will be poor if an image is not scanned properly. Remember the following tips when you scan:
• Take the color and quality of your document into account when scanning. High-quality documents return better recognition results than low-quality documents. Shaded, colored, or low-quality documents may result in poor recognition accuracy unless adjustments are made before scanning.
• Always try to scan an original document instead of a photocopy.
OCR Problems

This section contains information and solutions for possible OCR problems. Topics in this section include:
• System Crash During OCR
• Text Does Not Get Recognized Properly
• Problems With Fax Recognition

System Crash During OCR

Try these solutions if a crash occurs during OCR or if processing takes a very long time:
• Resolve low memory problems. See “Low Memory Problems” on page 76 for more information.
• Resolve low disk space problems.
document again. See “Changing Zone Properties” on page 69 for more information.
• Adjust the Brightness slider in the Scanner settings of the Options dialog box. Lighten the setting for thick, run-together text characters or dark backgrounds. Darken the setting for thin, broken text characters.
• Make sure the correct main and secondary document languages are selected in the Language settings.
Index

A
Accuracy settings 52
Acquiring images 24
Add to Zones button 67
Adding
  components to your HTML document 33
  pages to a document by loading image files 25
  pages to a document by scanning 24
  words to your user dictionary 72
ADF 24
Adjusting
  page images before OCR 66
  view of pages 35
AUTO button
  automatic processing 23
  described 45
  using the Web Wizard 22
Auto Zones command 47
Auto zoning procedure 27
Automatic processing 23
AutoWeb toolbar
  described 16
  Export button 50
  Image button 46
  location of 8, 1
  saving original images 40
  saving recognized text 39

F
Fax files 26
Faxes
  improving recognition accuracy of 84
Filter objects 18, 31
Finishing the current document 23

G
Getting images 24
Getting online Help 3
Going to a particular page 36
Graphic editor
  see OmniPage Web’s online help
Graphics
  zone type for 70
Green text 29

H
Handwritten text 28
Hard disk space
  minimum required 6
Help, online 3
Home page for Caere 4
HTML
  components 13
  converting to 13, 21
  editor 6
  formatting components 33
  selecting compone
  document structure 13
  editing 31
Outline button 49
Outline commands
  Defer Outlining 49
Outline toolbar
  buttons in 18
Outlining 13, 30
  automatic processing 23

P
Page Format settings 54
Pages
  changing 36
  deleting 38
  loading image files 25
  reordering 37
  resizing view of 35
  scanning 24
PaperPort, and missing Scan Image command 80
Performing OCR 28
Printing text and images 38
Process commands
  AUTO 45
  Export 50
  OCR 48
  Outline 49
  setting 44
  Zone 47
Process settings, Options dialog box 56, 58, 59, 60
Processing d
  Standard 17
  Zone 17, 67
Troubleshooting 74 to 84
  general solutions 74
  low disk space problems 76
  low memory problems 76
  OCR problems 83
  product support services 4
  scanning problems 78
  text does not get recognized 83

U
Undoing changes 38
User dictionary
  creating or editing 72
  for Microsoft Word 72
Using online help 3

V
Viewing and resizing pages 35
Viewing original images 30
Visioneer scanners, and missing Scan Image command 80

Z
Zone borders
  see Zones 68
Zone button 47
Zone properties
  changing 70
  describ