ReadirisTM Pro 12 User Guide
ReadirisTM Pro 12 – User Guide Table of Contents Copyrights ........................................................................................... 1 Chapter 1 Introducing Readiris ................................................ 3 Save time, avoid retyping.................................................. 3 The Readiris series ............................................................ 6 Chapter 2 Installing Readiris .................................................... 9 System requirements ........
Table of Contents Scanning paper documents.............................................. 26 Chapter 6 Adjusting scanned documents ............................... 33 Chapter 7 Saving documents as image files ........................... 39 Chapter 8 Windowing documents........................................... 41 Windowing documents automatically ............................. 41 Windowing documents manually .................................... 43 Using windowing templates ...................................
ReadirisTM Pro 12 – User Guide Creating XPS documents ................................................ 75 Selecting the XPS options ............................................... 77 iHQC compressing XPS documents ............................... 78 Selecting the graphics options......................................... 79 Chapter 11 Saving and loading settings ................................. 81 Chapter 12 Recognizing multipage documents ......................
ReadirisTM Pro 12 – User Guide Copyrights ReadirisPro12-dgi-110209-04 Copyrights © 1987-2009 I.R.I.S. All Rights Reserved. I.R.I.S. owns the copyrights to the Readiris software, to the online help system and to this publication. The information contained in this document is the property of I.R.I.S. Its content is subject to change without notice and does not represent a commitment on the part of I.R.I.S.
ReadirisTM Pro 12 – User Guide CHAPTER 1 INTRODUCING READIRIS SAVE TIME, AVOID RETYPING Congratulations on acquiring Readiris. This software package will undoubtedly be of great help in recapturing your texts, tables, graphics, barcodes and handprinted texts. As efficient as computers are, you have to key in your information first. If you have ever retyped a 15 page report or a large table of figures, you know how tedious and time-consuming it can be.
Chapter 1 – Introducing Readiris spreadsheet, archive them as PDF or XPS files, etc. To recognize faxes and convert PDF documents, drag their image files from Windows Explorer to the Readiris application window. Or send an image promptly to Readiris via the context menu. Readiris recognizes tabular data and recreates them as worksheets in your spreadsheet software or as table objects inside your word processor; your numeric data are immediately ready for further processing.
ReadirisTM Pro 12 – User Guide solutions you confirm are memorized, increasing the system speed and confidence and rendering the system more intelligent as you go along. This powerful learning tool also allows you to train Readiris on special characters such as mathematical symbols and dingbats and to handle distorted fonts. To increase your productivity further, Readiris not only recognizes your texts, but can format them for you as well. Various levels of formatting are available.
Chapter 1 – Introducing Readiris THE READIRIS SERIES The table below gives an overview of the available versions: Readiris Home 12 Limited features 25 recognition languages Supports PDF, DCX, DJV, DJVU, JPG, JPEG, J2C, J2K, JP2, PNG, TIF, TIFF, BMP, PCX Generates PDF Image-Text, DOCX, ODT, WordML, SpreadsheetML, RTF, HTM, XML, TXT, TIFF, etc.
ReadirisTM Pro 12 – User Guide Readiris Pro 12 Asian Readiris Corporate 12 Asian Basic features Basic features 128 recognition languages 128 recognition languages Supports PDF, DCX, DJV, DJVU, JPG, Supports PDF, DCX, DJV, DJVU, JPG, JPEG, J2C, J2K, JP2, PNG, TIF, TIFF, JPEG, J2C, J2K, JP2, PNG, TIF, TIFF, BMP, PCX. BMP, PCX.
Chapter 1 – Introducing Readiris BMP, PCX. BMP, PCX. Generates four types of PDF files, PDF- Generates four types of PDF files, PDF- iHQC (level I), four types of XPS, XPS- iHQC (level I-III), PDF/A, four types of iHQC (level I), DOCX, ODT, XLS, XPS, XPS-iHQC (level I), DOCX, ODT, WordML, SpreadsheetML, RTF, HTM, XLS, WordML, SpreadsheetML, RTF, XML, TXT, TIFF, etc. HTM, XML, TXT, TIFF, etc.
ReadirisTM Pro 12 – User Guide CHAPTER 2 INSTALLING READIRIS SYSTEM REQUIREMENTS This is the minimal system configuration required to use Readiris: a 486-based Intel PC or compatible. A Pentium-based PC is recommended. 256 MB RAM. 120 MB free disk space. (105 MB of disk space suffices when you do not install the sample files) the Windows Vista, Windows XP or Windows 2000 operating system. Note that some scanner drivers may not work under the latest version(s) of Windows.
Chapter 2 – Installing Readiris SOFTWARE INSTALLATION To install the software: Log on to Windows as administrator or make sure you have the necessary administration rights. Connect your scanner to your PC and install the corresponding software. Test your scanner. If you experience any problem contact your scanner manufacturer. Insert the Readiris CD-ROM in the CD-ROM drive and follow the on-screen instructions to install the software.
ReadirisTM 12 Pro – User Guide Repeat the installation process to install any additional software from the CD-ROM. UNINSTALLING THE SOFTWARE There is only one correct way to uninstall Readiris: by using the Windows (un)install wizard. You are strongly recommended not to uninstall Readiris or any of its software modules by manually erasing the program files. To uninstall Readiris: Close the application. On the Start menu, click Control Panel. Under the Programs icon, click Uninstall a program.
Chapter 2 – Installing Readiris be entitled to special offers on I.R.I.S. products. To register: Use the Registration wizard on the Register menu. Follow the onscreen instructions. PRODUCT SUPPORT Once you have registered your product, you are entitled to product support from I.R.I.S. on all basic software functionalities. Contact I.R.I.S. at: Europe: support@irislink.com Tel:+32 10 45 13 64 USA: support@irisusa.com Tel.:+1 800 447 4744 Asia-Pacific: support@irislink.com Tel.
ReadirisTM Pro 12 – User Guide CHAPTER 3 GETTING STARTED RUNNING READIRIS To run Readiris: Start Readiris from the Windows Start menu or double-click the shortcut on your desktop. Click anywhere in the startup screen to launch Readiris. The OCR Wizard automatically opens. USING THE OCR WIZARD The OCR Wizard allows you to define all the settings needed to operate Readiris efficiently. When you start Readiris, click anywhere in the startup screen to start the OCR Wizard.
Chapter 3 – Getting started Step 1 Select the image source. You can capture images using your scanner or open image files. Select the rotation and deskewing options you want to use. For more information, see the section Selecting the options. To familiarize yourself with Readiris, use the sample images provided with the software. They can be found on the Readiris CD-ROM and in the subfolder Samples of the Readiris installation folder. Click Next to go to the next step.
ReadirisTM Pro 12 – User Guide Select the required output format or application in the Send to or External file list. Click the various tabs and select the options of your choice. Options that are unavailable for the chosen format/application appear dimmed. For more information, see the chapter Formatting and saving documents. Click OK to save the settings. Click Next to go to the next step. Step 5 Click GO to open/scan and recognize the document.
Chapter 3 – Getting started The Readiris interface is composed of: the SmartTasks (in the middle) The SmartTasks are predefined commands that allow you to use the most frequent Readiris functions at the touch of a button. Click the SmartTask you want to use to scan, recognize and send your documents to the target application or output format of your choice. The SmartTasks apply default settings but can be configured easily by right-clicking to fit more particular needs.
ReadirisTM Pro 12 – User Guide The document panel displays statistical information about the documents that are open in Readiris, such as the scan and OCR time, the resolution, width and height of the documents etc. CHANGING THE USER INTERFACE LANGUAGE The user interface of Readiris is available in a wide range of languages. To change the user interface language: On the Settings menu, click User Interface Language. In the Language list, select the required language, then click OK to confirm.
ReadirisTM Pro 12 – User Guide CHAPTER 4 THE READIRIS SMARTTASKS USING THE READIRIS SMARTTASKS When starting Readiris, click anywhere in the Readiris startup screen and click Cancel when the OCR Wizard launches. The Readiris SmartTasks will be displayed. The SmartTasks are predefined commands that allow you to use the most frequent Readiris functions at the touch of a button. Simply click the SmartTasks to scan documents or image files to the target applications and output formats of your choice.
Chapter 4 – The Readiris SmartTasks The various SmartTask buttons allow you to: 1. Scan and recognize documents and send them directly to Word for text processing; Microsoft Word is the default target application. See the section Formatting text documents to learn more about the other available applications. 2. Scan and recognize documents and send them directly to OpenOffice for text processing; OpenOffice.org Writer is the default target application.
ReadirisTM Pro 12 – User Guide The documents will be sent as PDF Image-Text by default via your default e-mail application. See the section Formatting documents to learn more about the other available formats. Note that the SmartTasks apply predefined settings but can be configured easily to fit more particular needs. To configure the SmartTasks: Right-click the SmartTask you want to use. Select Scanner or Image files as image source.
Chapter 4 – The Readiris SmartTasks o When you select Image files and click the SmartTask, Readiris opens the Input dialog box in which you can select the image files you want to process. For more information on opening image files, see the section Opening image files. Click Configure to change the output format and its options. Note that the available output formats and options depend on the selected SmartTask.
ReadirisTM Pro 12 – User Guide CHAPTER 5 SCANNING DOCUMENTS SELECTING THE OPTIONS Before scanning paper documents or opening image files, you can select several image enhancement options. When enabled, these options will be applied during the opening and scanning of documents. Operation Click the Options button on the main toolbar to select several image enhancement options. o Click Page Deskewing to straighten pages scanned at an angle.
Chapter 5 – Scanning documents This way, scanned or opened images will be split up in windows automatically. You can also use the windowing tools on the image toolbar to modify the page analysis results or to window documents manually. For more information, see the chapter Windowing documents. When you are done defining all the settings (Scanner settings, Options), click the Scan or Open button to scan documents or open image files.
ReadirisTM Pro 12 – User Guide Tip: you can also drag image files to the Readiris image window to open them. Tip: Right-click any image file you want to open, point to Open With and click IOCR application. The Readiris software will open and display the image. Tip: when loading multipage image files (TIFF images and DCX faxes) and PDF documents, you can define the page range (in case you only need a certain chapter of a document for instance). To do so, click Open on the main toolbar.
Chapter 5 – Scanning documents Select the image file of your choice and click Open. Note: the options of the Input dialog box also apply to document scanning and are discussed in the Scanning paper documents section. Note that you can specify other settings before opening (or scanning) documents. For more information, see the sections below.
ReadirisTM Pro 12 – User Guide When you process paper documents, Readiris will start your scanner as soon as you click the Scan button and display the scanned document in the interface. To scan documents: Click the Scanner button to set the scanner settings. Note that several of the options in the Scanner dialog box are also available in the Open dialog box. Select the correct scanner model. If your scanner is not in the list, select Twain other models and click OK.
Chapter 5 – Scanning documents Format and Resolution Readiris supports a wide range of paper formats and resolutions. Note that it is recommended to use a scan resolution of 300 dpi. Use a resolution of 400 dpi when recognizing business cards, Asian text or very small print. Color mode Readiris can scan documents and open image files in color, black-and-white and grayscale.
ReadirisTM Pro 12 – User Guide Note that this option never increases the resolution of images scanned with too little detail. Scanning multipage documents When scanning multipage documents and using a scanner equipped with a document feeder, select the ADF (automatic document feeder ) option. Place the pages you want to scan in the feeder and start scanning.
Chapter 5 – Scanning documents Using a digital camera Select Digital camera when you are using a camera as scan source. Readiris uses special recognition routines to process digital camera images. Tips for using a digital camera as scan source: Calibrate the camera by photographing a white document. Always select the highest image resolution. Enable the macro mode of the camera to take close-ups. Only use optical zoom, not digital zoom. Hold the camera directly above the document.
ReadirisTM Pro 12 – User Guide When you are done defining all the settings (Scanner settings, Options), click Scan to scan documents. Note: pay attention to line skew. Line skew over 0.5° increases the risk of OCR errors.
ReadirisTM Pro 12 – User Guide CHAPTER 6 ADJUSTING SCANNED DOCUMENTS When opening or scanning extremely light or extremely dark grayscale and color images, it may be necessary to adjust those images before executing the recognition, in order to obtain satisfactory OCR results. To adjust images: Open or scan a color-grayscale document. Make sure that the scanner settings are correct.
Chapter 6 – Adjusting scanned documents o Select Smoothen color image to even out the image. This option renders grayscale and color images more homogeneous by smoothening out differences in intensity. As a result, a stronger contrast is created between the foreground (text) and background (artwork). Note: sometimes smoothening is the only way to separate text from a colored background.
ReadirisTM Pro 12 – User Guide Example 1: lighten a dark image to eliminate the page background. (Color image) (Binarized image. The default binarization settings yield a black image) (The lightened image yields satisfactory recognition results) Example 2: darken an image when the text is so light it doesn't show up in the binarized image.
Chapter 6 – Adjusting scanned documents (Binarized image. The default brightness settings yield fragmented characters) (The darkened image yields satisfactory recognition results) o Use the slider to increase or decrease the Contrast. The Contrast settings determine the contrast between darker and lighter zones of an image. Use these settings to make character shapes stand out against a colored background.
ReadirisTM Pro 12 – User Guide Despeckling removes small spots from black-and-white images. Click Apply to preview the results. If the results are satisfactory, click OK. If not, change the settings again. Click Recognize + Save to recognize the document.
ReadirisTM Pro 12 – User Guide CHAPTER 7 SAVING DOCUMENTS AS IMAGE FILES Paper documents you scan do not need to be OCRed right away. They can be saved as image files. To do so: Scan the document. On the File menu, click the commands Save Full Page as Image or Save All Pages as Image. Afterwards, open the saved image file and perform the recognition. Saving graphics only You can also choose to save the graphics windows without the text of the document. To do so: Scan or open the document.
ReadirisTM Pro 12 – User Guide CHAPTER 8 WINDOWING DOCUMENTS WINDOWING DOCUMENTS AUTOMATICALLY When scanning or opening documents, Readiris will automatically apply Page Analysis to split up the documents in different windows. The Page Analysis option is selected by default. Click the Options button and disable Page Analysis should you want to avoid automatic page analysis. The page analysis results can be modified manually after automatic page analysis.
Chapter 8 – Windowing documents Page analysis detects text, graphic and table zones automatically. Barcode zones and handprinted zones need to be drawn manually. For more information, see the section Windowing documents manually. Each window type has its own color code: text windows are orange, graphics are purple and table windows pink. Barcode zones are green and handprinted zones blue. The windows are sorted top-down, left to right. Numbers indicate the sort order of the windows.
ReadirisTM Pro 12 – User Guide The part of the page you select will be analyzed automatically. You will be prompted whether you want to exclude the same outer zone from page analysis on every page of the document. WINDOWING DOCUMENTS MANUALLY Besides windowing documents automatically by means of Page Analysis, Readiris allows you to window documents manually. Manual windowing comes in handy when having to modify the automatic page analysis results.
Chapter 8 – Windowing documents Draw a frame around the text blocks, graphics, tables, barcodes and handprinting zones you want to window. For more information on recognizing barcodes and handprinting, see the sections Recognizing barcodes and Recognizing handprinted text, respectively. When you are done windowing the document, click the Recognize + Save button to execute the OCR.
ReadirisTM Pro 12 – User Guide ones. Whenever two windows of the same type intersect, they become a polygon automatically. Automatic page analysis Should the current page be too complex to window manually, click the Analyze page button on the image toolbar to window the page automatically. Note that barcode zones and handprinted zones always need to be drawn manually.
Chapter 8 – Windowing documents Right-click any of the selected windows, point to Window, then to Type and then click the required window type. Modifying the window size Click the window you want to modify. Place the mouse pointer over a marker (on the sides and in the corners of the window). Click the marker and drag the mouse to modify the window size. Moving windows Select the window you want to move. Click inside the window and drag the mouse to modify the position of the window.
ReadirisTM Pro 12 – User Guide or Right-click the selected windows, point to Window, then click Delete. Deleting small windows Some documents, faxes for instance, often have "stray" dots on pages, causing Readiris to create superfluous windows that do not contain text. To erase all small windows, click Delete Small Windows on the Edit menu. This option erases all windows smaller than 0.5" and re-sorts the remaining zones.
Chapter 8 – Windowing documents On the File menu, click the command Load Layout. Select the layout file you saved. To apply the layout to all opened or scanned pages, select Apply Layout to All Pages in the Layout file dialog box. Click Open to load the layout file. Note that when you add a document to Readiris, the layout file must be loaded again as page analysis is enabled by default. Ignore exterior zone As an alternative to windowing templates, you can use the option Ignore exterior zone.
ReadirisTM Pro 12 – User Guide Click Recognize + Save to execute the OCR.
ReadirisTM Pro 12 – User Guide CHAPTER 9 RECOGNIZING DOCUMENTS INTRODUCTION To recognize documents, Readiris applies linguistics during the recognition phase. As a result, Readiris recognizes text, tables, graphics, barcodes and handprinted text in all kinds of documents. Readiris even copes with complex columnized documents, lowquality documents, faxes, dot matrix printouts, badly scanned and copied documents containing too light or dark font shapes, etc.
Chapter 9 – Recognizing documents sure of but also allows to increase the system's accuracy. All solutions you confirm are memorized temporarily during recognition, increasing the system speed and confidence and rendering the system more intelligent as you go along. This powerful learning tool also allows you to train Readiris on special characters such as mathematical symbols and dingbats and to handle distorted fonts.
ReadirisTM Pro 12 – User Guide The 5 most recently selected languages are moved to the top of the language list. Important: select the document language before executing page analysis when you are dealing with Asian, Hebrew and Arabic documents. Specific page analysis routines are used for these documents. The recognition can also be limited to a numeric character set to optimally recognize tables and figures.
Chapter 9 – Recognizing documents Recognizing documents with mixed languages Readiris also allows you to enable mixed character sets. That way Readiris switches languages in the middle of a sentence automatically and recognizes English words (proper names etc.) that occur in "exotic" languages. Click the globe button on the main toolbar and select the required language combination in the language drop-down list. Note: when processing Asian or Hebrew documents, mixed characters sets are used automatically.
ReadirisTM Pro 12 – User Guide Font type Readiris distinguishes between "regular" and dot matrix printed documents. Dot matrix symbols (of the type 9 pin) are made up of isolated, separate dots. Special segmentation and recognition techniques are required to recognize dot matrix documents and need to be activated. To select the font type: On the Settings menu, point to Font type. The font type is set to Automatic by default.
Chapter 9 – Recognizing documents Click Fixed if all characters of the typeface have the same width. This is often the case in old typewriter documents. Click Proportional if the characters of the typeface have a different width. Virtually all fonts in newspapers, magazines and books are proportional. Important: these document characteristics do not apply to Asian, Hebrew or Arabic documents. USING INTERACTIVE LEARNING Readiris offers an interactive learning function.
ReadirisTM Pro 12 – User Guide At the end of the recognition, Readiris enters the interactive learning phase. The characters the recognition system isn't sure of are displayed. If the results are correct: o Click the Learn button to save the result as sure. The learning results are temporarily stored in the computer memory, for the duration of the recognition. Readiris will no longer display the learned characters when OCRing the rest of the document.
Chapter 9 – Recognizing documents Use this command for damaged characters which could be confused with other characters if learned. E.g. the number 1 and the letter I, which have an identical form in many fonts. o Click Delete to delete characters from the output. Use this button to prevent document noise from appearing in the output file. o Click Undo to correct mistakes. Readiris keeps track of the last 32 operations. o Click Abort to abort interactive learning. All learning results will be deleted.
ReadirisTM Pro 12 – User Guide Click Recognize + Save to recognize the document. Readiris enters the interactive learning phase. Use the buttons of the dialog box to save characters in the font dictionary. To use an existing font dictionary: On the Learn menu click Font Dictionary. Select the dictionary you want to use and click Open. On the Learn menu click either Append Font Dictionary or Read Font Dictionary. When selecting Append Font Dictionary, make sure to enable Interactive Learning.
ReadirisTM Pro 12 – User Guide CHAPTER 10 FORMATTING AND SAVING DOCUMENTS FORMATTING DOCUMENTS The documents you OCR in Readiris can be saved in various output formats. Readiris saves OCR results as Adobe Acrobat PDF files, Microsoft XPS files, Word, WordML, RTF and OpenDocument text files, HTML and XML files, SpreadsheetML worksheets, and Ansi and Unicode text files.
Chapter 10 – Formatting and saving documents o sends documents to an application, which will open automatically, or; o saves documents as an external file. The option Send by e-mail creates a new e-mail message and inserts the recognized document as e-mail attachment. Click the different tabs to select the settings you want to apply. Settings that are unavailable for the selected output format appear dimmed.
ReadirisTM Pro 12 – User Guide Note that when saving a multipage document as external file, you can create a separate output file for each page in Readiris or save all pages that belong to the same document to a single output file. Simply click the corresponding options in the Output File dialog box: Create one file per page and Create one file per document, respectively.
Chapter 10 – Formatting and saving documents Layout options The option Create body text avoids text formatting by Readiris. Readiris generates a continuous, running text. The option Retain word and paragraph formatting takes an intermediate position between body text and autoformatting. The font type, size and type style are maintained across the recognition. The tabs and the alignment of each block are recreated. The text blocks and columns aren't recreated; the paragraphs just follow each other.
ReadirisTM Pro 12 – User Guide o The option Use columns instead of frames creates columnized documents. Columnized texts are easier to edit than documents containing multiple frames: the text flows naturally from one column to the next. Note: when the system is unable to detect columns in the source document, this formatting mode uses frames as a fallback position. o The option Insert column breaks inserts a hard column break at the end of each column.
Chapter 10 – Formatting and saving documents The option Merge lines into paragraphs enables automatic paragraph detection. Readiris wordwraps the recognized text until a new paragraph starts, and "reglues” hyphenated words at the end of a line. The option Include graphics includes the graphics in autoformatted files. This is essential to create a true copy of a document.
ReadirisTM Pro 12 – User Guide Click the Paper size tab and use the arrow buttons to apply and exclude paper sizes. Readiris will go through the active paper sizes in the indicated order and will use the first paper size that is sufficiently large to hold the scanned document.
Chapter 10 – Formatting and saving documents For more information on formatting options, see the section Formatting text documents. SpreadsheetML options When selecting Microsoft Excel 2002/2003 as target application, specific SpreadsheetML options are available. Click the tab SpreadsheetML options to display them: Note that the layout option Recreate source document becomes unavailable when this format is selected.
ReadirisTM Pro 12 – User Guide The option Convert figures into numbers encodes recognized figures as numbers. As a result, you can execute arithmetical operations on those cells. The text cells (in any table) remain text. Note that only figures inside tables are encoded as numbers. Excel exclusively executes mathematical operations on data that is encoded as numbers. The option Create one worksheet per page sees to it that one worksheet is created per scanned page.
Chapter 10 – Formatting and saving documents The option Merge lines into paragraphs enables automatic paragraph detection. Readiris wordwraps the recognized text until a new paragraph starts, and "reglues” hyphenated words at the end of a line. The option Retain colors of background recreates the background color of each cell. Paper sizes Depending on the format you selected, you can indicate preferred paper sizes: Click the Paper size tab and use the arrow buttons to apply and exclude paper sizes.
ReadirisTM Pro 12 – User Guide CREATING PDF DOCUMENTS Readiris generates four types of PDF output: Text, Text-Image, Image-Text and Image. To generate PDF output: Click the Format button on the main toolbar and select the PDF type of your choice in the Send to or External file drop-down list: PDF Image When you select PDF Image, Readiris generates image-only PDF documents, it does not execute OCR.
Chapter 10 – Formatting and saving documents PDF Text-Image When you select PDF Text-Image, Readiris recognizes text and creates searchable PDF documents that contain the page image and the recognized text. The page image is contained beneath the text. SELECTING THE PDF OPTIONS To select the PDF options: Click the Format button on the main toolbar and select the PDF type of your choice in the Send to or External file drop-down list. Depending on the PDF type you select, several options are available.
ReadirisTM Pro 12 – User Guide Create bookmarks The option Create bookmarks creates bookmarks for each text block, graphic and table in Adobe Acrobat PDF files. Embed fonts Select the option Embed fonts to embed fonts in Adobe Acrobat PDF files. Embedding fonts prevents font substitution and ensures that readers, regardless of their computer configuration, see the text in its original fonts. Embedding fonts increases the file size of recognized documents somewhat.
Chapter 10 – Formatting and saving documents On the PDF Options tab, select the required compression level. Readiris Pro supports Level I - Good size and Level I - Good quality compression. Readiris Corporate also supports both Level II and III Good size and Good quality compression as well as Custom compression. In Level II compression the option Compress symbols is enabled automatically to compress text compactly.
ReadirisTM Pro 12 – User Guide REPURPOSING PDF DOCUMENTS Next to generating PDF documents, Readiris can also repurpose PDF files: Readiris converts image PDFs into text PDFs or any other supported text format and unlocks read-only PDF content. Warning: Readiris does not open user password-protected PDF documents. Operation Click the Open button on the main toolbar and select the PDF file you want Readiris to repurpose.
Chapter 10 – Formatting and saving documents XPS stands for XML Paper Specification and is a fixed-layout format developed by Microsoft. To generate XPS output: Click the Format button on the main toolbar and select the XPS type of your choice in the Send to or External file drop-down list: XPS Image When you select XPS Image, Readiris generates image-only XPS documents, it does not execute OCR.
ReadirisTM Pro 12 – User Guide XPS Text-Image When you select XPS Text-Image, Readiris recognizes text and creates searchable XPS documents that contain the page image and the recognized text. The page image is contained beneath the text. SELECTING THE XPS OPTIONS To select the XPS options: Click the Format button on the main toolbar and select the XPS type of your choice in the Send to or External file drop-down list. Depending on the XPS type you select, several options are available.
Chapter 10 – Formatting and saving documents IHQC COMPRESSING XPS DOCUMENTS Besides four types of "regular" XPS output, Readiris offers iHQC compressed XPS output. XPS documents of the types Image-Text and Image can be hyper-compressed by means of iHQC. iHQC stands for intelligent High-Quality Compression, I.R.I.S.' proprietary, efficient compression technology. iHQC is to images what MP3 is to music and what DivX is to movies.
ReadirisTM Pro 12 – User Guide SELECTING THE GRAPHICS OPTIONS Depending on the output format and target application you select, advanced graphics options may be available. The graphics options can be used to alter the image quality and resolution. To access the graphics options: Click the Format button on the main toolbar and select the output format of your choice in the Send to or External file drop-down list. Click the Graphics tab to display the options.
Chapter 10 – Formatting and saving documents Tip: When saving documents as HTML files to post on a web site, reduce the resolution to 70 dpi (screen resolution). JPEG quality Graphics stored inside PDF, XPS, Word and RTF documents are saved in the JPEG format. Use the slider to adjust the JPEG quality. JPEG 2000 compression When saving files in the PDF or XPS format, Readiris can apply JPEG 2000 compression to the color-grayscale images stored inside those files.
ReadirisTM Pro 12 – User Guide CHAPTER 11 SAVING AND LOADING SETTINGS Any settings you specify in Readiris are saved automatically for future use after you close the application. To restore the factory settings, click the command Restore Factory Settings on the File menu. When scanning various groups of documents which all require different settings, it is useful to save separate settings files for each group. Operation Select the settings you want to use for a certain document group.
ReadirisTM Pro 12 – User Guide CHAPTER 12 RECOGNIZING MULTIPAGE DOCUMENTS OPENING AND RECOGNIZING MULTIPLE IMAGE FILES Readiris is designed to process multiple image files at a time. To open multiple image files: Click Open on the main toolbar.
Chapter 12 – Recognizing multipage documents Note that you can also drag-and-drop image files from Windows Explorer to the Readiris image window to open them. The page toolbar will display the opened image files. Tip: hold the mouse cursor over the page thumbnails to display the settings information per page. The page toolbar can be used to edit multipage documents. For more information, see the section Editing multipage documents.
ReadirisTM Pro 12 – User Guide SCANNING AND RECOGNIZING MULTIPAGE DOCUMENTS Readiris is designed to process documents consisting of multiple pages. Readiris Pro processes documents of up to 50 pages. Readiris Corporate processes documents of an unlimited number of pages. To scan multipage documents in Readiris, you can either use the automatic document feeder function when using a sheet-fed scanner or use interval scanning function when you are using a flatbed scanner.
Chapter 12 – Recognizing multipage documents The scanner will automatically scan another page after the indicated number of seconds without you having to click the Scan button every time. Click Abort in the interval scanning dialog box to end the automatic scanning or press ESC on the keyboard. Click Pause in the interval scanning dialog box to freeze the scanning interval or press the space bar on the keyboard. Click Resume when you’re ready to continue.
ReadirisTM Pro 12 – User Guide Drag the page to the correct position. Or right-click a page and click Move Page Up or Down. Deleting a page: Right-click the page you want to delete and click Delete page. Or select the page and hit the Delete button on your keyboard. Excluding a page from recognition: Right-click the page you want to exclude and click Exclude page. Or clear its page number box in the document panel. Excluded pages are stricken out in the page toolbar.
ReadirisTM Pro 12 – User Guide CHAPTER 13 RECOGNIZING HANDPRINTED TEXT Next to typed text, tables, graphics and barcodes, Readiris recognizes handprinted text. Handprinting consists of separated block letters. It takes highly specialized ICR software (intelligent character recognition) to recognize handprinted characters. To recognize handprinting: Click the handprinting button on the image toolbar. Draw a frame around the handprinted text. Click Recognize + Save on the main toolbar.
Chapter 13 – Recognizing handprinted text Recognized symbols Handprinting recognition is limited to the Latin alphabet and supports numerals (0-9), uppercase letters (A-Z) and the punctuation symbols comma, period, plus sign and hyphen. Accents, umlauts and other special characters are not supported. Notes Readiris supports handprinting, not handwriting. For more information, see the section Handprinting rules.
ReadirisTM Pro 12 – User Guide Use a sufficiently thick ballpoint. Black pens yield better results than blue pens. Do not use pencils. Don't stylize too much. Excessively stylized characters increase the risk of OCR errors. Don't open loops which should be closed, don't close loops which should be open. Avoid broken characters. Avoid retracing. Retracing reduces the image quality and clarity of handprinted symbols. Characters that are entirely stricken out will not be recognized.
Chapter 13 – Recognizing handprinted text The horizontal underlining bar does not have to touch the rest of the font form. Tip: when less than optimal results are obtained, use the I.R.I.S. writing form and adapt your writing style. The blank I.R.I.S. writing form serves as a full-page template on which block letters can be filled out correctly and in the right size. The form can be found on the Readiris CD-ROM and in the Readiris installation folder.
ReadirisTM Pro 12 – User Guide CHAPTER 14 RECOGNIZING BARCODES INTRODUCING BARCODE READING Next to optical character recognition of 128 languages, Readiris also offers barcode reading. All widespread barcode symbologies are supported: Codabar, Code 128, Code 39, Code 39 extended, Code 39 HIBC, Code 93, Datalogic 2 of 5, Discrete 2 of 5, EAN-13, EAN-8, Interleaved 2 of 5, MSI Pharmaceutical, MSI-Plessey, Kodak patch code, PDF-417, PostNet, UCC-128, UPC-A and UPC-E.
Chapter 14 – Recognizing barcodes o Select the symbologies you want Readiris to recognize. o Determine whether you want Readiris to verify or remove the check digits. Click the barcode button on the image toolbar and draw a frame around the barcodes zones in the document. Click Recognize + Save on the main toolbar. The entire document including the barcode content will be recognized. Note: right-click a barcode zone and click Copy as Data to copy its content to the clipboard.
ReadirisTM Pro 12 – User Guide INDEX A color image ..................... 26, 33 accuracy vs. speed................ 54 color mode ............................ 28 ADF ..................................... 85 contrast ........................... 28, 36 adjusting scanned documents 33 D Arabic documents ............ 4, 52 deskewing ............................. 23 Asian documents .............. 6, 52 despeckling ........................... 36 Asian edition .................4, 7, 52 digital camera .
Index G M graphics options ................... 79 main toolbar.......................... 16 grayscale image ................... 26 manual windowing ............... 43 H Middle-East edition ...... 4, 7, 52 handprinting ................... 89, 90 mixed languages ................... 54 Hebrew documents .......4, 7, 52 multipage documents ...... 83, 85 HTML output ....................... 61 I image toolbar ....................... 16 N numeric ................................. 53 O installation ..
ReadirisTM Pro 12 – User Guide PDF options ......................... 72 supported image formats ...... 25 PDF output ..................... 61, 71 system requirements ............... 9 product support .................... 12 R recreating source documents 64 registration ........................... 11 T tables .................................... 67 text documents...................... 63 U repurposing PDF documents 75 Unicode output ..................... 61 resolution .............................