ReadirisTM Corporate 12 User Guide
ReadirisTM Corporate 12 – User Guide Table of Contents Copyrights ........................................................................................... 1 Chapter 1 Introducing Readiris ............................................... 3 Save time, no more retyping ............................................. 3 Readiris series ................................................................... 6 Chapter 2 Installing Readiris ................................................... 9 System requirements ...
Table of Contents Scanning paper documents.............................................. 25 Chapter 6 Adjusting scanned documents .............................. 29 Chapter 7 Zoning documents ................................................. 35 Zoning documents automatically .................................... 35 Zoning documents manually ........................................... 37 Using zoning templates ................................................... 42 Chapter 8 Recognizing documents..............
ReadirisTM Corporate 12 – User Guide Repurposing PDF documents.......................................... 72 Selecting the page size .................................................... 73 Chapter 10 Saving and loading settings ................................ 75 Chapter 11 Recognizing large volumes of scanned images .. 77 Batch Processing ............................................................. 77 Setting up a watched folder .............................................
ReadirisTM Corporate 12 – User Guide Copyrights ReadirisCorporate12-dgi-190609-01 Copyrights © 1987-2009 I.R.I.S. All Rights Reserved. I.R.I.S. owns the copyrights to the Readiris software, to the online help system and to this publication. The information contained in this document is the property of I.R.I.S. Its content is subject to change without notice and does not represent a commitment on the part of I.R.I.S.
ReadirisTM Corporate 12 – User Guide CHAPTER 1 INTRODUCING READIRIS SAVE TIME, NO MORE RETYPING Introduction Congratulations on acquiring Readiris. This software package will undoubtedly be of great help in recapturing your texts, tables and graphics, barcodes and handprinted text. As efficient as computers are, you have to key in your information first. If you have ever retyped a 15 page report or a large table of figures, you know how tedious and time-consuming it can be.
Chapter 1 – Introducing Readiris and drag your scanned documents to the Dock icon. They will be processed on the spot. General information Readiris is based on the most advanced recognition technologies. Font-independent text recognition is complemented by self-learning techniques. The system is able to learn new characters and words through contextual and linguistic analysis. This means that the OCR accuracy of the recognition system will improve as it goes along.
ReadirisTM Corporate 12 – User Guide are memorized, increasing the system speed and confidence and rendering the system more intelligent as you go along. This powerful learning tool also allows you to train Readiris on special characters such as mathematical symbols and dingbats and to handle distorted fonts. To increase your productivity further, Readiris not only recognizes your texts, but can format them for you as well. Various levels of formatting are available.
Chapter 1 – Introducing Readiris states, telephone and fax numbers, etc. The resulting data can be sent directly to your contact management software such as Address Book. The data can also be stored in a structured file, in vCard format for instance, and imported in any address database. Readiris is Twain and Image Capture compliant and supports a wide range of flatbed and sheetfed scanners, “all-in-one” devices or “MFPs” (”multifunctional peripherals”) and digital cameras.
ReadirisTM Corporate 12 – User Guide Readiris Pro 12 Readiris Corporate 12 Basic features Basic features 125 recognition languages 125 recognition languages Generates 4 types of PDF files, PDF- Generates 4 types of PDF files, PDF- iHQC files, ODT, DOCX, XLSX, HTML, iHQC files, ODT, DOCX, XLSX, HTML, RTF, Unicode files RTF, Unicode files Generates PDF/A output Large volume recognition Automated processing Barcode recognition Business card recognition Readiris Pro 12 Asian Readiris Corporate 12
Chapter 1 – Introducing Readiris Large volume recognition Automated processing Barcode recognition Business card recognition 8
ReadirisTM Corporate 12 – User Guide CHAPTER 2 INSTALLING READIRIS SYSTEM REQUIREMENTS This is the minimal system configuration required to use Readiris: A Mac OS computer with Intel or G3 processor. The operating system Mac OS X 10.4 or higher. Earlier versions of the Mac OS operating system are not supported. 220 MB of free hard disk space. SOFTWARE INSTALLATION How to install Readiris: Log on to your Mac operating system as an administrative user.
Chapter 2 – Installing Readiris Double-click the Readiris installer and follow the on-screen instructions. Agree with the terms of the license agreement. A standard installation type is offered. This will install Readiris, Drop2Read and the sample images. To modify the installation type, click Customize. Then click Install to start the actual installation. When the installation is finished, click Close.
ReadirisTM Corporate 12 – User Guide SOFTWARE REGISTRATION In order to use Readiris Corporate you are required to register. By doing so, you will also: be kept informed of future product developments and related I.R.I.S. products; be entitled to product support; be entitled to special offers on I.R.I.S. products. To register: Click Register Readiris on the Help menu. You will be directed to the registration web page. Simply follow the on-screen instructions.
Chapter 2 – Installing Readiris I.R.I.S. Software Maintenance and Support Services I.R.I.S. also offers a Software Maintenance and Support Services Program, which allows you to obtain major software upgrades of Readiris Corporate. To obtain the program's application form, please contact I.R.I.S. at the following e-mail address: readiris.maintenance@irislink.com.
ReadirisTM Corporate 12 – User Guide CHAPTER 3 GETTING STARTED RUNNING READIRIS To run Readiris: Click the Readiris icon on the dock. Or double-click the Readiris application in the Readiris folder under Applications. If you acquired Readiris Corporate you will be prompted to register. Click Register on the Internet and complete the registration process to acquire your software key. Enter the software key you receive by e-mail in the required field. The Readiris interface will open.
Chapter 3 – Getting Started USER INTERFACE The Readiris interface is composed of: the main toolbar (left toolbar) Use the main toolbar commands and options to scan and recognize documents. the image toolbar (right toolbar) Use the image toolbar buttons to edit documents in the Readiris interface. Point to the different buttons to display their tooltips. the Readiris menu bar (top of screen) The Readiris menu bar contains all the commands and options you also find on the main and image toolbars.
ReadirisTM Corporate 12 – User Guide When a document has been opened or scanned in Readiris you can view its page thumbnails in the image drawer. Click the drawer icon to open it. The drawer can open both on the right-hand and left-hand side of the Readiris interface, depending on its position on your screen. The drawer allows you to move pages inside a document: simply click the pages you want to move and drag them to another position.
Chapter 3 – Getting Started The drawer also allows you to delete pages by dragging them to the Dock trash. CHANGING THE USER INTERFACE LANGUAGE Readiris opens in the user interface language that is currently activated in your system preferences. To change the user interface language in Readiris: Click the System Preferences icon on the Dock. Then open the International section. Drag the language of your choice to the top of the list and close the International window.
ReadirisTM Corporate 12 – User Guide Before you can use a Twain scanner, however, its drivers need to be installed on your Mac. Operation: Connect your scanner to your Mac and install the corresponding drivers and/or software. Test your scanner. If you experience any problems contact your scanner manufacturer. Run Readiris. On the Readiris menu click Preferences. When the scanner drivers have been installed successfully, a list of supported scanners will be available.
ReadirisTM Corporate 12 – User Guide CHAPTER 4 USING DROP2READ Drop2Read is a simple yet efficient utility that allows you to recognize documents instantly, without the Readiris being displayed. The Drop2Read utility is installed in a default installation of Readiris. To process documents: Simply drag your documents to the Drop2Read icon on the Dock. The Drop2Read window will open and Drop2Read will process your documents using default settings.
Chapter 4 – Using Drop2Read Click the lists to change the settings. Any settings you change will be saved when you close the Drop2Read window. The next time you want to process documents using the same settings, simply drag the documents to the Drop2Read icon on the Dock. Note that Drop2Read uses basic settings. Use Readiris if you want to apply advanced settings when processing documents. Tip: for more information about the available output formats, see the section Formatting documents.
ReadirisTM Corporate 12 – User Guide CHAPTER 5 SCANNING AND OPENING DOCUMENTS SELECTING THE DOCUMENT TYPE Before scanning documents or opening image files in Readiris Corporate, you must select the document type. Readiris can either process Text pages or Business cards. Operation Click the Document type icon on the main toolbar and select the document type. Depending on the document type you select different output formats will be available.
Chapter 5 – Scanning and opening documents SELECTING THE OPTIONS Before scanning paper documents or opening image files, you can determine several image enhancement options. When selected, these options will be applied during the opening and scanning of documents. Operation Click the Options button on the main toolbar to select several image enhancement options. o Click Page Deskewing to straighten pages scanned at an angle.
ReadirisTM Corporate 12 – User Guide o Page Analysis is enabled by default. This way, scanned or opened images will be split up in zones automatically. You can also use the zoning tools on the image toolbar to modify the page analysis results or to zone your documents manually. For more information, see the section Zoning documents manually. When you are done selecting the options, click the Scan or Open button to scan documents or open image files.
Chapter 5 – Scanning and opening documents images, PICT images, PNG images, QuickDraw GX images, QuickTime images, Silicon Graphics images, Targa images, (uncompressed, packbits and Group 3 compressed) TIFF images, multipage TIFF images, Windows bitmaps (BMP) and PDF documents. Select the image file of your choice and click Open. To zoom in on the opened image, use the magnifying glass on the image toolbar or Cmd-click inside the image.
ReadirisTM Corporate 12 – User Guide image files to the recognized document or click Yes to start a new document. SCANNING PAPER DOCUMENTS With Readiris you can either process paper documents you scan with your scanner or process already existing images files of various formats. To scan documents: First select the scanner settings. To access them, click Preferences on the Readiris menu. Make sure your scanner is connected to your Mac and configured correctly.
Chapter 5 – Scanning and opening documents Calibrate Click the Calibrate button should it be necessary to calibrate your scanner. Format You can either choose an automatic scanning format or a custom format for which you can indicate the page height and width. Depth Readiris supports black-and-white, grayscale and color images. Resolution Select a scanning resolution of 300 dpi. When you are scanning business cards it is recommended to use a scanning resolution of 400 dpi.
ReadirisTM Corporate 12 – User Guide and background (artwork). Sometimes smoothening is the only way to separate text from a colored background. Note that this function is not the same as the one you find in the Adjust image options on the Process menu. o Select Process as 300 dpi when you are processing images of an incorrect or unknown resolution. The images will be processed as if they had a 300 dpi resolution. The resolution of digital camera images is nearly always unknown.
Chapter 5 – Scanning and opening documents When you are done defining all the settings, click OK. Then click the Scan button to scan documents. Note: pay attention to line skew. Line skew over 0.5° increases the risk of OCR errors.
ReadirisTM Corporate 12 – User Guide CHAPTER 6 ADJUSTING SCANNED DOCUMENTS During recognition Readiris converts color and grayscale images into binarized, black-and-white images, on which it performs the OCR. When opening or scanning extremely light or extremely dark grayscale and color images, it may be necessary to adjust their binarized counterparts in order to obtain satisfactory OCR results. To adjust images: Open or scan a color-grayscale document. Make sure that the scanner settings are correct.
Chapter 6 – Adjusting scanned documents Note: sometimes smoothening is the only way to separate text from a colored background. (Original image) (Binarized black-and-white image) (Smoothened image) o Use the slider to increase or decrease the Brightness. The Brightness settings determine the overall brightness of the image. Use these settings to darken or lighten the image when the text is illegible. Example 1: lighten a dark image to eliminate the page background.
ReadirisTM Corporate 12 – User Guide (Binarized image. The default binarization settings yield a black image) (The lightened image yields satisfactory recognition results) Example 2: darken an image when the text is so light it doesn't show up in the binarized image. (Color image) (Binarized image.
Chapter 6 – Adjusting scanned documents o Use the slider to increase or decrease the Contrast. The Contrast settings determine the contrast between darker and lighter zones of an image. Use these settings to make character shapes stand out against a colored background. (Color image) (Default contrast settings yield broken characters) (Increased contrast settings yield satisfactory recognition results) o Use the slider to increase or decrease the Despeckle options.
ReadirisTM Corporate 12 – User Guide You can also save a selection of pages by clicking Save Selected Pages on the File menu.
ReadirisTM Corporate 12 – User Guide CHAPTER 7 ZONING DOCUMENTS ZONING DOCUMENTS AUTOMATICALLY When scanning or opening documents, Readiris will automatically apply Page Analysis to split up the documents in different zones. The Page Analysis option is selected by default. Click the Options button and disable Page Analysis should you want to avoid automatic page analysis. The page analysis results can be modified manually after automatic page analysis.
Chapter 7 – Zoning documents Page analysis detects text, graphic, table and barcode zones automatically. Handprinting zones need to be drawn manually. For more information, see the section Zoning documents manually. Each zone type has its own icon: The zones are sorted top-down, left to right. Numbers indicate the sort order of the zones. The sort order and zone types can be changed, however. For more information, see the section Zoning documents manually.
ReadirisTM Corporate 12 – User Guide ZONING DOCUMENTS MANUALLY Besides zoning documents automatically by means of Page Analysis, Readiris allows you to zone documents manually. Manual zoning comes in handy when having to modify the automatic page analysis results. It also allows you to create zoning templates. For more information on zoning templates, see the section Using zoning templates. Note that handprinting zones always need to be zoned manually.
Chapter 7 – Zoning documents For information about recognizing barcodes and handprinting, see the sections Recognizing barcodes and Recognizing handprinted text, respectively. To select other zone types, click the zone type icon that is currently selected, and choose another zone type. Or click the Layout menu, point to Layout Mode and select the zone you want to draw. When you are done splitting up the document in recognition zones, click the Recognize + Save button to execute the OCR.
ReadirisTM Corporate 12 – User Guide Drawing polygons Zoning documents manually is not limited to rectangular shapes. You can create polygonal zones by merging rectangular ones. Whenever two zones of the same type intersect, they become a polygon automatically. Automatic page analysis Should the current page be too complex to zone manually, click the Analyze page button on the image toolbar to zone the page automatically. Note that barcode zones and handprinting zones always need to be drawn manually.
Chapter 7 – Zoning documents Changing the zone type To change the zone type of a zone, Ctrl-click the zone and select the required zone type. You can also change the zone type of several zones simultaneously: Click the pointer button on the image toolbar, then click Select Zones Tip: when the pointer is not visible on the image toolbar this means one of the 5 zone types is currently selected. Click the corresponding icons on the image toolbar, then click Select Zones.
ReadirisTM Corporate 12 – User Guide Moving zones Select the zone you want to move. Click inside the zone and drag the mouse to modify the position of the zone. Recognizing a particular zone Ctrl-click the zone you want to recognize and select Copy as Text. The results are sent to the pasteboard as body text. This also works for handprinted text. Graphic zones and barcode zones can also be copied to the pasteboard.
Chapter 7 – Zoning documents Deleting small zones Some documents, faxes for instance, often have "stray" dots on pages, causing Readiris to create superfluous zones that do not contain text. To erase all small zones, click Delete Small Zones on the Layout menu. This option erases all zones smaller than 0.5" and re-sorts the remaining zones. USING ZONING TEMPLATES When OCRing many documents with a similar page layout, it may be useful to use zoning templates instead of automatic page analysis.
ReadirisTM Corporate 12 – User Guide When you want to use the same zoning template next time you use Readiris, click the command Open in the Layout menu. Frame the Area to Analyze As an alternative to zoning templates, you can use the option Frame the Area to Analyze. That way, you can define one particular area on the page that needs to be OCRed. Any data outside the OCR area will be excluded from recognition.
ReadirisTM Corporate 12 – User Guide CHAPTER 8 RECOGNIZING DOCUMENTS INTRODUCTION To recognize documents, Readiris applies linguistics during the recognition phase. As a result, Readiris recognizes text, tables and graphics, barcodes and handprinted text in all kinds of documents. Readiris even copes with complex columnized documents, lowquality documents, faxes, dot matrix printouts, badly scanned and copied documents containing too light or dark font shapes, etc.
Chapter 8 – Recognizing documents allows to increase the system's accuracy. All solutions you confirm are memorized temporarily during recognition, increasing the system speed and confidence and rendering the system more intelligent as you go along. This powerful learning tool also allows you to train Readiris on special characters such as mathematical symbols and dingbats and to handle distorted fonts. The interactive learning results can also be stored permanently in font dictionaries for future use.
ReadirisTM Corporate 12 – User Guide Important: select the document language before executing page analysis when you are dealing with Asian or Hebrew documents. Specific page analysis routines are used for these documents. The recognition can also be limited to a Numeric character set to optimally recognize tables and figures. Readiris then only recognizes the numerals 0-9 and the following series of symbols: To activate numeric mode, select Numeric at the top of the Primary language list.
Chapter 8 – Recognizing documents Recognizing documents with mixed languages Readiris also allows you to enable mixed character sets. That way Readiris switches languages in the middle of a sentence automatically and recognizes English words (proper names etc.) that occur in "exotic" languages. Click the globe button on the main toolbar and select the required language combination in the Primary language list. Note: when processing Asian or Hebrew documents, mixed characters sets are used automatically.
ReadirisTM Corporate 12 – User Guide Pages with a different language than the overall language are marked in red in the drawer. This also works when recognizing business cards. Unlike secondary languages, there are no limitations here. Note: the tooltip of each page in the drawer indicates which language applies to that page.
Chapter 8 – Recognizing documents USING USER LEXICONS During recognition, Readiris is assisted by linguistic databases to recognize text correctly. These linguistic databases are standard lexicons and are available for every supported language. As powerful as these standard lexicons may be, the recognition accuracy can still be boosted using customized user lexicons.
ReadirisTM Corporate 12 – User Guide Duplicate words are rejected automatically. Click Save to save the lexicon file in the folder of your choice. Return to the Readiris Settings menu and point to User Lexicon. Click Open and select the user lexicon file of your choice in the dialog box. Note that in order for Readiris to recognize the words in the user lexicon, the correct language must have been selected. Click the globe icon on the main toolbar to do so.
Chapter 8 – Recognizing documents DEFINING THE DOCUMENT CHARACTERISTICS Next to the document language, other document characteristics such as the Font type and Character pitch play an important role in the recognition process. Font type Readiris distinguishes between "regular" and dot matrix printed documents. Dot matrix symbols (of the type 9 pin) are made up of isolated, separate dots. Special segmentation and recognition techniques are required to recognize dot matrix documents and need to be activated.
ReadirisTM Corporate 12 – User Guide characters have the same width, or proportional, in which case the characters have a different width. To select the character pitch: On the Settings menu, point to Character Pitch. The character pitch is set to Automatic by default. Click Fixed if all characters of the typeface have the same width. This is often the case in old typewriter documents. Click Proportional if the characters of the typeface have a different width.
Chapter 8 – Recognizing documents train Readiris on special symbols it is unable to recognize initially, such as mathematical and scientific symbols and dingbats. To enable interactive learning: On the Learn menu, click Interactive Learning. Click the Recognize + Save button to recognize the document. Readiris enters the interactive learning phase. The characters the recognition system isn't sure of are displayed. If the results are correct: o Click the Learn button to save the result as sure.
ReadirisTM Corporate 12 – User Guide If the results are incorrect: o Type in the correct characters and click the Learn button. Note: if you are dealing with documents that contain special characters make sure you click the command Special Characters on the Edit menu. Double-click the characters you want to insert. or o Click Don't learn to save the result as unsure. Use this command for damaged characters which could be confused with other characters if learned. E.g.
Chapter 8 – Recognizing documents All learning results will be deleted. Next time you click Recognize + Save, interactive learning will start again. USING FONT DICTIONARIES When scanning many documents of the same type, font quality and printing quality, you may not want to repeat the learning process every time. Therefore, it is useful to use font dictionaries. Font dictionaries contain font information learned during interactive learning and can substantially increase the recognition results.
ReadirisTM Corporate 12 – User Guide Select the dictionary you want to use and click Open. Click Recognize + Save to recognize the document.
ReadirisTM Corporate 12 – User Guide CHAPTER 9 FORMATTING AND SAVING DOCUMENTS FORMATTING DOCUMENTS Readiris allows you to recognize and save your documents in numerous output formats: With Readiris you can generate several types of text-based documents. Readiris offers OpenDocument text, Open XML (docx), RTF and Unicode text output. Note that it takes the latest version of Microsoft Word (2008) to open docx files. To open docx files in Microsoft Word 2004 you need to download a Docx convertor.
Chapter 9 – Formatting and saving documents (gridded) (non-gridded) Readiris offers 4 types of PDF output. See the section Creating PDF doccuments for more information. With Readiris you can save your documents as image files without recognizing them. Readiris can save documents as JPEG, JPEG 2000, Photoshop, PICT, PNG, TIFF and Windows bitmap images. Operation Click the output format icon on the main toolbar. Select the required output format from the Format list.
ReadirisTM Corporate 12 – User Guide The Layout and Graphics options are covered in the sections Selecting the Layout options and Selecting the Graphics options. Options that are unavailable for the selected output format appear dimmed. You can also send the recognized documents directly to a target application, which will open automatically.
Chapter 9 – Formatting and saving documents SELECTING THE LAYOUT OPTIONS Depending on the output format you select, different layout options are available. To access the Layout options: Click the output format icon on the main toolbar. Select the required output format from the Format list. The available layout options for the selected format will be displayed: Options that are not available appear dimmed. o The option Create body text avoids text formatting by Readiris.
ReadirisTM Corporate 12 – User Guide Readiris generates a true copy of the source document, no longer a scanned image. Readiris also recreates any hyperlinks to e-mail addresses and web sites. The option Use columns instead of frames creates columnized documents. Columnized texts are easier to edit than documents containing multiple frames: the text flows naturally from one column to the next.
Chapter 9 – Formatting and saving documents Readiris wordwraps the recognized text until a new paragraph starts, and "reglues” hyphenated words at the end of a line. o The option Include graphics includes the graphics in autoformatted files. This is essential to create a true copy of a document. Use the graphic options on the Graphics tab to determine the color mode and resolution of the graphics stored inside the output files.
ReadirisTM Corporate 12 – User Guide Depth Readiris saves graphics in their original depth by default. Readiris can also save graphics in black-and-white, grayscale and color. Quality You can choose between Low, Normal and High quality graphics. Resolution Readiris retains the original resolution by default. You can also choose to reduce the resolution to a lower dpi. Note that you cannot increase the resolution.
Chapter 9 – Formatting and saving documents When you are done selecting the options, click OK. Then click Recognize+Save to recognize the document. SAVING DOCUMENTS AS IMAGE FILES Although Readiris is an OCR application it also allows you to save your documents as image files without recognizing them. Readiris can save documents as JPEG, JPEG 2000, Photoshop, PICT, PNG, TIFF and Windows bitmap images. Operation Click the output format icon on the main toolbar.
ReadirisTM Corporate 12 – User Guide In case you just want to save your images without opening them, select None in the Send to list. Then click Recognize+Save on the main toolbar to save your document as image file. Or click Save document on the File menu. Notes: You can also use the command Copy graphic zones on the Layout menu to move all graphics on a page to the pasteboard. You can also drag the image thumbnails from the Drawer to the Desktop to save them in the JPEG format.
Chapter 9 – Formatting and saving documents PDF Text When you select PDF Text, Readiris recognizes text and creates searchable PDF files. The page image is not contained in these single-layered PDF files. PDF Text-Image When you select PDF Text-Image, Readiris recognizes text and creates searchable PDF documents that contain the page image and the recognized text. The page image is contained beneath the text.
ReadirisTM Corporate 12 – User Guide SELECTING THE PDF OPTIONS To select the PDF options: Click the output format icon on the main toolbar and select PDF. Depending on the PDF type you select, several options are available. Click the PDF options tab to access them: Version Select which version of the PDF format you want to generate. Note: It takes Adobe Acrobat 5.0 and higher to open PDF 1.4 documents. It takes Adobe Acrobat 6.0 and higher to open PDF 1.5 documents.
Chapter 9 – Formatting and saving documents It takes Adobe Acrobat 7.0 and higher to open PDF 1.6 documents. It takes Adobe Acrobat 8.0 and higher to open PDF 1.7 documents. PDF/A documents Next to "regular" PDF documents, Readiris offers PDF/A output. Simply select the option Conforms to PDF/A. PDF/A files are used for long-term archiving and contain only what is strictly needed for opening and viewing them. Note: use Adobe Reader instead the standard Preview application to open PDF/A documents.
ReadirisTM Corporate 12 – User Guide and Image can be hyper-compressed by means of iHQC without loss of image quality. iHQC stands for intelligent High-Quality Compression, I.R.I.S.' proprietary, efficient compression technology. iHQC is to images what MP3 is to music and what DivX is to movies. Select either Good size to obtain the smallest possible documents or Good Quality to obtain slightly larger documents of higher quality.
Chapter 9 – Formatting and saving documents When you set an open document password, you will be prompted to enter that password when opening the PDF output. When you set a permissions password, you will only be able to perform the actions specified in the security settings. If you do want to change these settings, you must enter the permissions password. The Readiris security settings are similar to the standard protection features offered by Adobe Acrobat.
ReadirisTM Corporate 12 – User Guide Warning: Readiris does not open user password-protected PDF documents. Operation Click the Open button on the main toolbar and select the PDF file you want Readiris to repurpose. If necessary, indicate the pages you want to open. Click the output format icon on the main toolbar and select PDF from the Format list. Then select the PDF type of your choice and click OK to close the settings.
Chapter 9 – Formatting and saving documents Click the output format icon on the main toolbar and select one of the output formats mentioned above from the Format list. Then click the Page Sizes tab to access the options. Check the page sizes you want to include and clear the ones you want to exclude. Readiris goes through the active page sizes in the indicated order and uses the first page size that is sufficiently large to hold the scanned document.
ReadirisTM Corporate 12 – User Guide CHAPTER 10 SAVING AND LOADING SETTINGS When you exit Readiris you will be prompted so save any settings you specified and use them as default settings. The next time you run Readiris, the program will open using the new default settings. To restore the factory settings, click the command Restore Factory Settings on the Settings menu.
Chapter 10 – Saving and loading settings Click Recognize + Save to recognize the document, using the correct settings.
ReadirisTM Corporate 12 – User Guide CHAPTER 11 RECOGNIZING LARGE VOLUMES OF SCANNED IMAGES BATCH PROCESSING Readiris offers a powerful functionality for recognizing batches of scanned images: Batch Processing Batch Processing executes the recognition on all scanned images in a specific folder. Indicate to Readiris in which folder your documents are located, start the OCR process and all your documents will be converted to the required output format.
Chapter 11 – Recognizing large volumes of scanned images These folders may be different but do not need to be. Select the processing options: o Select Process subfolders to process all subfolders of the image folder. If the output folder differs from the image folder, all subfolders will be recreated in the output folder, mirroring the structure of the image folder. o Select Overwrite text files to overwrite previous recognition results.
ReadirisTM Corporate 12 – User Guide SETTING UP A WATCHED FOLDER Next to executing Batch Processing, Readiris can monitor a Watched Folder. Any image files you place or change inside the watched folder will be processed by Readiris. You can leave the OCR software running day after day. Note: the Watched folder function is especially convenient when you are using a scanner that stores your images automatically in a predefined folder.
Chapter 11 – Recognizing large volumes of scanned images o Select Process subfolders to process all subfolders of the image folder. If the output folder differs from the image folder, all subfolders will be recreated in the output folder, mirroring the structure of the image folder. o Select Overwrite text files to overwrite previous recognition results. o Select Delete images after processing to delete the files in the image folder. Click OK to monitor the Watched Folder.
ReadirisTM Corporate 12 – User Guide CHAPTER 12 SEPARATING AND INDEXING DOCUMENT BATCHES SEPARATING DOCUMENT BATCHES When scanning or opening multiple documents it is essential to indicate to Readiris where one document ends and the other begins. You can do this by means of blank pages or barcode pages. Separating scanned documents When you are scanning documents, insert a blank page or barcode page between the different documents in your scanner's document feeder.
Chapter 12 – Separating and indexing document batches Select Detect blank pages or Detect cover pages with a barcode, depending on the type of separator page you are using. Readiris will detect blank pages or barcode pages and mark them as cover pages. A page is blank when it only contains noise. Note that you can delete all blank pages simultaneously after recognition should this be necessary: click the command Delete Blank Pages on the Process menu to do so.
ReadirisTM Corporate 12 – User Guide Click OK to close the settings. Then click the Scan button to scan the documents. The scanned images will be displayed in Readiris and the blank pages or barcode pages will be marked as cover pages. Click the Recognize + Save button to process the documents. The document batch will be split up and saved in separate output documents. Separating opened documents manually Click the Open button on the main toolbar and select the documents you want to open.
Chapter 12 – Separating and indexing document batches Click the Recognize + Save button to process the documents. INDEXING DOCUMENT BATCHES Besides separating document batches, Readiris allows you to index document batches. Readiris can generate an XML index file containing detailed information on the processed documents and, if selected, also the OCR results. The XML index file can be used afterwards for programming purposes.
ReadirisTM Corporate 12 – User Guide Select Generate an XML index. An XML index file will be created per document. The index file contains detailed information such as the detected barcode separator, the page range, the output file name and the cover page text (if selected). To include the text of the cover pages in the XML index, select the corresponding option. Note that these reading results are not included in the output document. Click OK to save the document processing settings.
ReadirisTM Corporate 12 – User Guide CHAPTER 13 RECOGNIZING HANDPRINTED TEXT Next to typed text, tables, graphics and barcodes, Readiris recognizes handprinted text. Handprinting consists of separated block letters. To recognize handprinting: Click the pointer button on the image toolbar. Select Draw Handprinting Zones. Draw a frame around the handprinted text you want to recognize. Click Recognize + Save on the main toolbar. The entire document including the handprinted text will be recognized.
Chapter 13 – Recognizing handprinted text Recognized symbols Handprinting recognition is limited to the Latin alphabet and supports numerals (0-9), uppercase letters (A-Z) and the punctuation symbols comma, period, plus sign and hyphen. Accents, umlauts and other special characters are not supported. Notes Readiris supports handprinting, not handwriting. Uppercase characters are replaced by lowercase characters after recognition, unless they occur at the beginning of a sentence.
ReadirisTM Corporate 12 – User Guide CHAPTER 14 RECOGNIZING BARCODES INTRODUCING BARCODE READING Next to optical character recognition of 125 languages, Readiris also offers barcode reading. Barcodes can either be recognized manually or automatically when they are used for indexing purposes.
Chapter 14 – Recognizing barcodes Then select Draw Barcode zones. Draw a frame around the barcode zones you want to recognize. Click Recognize + Save on the main toolbar. The entire document including the barcode content will be recognized. Note: Ctrl-click a barcode zone and click Copy as Data to copy its content to the pasteboard. Automatic barcode reading Barcodes can be used as separators to separate documents in a document batch.
ReadirisTM Corporate 12 – User Guide CHAPTER 15 RECOGNIZING BUSINESS CARDS INTRODUCING BUSINESS CARD READING Next to recognition of "regular" documents, Readiris also offers business card recognition. Readiris allows you to scan business cards, recognize them and convert them into an address database.
Chapter 15 – Recognizing business cards Tip: select a scanning resolution of 400 to 500 dpi to recognize business cards successfully. To do so, click Preferences on the Readiris menu and change the resolution. The necessary options are enabled invisibly by default: Readiris applies Page Deskewing and Page Analysis and Detects the Page Orientation automatically. If necessary you can also apply Despeckling options to remove small dots from your business cards.
ReadirisTM Corporate 12 – User Guide Change the zone types, if necessary: Ctrl-click the zone you want to change and select another zone type. Click the globe button to select the correct card style. If you are scanning business cards of different countries you can change the card style manually per card in the image drawer: simply Ctrl-click a card thumbnail in the drawer and click Country to select a different card style. Click the format icon to select the output format.
Chapter 15 – Recognizing business cards Business cards can be saved in the HTML, Unicode and vCard format or be sent to Address Book. Depending on the format you select, you can choose to include the field names and/or the card images of your business cards. When you select Unicode, several Field delimiters are available. Field delimiters are the symbols that separate the various database fields inside an address record.
ReadirisTM Corporate 12 – User Guide application. You will create a new e-mail message and add the vCard file as attachment. Click Recognize + Save to recognize the business card(s) and export them. The Interactive Learning option is also available for business card reading. For more information, see the section Using interactive learning.
ReadirisTM Corporate 12 – User Guide INDEX A color image ..................... 26, 29 accuracy vs. speed................ 46 color mode ............................ 26 Address Book....................... 91 contrast ................................. 32 adjusting scanned documents 29 cover pages ........................... 81 Asian documents ...........4, 6, 45 D Asian edition .................. 4, 6, 7 deskewing ....................... 22, 91 automatic zoning .................. 35 despeckling .
Index font dictionaries ................... 56 layout options ....................... 62 font type ............................... 52 line skew............................... 28 G graphics options ................... 64 grayscale image ............. 26, 29 H loading settings ..................... 75 M main toolbar.......................... 14 manual zoning ...................... 37 handprinting ......................... 87 mixed languages ................... 48 Hebrew documents .......
ReadirisTM Corporate 12 – User Guide PDF documents .................... 67 separating documents ........... 81 PDF/A output ....................... 70 smoothening color images ... 26, 29 PDF-IHQC output ................ 70 speed vs. accuracy ................ 46 primary language ................. 46 spreadsheet documents ......... 59 product support .................... 11 supported image formats ...... 24 R system requirements ............... 9 recreate source document ..... 62 T registration .........
Index Z 100 zoning templates ...................