Operation Manual

Readiris 15 - User Guide
PDF Image-Text. This is the most commonly used file type. It contains two layers: the recognized
text, and the original image on top of the text. This way, you have access to the recognized
text while still seeing the original image.
Note: since the image covers the text, any recognition mistakes will not be visible.
PDF Text-Image. This file type is the opposite of PDF Image-Text. It contains the original
image in the background, and the recognized text on top of the image.
Note: any recognition mistakes will be visible in this format.
PDF Text. This file type contains the recognized text, but does not contain the original image
of your document. Any images in the original document are included as graphics in the PDF
file.
PDF Image. When you select this file type, Readiris does not perform text recognition on
your document. Your PDF file will not be text-searchable; it contains only the image of your
original document.
Note: many options are available for PDF files. Readiris can generate hyper-compressed PDF files,
password-protected PDF files, and PDF/A-compliant PDF files. See the section PDF Options for more
information.
Tip: with Readiris you can also turn Image PDFs into text-searchable PDFs.
DOCX
DOCX is the standard word processing format used since Microsoft Word 2008. It is a standard format
in several applications on the Microsoft Windows operating systems.
DOCX is also supported by Pages for Mac, and DOCX with simplified layout is supported in TextEdit.
ODT
ODT stands for "OpenDocument Text". It is an open standard file format.
ODT files can be opened with any OpenOffice-compatible word processor.
RTF
RTF stands for "Rich Text Format". It is a free document file format developed by Microsoft
to facilitate document exchange.
Use the RTF format when you cannot use the DOCX or ODT formats. It is
recommended to use Microsoft Word to open RTF documents generated by Readiris.
XLSX
XLSX is the standard spreadsheet file format used since Microsoft Excel 2008. XLSX files are created
using the Office Open XML standard. Each cell in an XLSX file can have its own formatting.
HTML
HTML stands for "Hypertext Markup Language". It is the predominant markup language for web
pages. It provides a means to describe the structure and formatting of text-based information in a
document. This file format can be opened in Microsoft Excel, in Web browsers such as Safari, and in
Web page editors such as Adobe Dreamweaver.
Note: HTML is the recommended format when saving documents to Evernote.
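The output formats described above can usually be told apart by their first bytes and, for the ZIP-based formats (DOCX, XLSX, ODT), by the parts stored inside the archive. The following Python sketch illustrates this. It is not part of Readiris: the function names sniff_format and _sniff_zip are hypothetical, and the checks are simplified heuristics rather than full format validation. It also cannot tell the four PDF types apart, since they all share the same file header.

```python
import io
import zipfile


def sniff_format(data: bytes) -> str:
    """Guess the document format from the raw contents of a file.

    Hypothetical helper for illustration only; the checks are heuristics.
    """
    if data.startswith(b"%PDF-"):
        return "PDF"  # all four PDF types share this header
    if data.startswith(b"{\\rtf"):
        return "RTF"
    if data.startswith(b"PK\x03\x04"):
        # DOCX, XLSX and ODT are all ZIP archives; look inside.
        return _sniff_zip(data)
    start = data[:1024].lstrip().lower()
    if start.startswith(b"<!doctype html") or start.startswith(b"<html"):
        return "HTML"
    return "unknown"


def _sniff_zip(data: bytes) -> str:
    """Distinguish ZIP-based formats by their characteristic members."""
    try:
        with zipfile.ZipFile(io.BytesIO(data)) as archive:
            names = set(archive.namelist())
            if "word/document.xml" in names:
                return "DOCX"
            if "xl/workbook.xml" in names:
                return "XLSX"
            if "mimetype" in names:
                mimetype = archive.read("mimetype").strip()
                if mimetype == b"application/vnd.oasis.opendocument.text":
                    return "ODT"
    except zipfile.BadZipFile:
        pass
    return "unknown"
```

To check a file on disk, read it in binary mode and pass the bytes to the function, for example sniff_format(open(path, "rb").read()), where path is the location of a file exported from Readiris.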