Optical Character Recognition (OCR) is normally used only for PDF pages
without an accessible text layer or when non-standard character encoding is
detected, but you can require it for any conversion.
Handling Image-only PDF Files
PDF files without a text layer are a special case for conversion. You can
decide how the program should handle these pages: convert them with the
built-in Optical Character Recognition (OCR), transfer them as images to
the target document or skip them. You can have the program inspect the
first pages (up to ten) of each PDF file you open. Optionally, you can have
conversion stop if no pages with a text layer are detected. If you have
ScanSoft® OmniPage®, you can use it to gain more control over the
recognition process.
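The choices described above can be pictured as a small decision procedure.
The following Python sketch is for illustration only and is not part of the
program: the names ImageOnlyAction and plan_conversion are hypothetical, and
it assumes that the stop option looks only at the inspected leading pages.

```python
from enum import Enum


class ImageOnlyAction(Enum):
    """Hypothetical names for the three ways to handle an image-only page."""
    OCR = "ocr"            # convert with built-in OCR
    AS_IMAGE = "as_image"  # transfer the page as an image
    SKIP = "skip"          # leave the page out of the target document


def plan_conversion(pages_have_text, action, inspect_first=0,
                    stop_if_no_text=False):
    """Return a per-page plan, or None if conversion is stopped.

    pages_have_text: list of bools, True where a page has a text layer.
    inspect_first:   number of leading pages to inspect (capped at ten).
    """
    inspected = pages_have_text[:min(inspect_first, 10)]
    # Assumption: stop only when pages were inspected and none has text.
    if stop_if_no_text and inspected and not any(inspected):
        return None
    return ["convert_text" if has_text else action.value
            for has_text in pages_have_text]


if __name__ == "__main__":
    print(plan_conversion([True, False, False], ImageOnlyAction.OCR))
```

In this sketch, pages with a text layer are always converted normally, and
the chosen action applies only to image-only pages.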
Language Support
PDF Converter supports over 100 languages, including Danish, Dutch,
English, Finnish, French, German, Italian, Norwegian, Polish, Portuguese,
Spanish and Swedish. The program can convert multilingual documents. A
full list of supported languages is provided in online Help. Choosing the
correct language is important for converting image-only pages and for
handling non-standard encoding.