Specifications

Print2CAD OCR 2013- 65
Print2CAD
OCR 2013
8.1 Types of Text in PDF Files
The text in PDF les can be placed as strings or individual characters. How can you nd
out if your PDF le contains real text? The best method is to analyse the PDF le with the
analysis function of Print2CAD and see if there are any text entities indicated. Another
method is to open the PDF le in a PDF Reader and zoom the text to maximum view.
If the letters still have smooth edges (displaying an arc, not a polyline), your PDF le
most likely features real text. If the edges of the letters are not smooth, Print2CAD will
not convert the “text” to real text without activating the OCR function.The reason for
this is a mathematical contradiction between the vectorization procedure and the OCR
procedure (Optical Character Recognition). The two procedures can not be combined
without creating severe errors.
Figure: Real PDF text with smooth edge