2018.2

Codepage 1252 (ANSI - Latin 1) is used for many Latin language documents, since it can be
used for Afrikaans, Basque, Catalan, Danish, Dutch, English, Faroese, Finnish, French,
Galician, German, Icelandic, Indonesian, Italian, Malay, Norwegian, Portuguese, Spanish,
Swahili and Swedish. Codepage 932 is often used for Japanese.
Input
Any text-based data file.
Processing
The characters in the data file are converted from the old encoding to the new one.
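As an illustration only, the conversion can be sketched in Python as a decode from the source encoding followed by an encode to the target encoding. The function name `convert_encoding` and the sample strings are hypothetical and are not part of the product; the second half shows why choosing the wrong source encoding still "works" but yields unreadable text.

```python
def convert_encoding(data: bytes, source: str, target: str) -> bytes:
    """Decode bytes using the source encoding, then re-encode in the target."""
    text = data.decode(source)
    return text.encode(target)


# "café" stored in codepage 1252: the é is the single byte 0xE9.
original = "café".encode("cp1252")
converted = convert_encoding(original, "cp1252", "utf-8")  # é becomes 0xC3 0xA9

# Japanese text stored in codepage 932, wrongly read as codepage 1252.
# No error is raised here; the result is simply the wrong characters.
japanese = "日本語".encode("cp932")
garbled = japanese.decode("cp1252")
correct = japanese.decode("cp932")
```

Here `converted` holds the same text in UTF-8, while `garbled` differs from `correct` even though both were decoded from the same bytes.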
Output
The data file in its new encoding format. Metadata, Job Info variables and other variables are
unchanged.
Task properties
General Tab
• Source encoding: Select the current data encoding. The source encoding is not detected
automatically, so you must select the correct encoding from this list for the conversion to
succeed.
• Target encoding: Select the encoding to which you want the data to be converted.
• Include target encoding signature: This option is only available when converting to
UTF-8 (Windows code page 65001) or UCS-4 (code page 12000 or 12001). Select it to
include the character encoding signature (also known as the byte order mark) at the
beginning of the target string.
• Default character on translation: Enter a character to replace any characters that cannot
be found in the source encoding. If you leave this box empty, those characters are simply
stripped from the data, so consider using a space as a placeholder for unidentified
characters.
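The effect of the signature and default character options can be sketched in Python as follows. This is a rough model and not the task's actual implementation: the function `translate`, its parameters, and the `SIGNATURES` table are hypothetical names, and the sketch assumes that the default character stands in for any character that has no mapping in the target encoding.

```python
import codecs

# Byte order marks for the targets that support a signature in this sketch.
# Keys are Python codec names (assumed lowercase), not Windows code page numbers.
SIGNATURES = {
    "utf-8": codecs.BOM_UTF8,
    "utf-32-le": codecs.BOM_UTF32_LE,
    "utf-32-be": codecs.BOM_UTF32_BE,
}


def translate(data: bytes, source: str, target: str,
              default_char: str = "", include_signature: bool = False) -> bytes:
    """Convert data from source to target, substituting unmappable characters."""
    text = data.decode(source)
    pieces = []
    for ch in text:
        try:
            pieces.append(ch.encode(target))
        except UnicodeEncodeError:
            # Assumption: an empty default character strips the character.
            pieces.append(default_char.encode(target))
    body = b"".join(pieces)
    if include_signature:
        body = SIGNATURES.get(target, b"") + body
    return body


sample = "日本語 ok".encode("cp932")
stripped = translate(sample, "cp932", "cp1252")                    # b" ok"
replaced = translate(sample, "cp932", "cp1252", default_char="?")  # b"??? ok"
signed = translate("café".encode("cp1252"), "cp1252", "utf-8",
                   include_signature=True)
```

With an empty default character the three Japanese characters disappear entirely, which is why a visible placeholder can make lost characters easier to spot in the output.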
On Error Tab
For a description of the options on the On Error tab, see "Using the On Error tab" on page 88.