User Guide
Chapter 22550
The XML Parser Xtra supports the following character sets:
• ISO-8859-1 Also called Latin 1. This is the most common Western encoding used on the web.
It matches the default character set used on Windows in most Western countries. It does not
match the character set used in most Western versions of the Mac OS (MacRoman) and does
not match character sets commonly used in most non-Western countries. The first 127
characters (binary codes 1-127) are the same in most countries.
• UTF-8 An 8-bit encoding of the Unicode character set. This is the XML default character set.
• US-ASCII Supports only 7-bit characters.
• EUC-JP The EUC character set, used widely in Japan.
• SHIFT_JIS Also used widely in Japan. Shift-JIS is the character set used by default in Japanese
versions of Windows and the Mac OS.
• UTF-16 A 16-bit encoding of the Unicode character set.
For many developers, Latin 1 is the most convenient encoding to use.