HP-UX 11.0 - 11i Internationalization Features White Paper

Encoding Characters
Converting Between Encodings
Chapter 2
The affected iconv conversions are those between sjis and UCS2 or UTF-8, and those between eucJP and UCS2
or UTF-8.
The affected iconv conversion tables are sjis=ucs2, ucs2=sjis, eucJP=ucs2 and ucs2=eucJP. These tables
are shared by both UCS2 and UTF-8 conversions.
No compatibility problems are anticipated. However, if compatibility concerns arise with regard to persistent
data stored in either Unicode (UCS2) or UTF-8 on an HP-UX system, a simple conversion script can search
for each occurrence of an incorrect value in either UCS2 or UTF-8 and replace it with the correct value,
based on the mappings in Table 2-25.
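As a rough illustration of such a conversion script, the sketch below rewrites the four old values from
Table 2-25 to the corrected ones. It is written in Python rather than as an HP-UX script, the function
names are hypothetical, and it assumes that the UCS2 data is big-endian without a byte order mark and that
every occurrence of an old value originated from an sjis or eucJP conversion.

```python
# Old -> new code points, taken from Table 2-25.
OLD_TO_NEW = {
    0xFFE3: 0x203E,  # Overline
    0x2015: 0x2014,  # Em Dash
    0xFFE5: 0x00A5,  # Yen Sign
    0x02DC: 0xFF5E,  # Full-width Tilde
}


def fix_utf8(data: bytes) -> bytes:
    """Replace the old code points in UTF-8 data with the corrected ones."""
    text = data.decode("utf-8")
    return text.translate(OLD_TO_NEW).encode("utf-8")


def fix_ucs2(data: bytes) -> bytes:
    """Same replacement for UCS2 data (assumed big-endian, no byte order mark)."""
    if len(data) % 2:
        raise ValueError("UCS2 data must have an even number of bytes")
    out = bytearray()
    for i in range(0, len(data), 2):
        unit = int.from_bytes(data[i:i + 2], "big")
        out += OLD_TO_NEW.get(unit, unit).to_bytes(2, "big")
    return bytes(out)


if __name__ == "__main__":
    # Old UTF-8 yen sign (0xefbfa5) becomes the corrected 0xc2a5.
    print(fix_utf8(bytes.fromhex("efbfa5")).hex())
```

Because the replacement is unconditional, it would also rewrite any of these code points that were stored
intentionally, so it is best run only on data known to have come from the affected converters.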
Correction for Korean
A patch fixes a defect in which the Korean Unicode (UCS2)/UTF-8 character mappings do not conform to
standards.
The currently supplied Korean iconv converter tables conform neither to the Unicode 2.1 and ISO 10646 (with
1997 amendments) standards nor to the Korean national standard, KSC-5700. The current mappings
are considered obsolete by all of the standards organizations noted above.
The fix provides a set of standards-conformant iconv converter tables for converting between
eucKR and Unicode/UTF-8. Specifically, the obsolete region of 0x3d2e–0x4dff has been remapped to the
0xac00–0xd7ff region specified in Unicode 2.1 for Hangul.
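To gauge whether stored data is affected, one option is to scan it for code units in the obsolete region.
The following Python sketch does this for UCS2 data; the function name is hypothetical, and the data is
assumed to be big-endian without a byte order mark.

```python
OBSOLETE_FIRST = 0x3D2E  # start of the obsolete region named above
OBSOLETE_LAST = 0x4DFF   # end of the obsolete region named above


def find_obsolete_units(data: bytes) -> list:
    """Return (byte offset, code unit) pairs that fall in the obsolete region.

    Assumes big-endian UCS2 with no byte order mark; a trailing odd byte
    is ignored.
    """
    hits = []
    for offset in range(0, len(data) - 1, 2):
        unit = int.from_bytes(data[offset:offset + 2], "big")
        if OBSOLETE_FIRST <= unit <= OBSOLETE_LAST:
            hits.append((offset, unit))
    return hits


if __name__ == "__main__":
    sample = bytes.fromhex("00413d2eac004dff")
    for offset, unit in find_obsolete_units(sample):
        print(f"offset {offset}: 0x{unit:04x}")
```

A hit only indicates a candidate. Later Unicode versions assign other characters in part of this range
(for example, CJK Unified Ideographs Extension A), so data written by newer systems may use these values
legitimately.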
Without this modification, data cannot be shared with any other system that conforms to the
Unicode 2.1/ISO 10646/KSC-5700 standards.
Affected iconv conversions are any conversions between eucKR and UCS2 or UTF-8.
The iconv conversion tables affected by this modification are eucKR=ucs2 and ucs2=eucKR. These tables are
shared by both UCS2 and UTF-8 conversions.
No compatibility problems are anticipated. However, if compatibility concerns arise with regard to persistent
data stored either in Unicode (UCS2) or UTF-8 on an HP-UX system, it is recommended that the previously
installed ucs2=eucKR table be saved and renamed prior to installation of this fix. Persistent data can then be
converted back to eucKR using this old table and then reconverted to the correct Unicode/UTF-8
representation.
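The two-step recovery described above can be sketched as follows. This is an illustrative Python model,
not the HP-UX procedure itself: the saved ucs2=eucKR table is represented by a dictionary, Python's
built-in euc_kr codec stands in for the corrected table, and the single Hangul entry in the toy table is a
placeholder rather than a value taken from the real table.

```python
def recover(ucs2_data: bytes, old_ucs2_to_euckr: dict) -> str:
    """Convert old-table UCS2 data back to eucKR, then decode it correctly.

    ucs2_data is assumed to be big-endian UCS2 without a byte order mark.
    old_ucs2_to_euckr models the saved old table: code unit -> eucKR bytes.
    """
    euckr = bytearray()
    for i in range(0, len(ucs2_data), 2):
        unit = int.from_bytes(ucs2_data[i:i + 2], "big")
        # Step 1: undo the old conversion using the saved table.
        euckr += old_ucs2_to_euckr[unit]
    # Step 2: reconvert with a standards-conformant eucKR decoder.
    return bytes(euckr).decode("euc_kr")


# PLACEHOLDER table for illustration only; the real entries must come
# from the saved ucs2=eucKR table, not from this example.
TOY_OLD_TABLE = {
    0x0041: b"A",
    0x3D2E: bytes.fromhex("b0a1"),  # hypothetical old value for one syllable
}

if __name__ == "__main__":
    fixed = recover(bytes.fromhex("00413d2e"), TOY_OLD_TABLE)
    print([hex(ord(ch)) for ch in fixed])
```

Going through eucKR avoids having to build a direct old-to-new Unicode mapping, which is why saving the
old table before installing the fix matters.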
Japan Vendor Council iconv Converters [11i v1.6]
New iconv converters have been introduced to improve data interoperability within
Japanese computing environments. The following items are related to this change:
• New Japanese iconv conversion tables are now available that conform to the Open Group Japanese
Vendor Council (TOG/JVC) CDE/Motif Technical working group recommendations that ensure the
interoperability of ISO 10646/Unicode/JIS X0221 within Japanese computing environments.
Table 2-25 Mapping Between Old and New Unicode Characters for Japanese

Old UCS2   New UCS2   Old UTF-8   New UTF-8   Character Name
0xFFE3     0x203E     0xefbfa3    0xe280be    Overline
0x2015     0x2014     0xe28095    0xe28094    Em Dash
0xFFE5     0x00A5     0xefbfa5    0xc2a5      Yen Sign
0x02DC     0xFF5E     0xcb9c      0xefbd9e    Full-width Tilde
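
The UTF-8 columns of Table 2-25 follow directly from the UCS2 columns, since UTF-8 is a byte encoding of
the same code points; this is consistent with one set of tables serving both conversions. The Python
sketch below, with hypothetical helper names, recomputes each UTF-8 value from its UCS2 value and checks
it against the table.

```python
# (old UCS2, new UCS2, old UTF-8, new UTF-8, name) rows from Table 2-25.
ROWS = [
    (0xFFE3, 0x203E, "efbfa3", "e280be", "Overline"),
    (0x2015, 0x2014, "e28095", "e28094", "Em Dash"),
    (0xFFE5, 0x00A5, "efbfa5", "c2a5", "Yen Sign"),
    (0x02DC, 0xFF5E, "cb9c", "efbd9e", "Full-width Tilde"),
]


def utf8_hex(code_point: int) -> str:
    """Return the UTF-8 encoding of a code point as a hex string."""
    return chr(code_point).encode("utf-8").hex()


def check_rows(rows) -> bool:
    """True if every UTF-8 value in the rows matches its UCS2 value."""
    return all(
        utf8_hex(old) == old_u8 and utf8_hex(new) == new_u8
        for old, new, old_u8, new_u8, _name in rows
    )


if __name__ == "__main__":
    print(check_rows(ROWS))
```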