HP-UX 11i December 2003 Release Notes
Internationalization
Corrected Character Mappings to iconv(1) and iconv(3C)
Chapter 16
310
Affected iconv conversions are conversions between sjis and UCS2 or UTF-8 as well as
conversions between eucJP and UCS2 or UTF-8.
sjis=ucs2, ucs2=sjis, eucJP=ucs2 and ucs2=eucJP are the affected iconv conversion
tables. These tables are shared by both UCS2 and UTF-8 conversions.
No compatibility problems are anticipated. However, if compatibility concerns arise with
regard to persistent data stored either in Unicode (UCS2) or UTF-8 on an HP-UX
system, it is possible to generate a simple conversion script to search for each occurrence
of an incorrect value in either UCS2 or UTF-8 and convert it to the correct value, based
on the following mappings:
Correction for Korean
A patch provides a defect fix to address standards non-conformance for Korean Unicode
(UCS2)/UTF-8 character mappings.
The currently supplied Korean iconv converter tables do not conform to the Unicode 2.1
and ISO-10646 (with 1997 amendments) standards in addition to the Korean national
standard, KSC-5700. The current mappings are considered obsolete by all noted
standards organizations.
The enhancement provides a set of standards-conformant iconv converter tables for
converting between eucKR and Unicode/UTF-8. Specifically, the obsolete region of
0x3d2e - 0x4dff has been re-mapped to the 0xac00 - 0xd7ff region specified in Unicode 2.1
for Hangul.
Without this modification, it is impossible to share data with any other system which is
standards-conformant in adhering to the Unicode 2.1/ISO-10646/KSC-5700 standards.
Affected iconv conversions are any conversions between eucKR and UCS2 or UTF-8.
The iconv conversion tables affected by this modification are eucKR=ucs2 and
ucs2=eucKR. These tables are shared by both UCS2 and UTF-8 conversions.
0x818F 0xA1EF 0xFFE5 0x00A5 Yen Sign
n/a 0x8FA2B7 0x02DC 0xFF5E Full-width Tilde
Table 16-14 Mapping for UCS2/UTF-8 Conversion
Old UCS2 UCS2 Old UTF-8 UTF-8 Char Name
0xFFE3 0x203E 0xefbfa3 0xe280be Overline
0x2015 0x2014 0xe28095 0xe28094 Em Dash
0xFFE5 0x00A5 0xefbfa5 0xc2a5 Yen Sign
0x02DC 0xFF5E 0xcb9c 0xefbd9e Full-width Tilde
Table 16-13 Changes Applied to Shift-JIS/EUC—WUCS2/UTF-8
Conversion (Continued)
sjis eucJP
incorrect
UCS2
correct
UCS2
Character Name