HP-UX 11i June 2003 Release Notes

New and Changed Internationalization Features
Corrected Character Mappings to iconv(1) and iconv(3C)
Chapter 16
312
Affected iconv conversions are conversions between sjis and UCS2 or UTF-8 as well as
conversions between eucJP and UCS2 or UTF-8.
sjis=ucs2, ucs2=sjis, eucJP=ucs2 and ucs2=eucJP are the affected iconv conversion
tables. These tables are shared by both UCS2 and UTF-8 conversions.
No compatibility problems are anticipated. However, if compatibility concerns arise with
regard to persistent data stored either in Unicode (UCS2) or UTF-8 on an HP-UX
system, it is possible to generate a simple conversion script to search for each occurrence
of an incorrect value in either UCS2 or UTF-8 and convert it to the correct value, based
on the following mappings:
Correction for Korean
A patch provides a defect fix to address standards non-conformance for Korean Unicode
(UCS2)/UTF-8 character mappings.
The currently supplied Korean iconv converter tables do not conform to the Unicode 2.1
and ISO-10646 (with 1997 amendments) standards in addition to the Korean national
standard, KSC-5700. The current mappings are considered obsolete by all noted
standards organizations.
The enhancement provides a set of standards-conformant iconv converter tables for
converting between eucKR and Unicode/UTF-8. Specifically, the obsolete region of
0x3d2e - 0x4dff has been re-mapped to the 0xac00 - 0xd7ff region specified in Unicode 2.1
for Hangul.
Without this modification, it is impossible to share data with any other system which is
standards-conformant in adhering to the Unicode 2.1/ISO-10646/KSC-5700 standards.
Affected iconv conversions are any conversions between eucKR and UCS2 or UTF-8.
The iconv conversion tables affected by this modification are eucKR=ucs2 and
ucs2=eucKR. These tables are shared by both UCS2 and UTF-8 conversions.
0x818F 0xA1EF 0xFFE5 0x00A5 Yen Sign
n/a 0x8FA2B7 0x02DC 0xFF5E Full-width Tilde
Table 16-13 Mapping for UCS2/UTF-8 Conversion
Old UCS2 UCS2 Old UTF-8 UTF-8 Char Name
0xFFE3 0x203E 0xefbfa3 0xe280be Overline
0x2015 0x2014 0xe28095 0xe28094 Em Dash
0xFFE5 0x00A5 0xefbfa5 0xc2a5 Yen Sign
0x02DC 0xFF5E 0xcb9c 0xefbd9e Full-width Tilde
Table 16-12 Changes Applied to Shift-JIS/EUCWUCS2/UTF-8
Conversion (Continued)
sjis eucJP
incorrect
UCS2
correct
UCS2
Character Name