HP-UX 11i June 2001 Release Notes
New and Changed Internationalization Features
Corrected Character Mappings to iconv(1) and iconv(3C)
Chapter 15 283
Correction for Japanese
A patch corrects four incorrect Japanese character mappings that occur
between Shift-JIS/EUC and Unicode (UCS2)/UTF-8.
The following table summarizes the changes applied:
Affected iconv conversions are conversions between sjis and UCS2 or
UTF-8 as well as conversions between eucJP and UCS2 or UTF-8.
sjis=ucs2, ucs2=sjis, eucJP=ucs2 and ucs2=eucJP are the affected iconv
conversion tables. These tables are shared by both UCS2 and UTF-8
conversions.
No compatibility problems are anticipated. However, if compatibility
concerns arise with regard to persistent data stored either in Unicode
(UCS2) or UTF-8 on an HP-UX system, it is possible to generate a simple
conversion script to search for each occurrence of an incorrect value in
either UCS2 or UTF-8 and convert it to the correct value, based on the
following mappings:
sjis eucJP incorrect
UCS2
correct
UCS2
Character
Name
0x8150 0xA1B1 0xFFE3 0x203E Overline
0x815C 0xA1BD 0x2015 0x2014 Em Dash
0x818F 0xA1EF 0xFFE5 0x00A5 Yen Sign
n/a 0x8FA2B7 0x02DC 0xFF5E Full-width
Tilde
Old UCS2 UCS2 Old UTF-8 UTF-8 Char
Name
0xFFE3 0x203E 0xefbfa3 0xe280be Overline
0x2015 0x2014 0xe28095 0xe28094 Em Dash
0xFFE5 0x00A5 0xefbfa5 0xc2a5 Yen Sign
0x02DC 0xFF5E 0xcb9c 0xefbd9e Full-width
Tilde