HP-UX 11i v3 Internationalization Features

5
Big5-2003 and CNS11643
System-level support is provided for Big5-2003 and CNS11643-2004, two Traditional Chinese
character sets.
The Big5 standard, established in 1984 and revised in 2003 (Big5-2003), was designed to
provide a small basic character set to encode the contemporary Traditional Chinese characters.
Big5-2003 was defined as a Big5 extension and has only two planes with 13,051 T-Chinese
characters and 778 symbols, totaling 13,829 characters.
CNS11643, established in 1992 and revised in 2004 (known as CNS11643-2004 or
CNS11643 version 3), was designed to provide sufficient character codes for encoding all the
contemporary Traditional Chinese characters. CNS11643-2004 was defined as a CNS11643
extension and can have 80 code planes. It can support 706,880 (8836 x 80) code points. The
original CNS11643 standard has only 16 code planes. All of the legal Traditional Chinese
characters of personal names allowed by the Taiwan government are included in plane 4 and
planes 12 though 15.
HP-UX11i v3 supports Big5-2003 in the zh_TW.big5 locale. Bitmap fonts, TrueType fonts, and
printing functionality have been enhanced to display and print these Big5-2003 characters. The
iconv command supports code conversion of Big5-2003 to and from CNS/EUC and Unicode
encodings. A codemap table, as well as phonetic and Tsang-Chieh dictionaries, is available for the
Big5-2003 character set.
HP-UX11i v3 supports CNS11643-2004 for planes 1 through 7 and 15 in the zh_TW.eucTW and
zh_TW.utf8 locales. Bitmap fonts, TrueType fonts, and printing functionality have been enhanced to
display and print these CNS11643-2004 characters. For the zh_TW.eucTW locale, display
support is limited to plane 1-4 characters only. The iconv command supports code conversion of
CNS11643-2004 to and from big5 and Unicode encodings. A codemap table and internal code
input methods are available for the CNS11643-2004 character set. Phonetic and Tsang-Chieh
dictionaries are available for the base (planes 1-4) CNS11643 character set.
KS X 1001
HP-UX 11i v3 supports the latest Korean national character set, KS X 1001:2002.
In 1998, the Euro and registered signs were added to KS X 1001. In 2002, the postal code mark
was added. HP-UX 11i v3 supports the three additional characters to comply with KS X
1001:2002.
Both ko_KR.eucKR and ko_KR.utf8 locales support the new standard. Bitmap fonts, TrueType fonts,
and iconv have been enhanced to support it.
GB18030
The GB18030 Standards calls for one-to-one mapping with the Unicode Standard. HP-UX11i v3
supports mapping between GB18030 and Unicode beyond plane 0 (BMP) to plane 16. The
support for bidirectional conversions between GB18030 and various Unicode variants is provided
by the new converter methods. In accordance with the Unicode 5.0 standard, mappings that
contain code points in BMP that are not valid Unicode characters have been eliminated from the
GB18030 to Unicode and Unicode to GB18030 converter tables.