HP-UX 11i v3 Internationalization Features

11
ucs2le
ucs4 -
ucs4be -
ucs4le
utf8
Note 1: default endianness depends on the operating system. HP-UX is a big-endian OS. On HP-
UX, the iconv command interprets UTF-16/32 data as big-endian in the absence of a byte order
mark.
Note 2: On HP-UX, the iconv command treats UCS-2 and UTF-16, and UCS-4 and UTF-32 as
functionally equivalent.
Conversions Between Unicode Variants and Local Character Sets
The Unicode variants previously supported in bidirectional conversions between local codesets and
Unicode were limited to ucs2 and utf8. These conversions are now extended to other Unicode
variants. In many cases, all Unicode variants listed above are supported in bidirectional
conversions. In some cases, explicit conversions to ucs2be may not be available; however, as
stated earlier, these are functionally identical to conversions to ucs2.
You can obtain a list of all supported conversions by executing the iconv command with the new ‘-l’
option. For more information, see the iconv command entry later in this document or refer to the
iconv(1) manual page.
Updates for Specific Character Sets
This section provides details of updated character sets.
Japanese Mainframe Character Set Converter Support
The iconv command now supports an extended area of Japanese mainframe character sets,
including conversions to and from several Unicode variants (including UTF-8 and UCS-4).
Mainframe (NEC JIPS, Fujitsu JEF, and Hitachi KEIS) character set converters, provided by iconv,
have been enhanced to convert vendor defined characters, located in an extended character set,
from/to SJIS, eucJP and Unicode.
Bidirectional conversions between Fujitsu JEF character sets (jefc, jefcEX, jefk, jefkEX, jefc9p,
jefc9pEX, jefck9p, and jefk9pEX) and Unicode variants (ucs2be, ucs2le, ucs4, ucs4be, and ucs4le)
have been added.
Bidirectional conversions between NEC JIPS character sets (jipsj, jipsjEX, jipsec, jipsecEX, jipsek,
and jipsekEX) and Unicode variants (ucs2be, ucs2le, ucs4, ucs4be, and ucs4le) have been added.
Bidirectional conversions between Hitachi KEIS character sets (keis7k, keis7kEX, keis8k, keis8kEX,
keis7c, keis7cEX, keis8c, and keis8cEX) and Unicode variants (ucs2be, ucs2le, ucs4, ucs4be, and
ucs4le) have been added.