Open System Services Shell and Utilities Reference Manual (G06.25+, H06.03+)

locale(4) OSS Shell and Utilities Reference Manual
The use of the ellipsis keyword results in a locale that may collate differently when compiled
with different character set description (charmap) source les. For this reason, the localedef
command will issue a warning when the ellipsis keyword is encountered.
The UNDEFINED special symbol includes all coded character set values not specied expli-
citly or with an ellipsis symbol. These characters are inserted in the character collation order at
the point indicated by the UNDEFINED special symbol in the order of their character code set
values. If no UNDEFINED special symbol exists and the collation order does not specify all
collation elements from the coded character set, a warning is issued and all undened characters
are placed at the end of the character collation order.
The following is an example of a collation order statement in the LC_COLLATE locale
denition source le category:
order_start forward;backward
UNDEFINED IGNORE;IGNORE
<LOW>
<space> <LOW>;<space>
.. <LOW>;...
<a> <a>;<a>
<a-acute> <a>;<a-acute>
<a-grave> <a>;<a-grave>
<A> <a>;<A>
<A-acute> <a>;<A-acute>
<A-grave> <a>;<A-grave>
<ch> <ch>;<ch>
<Ch> <ch>;<Ch>
<s> <s>;<s>
<ss> <s><s>;<s><s>
<eszet> <s><s>;<eszet><eszet>
... <HIGH>;...
<HIGH>
order_end
This example is interpreted as follows:
The UNDEFINED special symbol indicates that all characters not specied in the
denition (either explicitly or by the ellipsis symbol) are ignored for collation purposes.
All collating elements between <space> and <a> have the same primary equivalence
class and individual secondary weights based on their coded character set values.
All versions of the letter a uppercase and lowercase, and with or without diacriticals
belong to the same primary collation class.
The <c><h> multicharacter collating element is represented by the <ch> collating sym-
bol and belongs to the same primary equivalence class as the <C><h> multicharacter
collating element.
The <eszet> character is collated as an <s><s> string. That is, one <eszet> character is
expanded to two characters before comparing.
The LC_CTYPE Category
The LC_CTYPE category of a locale denition source le denes character classication, case
conversion, and other character attributes. This category begins with an LC_CTYPE category
header and terminates with an END LC_CTYPE category trailer.
1114 Hewlett-Packard Company 527188-003