Open System Services Shell and Utilities Reference Manual (G06.29+, H06.08+, J06.03+)

locale(4) OSS Shell and Utilities Reference Manual
blank Defines blank characters. If this keyword is not specified, the space and horizontal tab
characters are included in this character class. Any characters defined by this state-
ment are automatically included in the space class.
toupper Defines the mapping of lowercase characters to uppercase characters. Operands for
this keyword consist of comma-separated character pairs. Each character pair is
enclosed in () (parentheses) and separated from the next pair by a ; (semicolon). The
first character in each pair is considered a lowercase character; the second character is
considered an uppercase character. Only characters defined by the lower and upper
keywords can be specified. If toupper is not defined, a through z is mapped to A
through Z by default.
tolower Defines the mapping of uppercase characters to lowercase characters. Operands for
this keyword consist of comma-separated character pairs. Each character pair is
enclosed in () (parentheses) and separated from the next pair by a ; (semicolon). The
first character in each pair is considered an uppercase character; the second character
is considered a lowercase character. Only characters defined by the lower and upper
keywords can be specified.
The tolower keyword is optional. If this keyword is not specified, the mapping
defaults to the reverse mapping of the toupper keyword, if specified. If the toupper
and tolower keywords are both unspecified, the mapping for each defaults to that of the
C locale.
Additional keywords can be provided to define new character classifications. For example:
vowel <a>;<e>;<i>;<o>;<u>;<y>
The LC_CTYPE category does not support multicharacter elements. For example, the German
Eszet character is traditionally classified as a lowercase letter. There is no corresponding upper-
case letter; in proper capitalization of German text, the Eszet character is replaced by the two
characters SS. This kind of conversion is outside of the scope of the toupper and tolower key-
words.
The following is an example of a possible LC_CTYPE category listed in a locale definition
source file:
LC_CTYPE
#"alpha" is by default "upper" and "lower"
#"alnum" is by default "alpha" and "digit"
#"print" is by default "alnum", "punct" and the space character
#"graph" is by default "alnum" and "punct"
#"tolower" is by default the reverse mapping of "toupper"
#
upper <A>;<B>;<C>;<D>;<E>;<F>;<G>;<H>;<I>;<J>;<K>;<L>;<M>;\
<N>;<O>;<P>;<Q>;<R>;<S>;<T>;<U>;<V>;<W>;<X>;<Y>;<Z>
#
lower <a>;<b>;<c>;<d>;<e>;<f>;<g>;<h>;<i>;<j>;<k>;<l>;<m>;\
<n>;<o>;<p>;<q>;<r>;<s>;<t>;<u>;<v>;<w>;<x>;<y>;<z>
#
digit <zero>;<one>;<two>;<three>;<four>;<five>;<six>;\
<seven>;<eight>;<nine>
#
space <tab>;<newline>;<vertical-tab>;<form-feed>;\
<carriage-return>;<space>
#
cntrl <alert>;<backspace>;<tab>;<newline>;<vertical-tab>;\
1116 Hewlett-Packard Company 527188-021