HP Reference Information Storage System Version 1.6 User Guide revision 2 (T3559-90810, August 2007)

Table 9 Supported character sets
Supported character
set
Description
ISO-8859-1
Western European, extended ASCII
WINDOWS-1252
(Code pages supported by Windows) Latin 1
US-AS CII
7-bit American Standard Code for Information Interchange
UTF-8
Universal (all langua g es)
ISO-8859-2
Eastern European
KOI8-R
Cyrillic (Russian and Bulgarian)
ISO-8859-5
Cyrillic (Bulgarian, Belarusian, Russian)
WINDOWS-1254
(Code pages supported by Windows) Turkish
GB18030
Chinese (Mainland)
BIG5
Chinese (Taiwan)
GB23 12
Chinese (Mainland)
EUC-KR
Korean
KS_C-5601-198 7
Korean
ISO-20 22-JP
Japanese
EUC-JP
Japanese
SHIFT-JIS
Japanese
Matching words
Matching words is not case-sensitive: cat, Cat, cAt,andCAT all match. Corresponding uppercase and
lowercase letters, such as A and a, are treated the sam e in all respects.
There are two kinds of query words: words tha t contain occurrences of one or both of the wildcard
characters * and ?, and literal words that do not contain wildcards.
Literal words that do not contain wildcards
Words containing occurrences of one or both wildcard characters * and ?
A literal word in a query expression matches the same word, character for character (case ignored) , in
an archived document. A word with wildcard characters (* or ?) matches a d ocument word in the same
way, character by character, except for the following:
A ? matches any single character in a document word. For example, b??t matches beat, beet,
boat, blot, best, bust, bout,andsoon.
An * matches any sequence of characters in a document word, including a sequence of no
characters. For example, f*t matches the document words foot, feet, t, fault,andft;andf*
matches any document word beginning with f.
You can use any number of wildcard characters (* or ?) in a query word, but you cannot use a wildcard
at the be ginning of a query word. An error message results. For example, *ion is not a valid query.
Matching similar words
Topics include:
Fuzzy words, page 36
Measuring word similarity, page 36
User Guide
35