HP StorageWorks Reference Information Storage System V1.1 User Guide (February 2005)

Chapter 6: Query syntax and matching
Query expression syntax and matching
Word characters and separators
Word characters include all uppercase and lowercase letters, digits, and the
following additional characters:
_ (underscore)
# (number/pound/hash sign)
& (ampersand)
All other characters are separators (except, in queries, the wildcards ? and *,
and the special query characters ~, , -, and !).
However, the following rule also applies: &&, by itself, is not a word. It is
a Boolean operator. When combined with at least one more word character, &&
can be part of a word. For example, a&&b is a word.
Query analysis and document indexing are not case-sensitive. Uppercase and
lowercase letters are treated the same.
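The rules above can be sketched in Python. This is only an illustration of the stated rules for US ASCII text, not the system's actual implementation. The function name split_words is hypothetical, and the sketch assumes that only a token that is exactly && counts as the Boolean operator. It does not handle the query wildcards or the special query characters.

```python
import string

# Word characters per the rules above (US ASCII only): letters, digits, _, #, &.
WORD_CHARS = set(string.ascii_letters + string.digits + "_#&")

def split_words(text):
    """Split text into lowercase words; every non-word character is a separator."""
    words, current = [], []
    for ch in text:
        if ch in WORD_CHARS:
            current.append(ch)
        elif current:
            # A separator ends the word collected so far.
            words.append("".join(current))
            current = []
    if current:
        words.append("".join(current))
    # Assumption: only a token that is exactly "&&" is the Boolean operator.
    # Lowercasing reflects that matching is not case-sensitive.
    return [w.lower() for w in words if w != "&&"]

print(split_words("R&D budget: Q#4_2004 && a&&b"))
# prints ['r&d', 'budget', 'q#4_2004', 'a&&b']
```

Note that the hyphen in a string such as E-Mail acts as a separator, so the sketch yields the two words e and mail.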
Regular expression definition of English word characters
Note: This section provides information intended only for users familiar
with regular-expression notation.
The following regular expression provides, in succinct form, a complete
specification of English word characters (except for the treatment of && as a
non-word):
[A-Za-z0-9_#&]+
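As a quick check, the expression can be tried with Python's standard re module. The sketch below writes the character class without embedded spaces, since a space is a separator. It adds the && exception by hand, because the expression alone does not cover it. The helper name is_word is hypothetical and is not part of the product.

```python
import re

# Character class for English word characters, with no spaces inside the brackets.
WORD_RE = re.compile(r"[A-Za-z0-9_#&]+")

def is_word(token):
    """Return True if the whole token is a word under the rules in this section."""
    # Assumption: only a token that is exactly "&&" is excluded as the Boolean operator.
    return token != "&&" and WORD_RE.fullmatch(token) is not None

for token in ["a&&b", "&&", "item_#7", "e-mail"]:
    print(token, is_word(token))
```

Here a&&b and item_#7 are reported as words, while && (the Boolean operator) and e-mail (which contains a separator) are not.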
Letters and digits in different character sets
Letters and digits defined
All letters and digits are word characters. Exactly what the system considers
a letter or a digit depends on the character set encoding used. For the US ASCII
encoding, the letters are the uppercase and lowercase English letters (A-Za-z).