HP StorageWorks Reference Information Storage System V1.0 User Guide (May 2004)
Query Expression Syntax and Matching Chapter 5:
Query Syntax and Matching
HP StorageWorks Reference Information Storage System User Guide, April 2004 5-7
Stop Words
Some words, called
stop words
, are ignored altogether by the search engine
and the document indexer, so there is no sense using them in query expres-
sions. Stop words correspond to very common English words (or contraction
endings: s, t). These are the stop words:
a
,
and
,
are
,
as
,
at
,
be
,
but
,
by
,
for
,
if
,
in
,
into
,
is
,
it
,
no
,
not
,
of
,
on
,
or
,
s
,
such
,
t
,
that
,
the
,
their
,
then
,
there
,
these
,
they
,
this
,
to
,
was
,
will
,
with
Note:
The current set of stop words is appropriate only for English
documents and queries. Nevertheless, you can archive non-
English documents and you can use non-English (ISO 8859–1
Latin–1) letters in queries.
See Also
•
Query Expression Examples
, on page 5-13, for examples of query expres-
sions with stop words.
Matching Words
Matching of words is not case-sensitive:
cat
,
Cat
,
cAt
, and
CAT
all match each
other. Corresponding uppercase and lowercase letters, such as
A
and
a
, are
treated the same in all respects.
There are two kinds of query words: words that contain occurrences of one or
both of the
wildcard
characters
*
and
?
, and
literal words
that do not contain
wildcards.
A literal word in a query expression matches the same word, character for
character (case ignored), in an archived document. A word with wildcard
characters (
*
or
?
) matches a document word in the same way, character by
character, except:
• ?
– matches any single character in a document word. For example,
b??t
matches
beat
,
beet
,
boat
,
blot
,
best
,
bust
,
bout
, and so on.
• *
– matches any sequence of characters in a document word, including a
sequence of no characters. For example,
f*t
matches the document words
foot
,
feet
,
fit
,
fault
, and
ft
;
f*
matches any document word beginning with
f
.