HP StorageWorks Reference Information Storage System V1.0 User Guide (May 2004)

Query Expression Syntax and Matching Chapter 5:
Query Syntax and Matching
HP StorageWorks Reference Information Storage System User Guide, April 2004 5-3
Query Expression Syntax and Matching
Query Expressions Are Composed of Words
Query expressions can be as simple or as complex as you need. The essential
idea behind document retrieval is that query words are compared with
document words to find a match. But you can also:
Combine query words using logical (Boolean) operators (
AND
,
OR
,
NOT
) –
see
Boolean Query Expressions
, on page 5-10
Look for document words that are textually similar, but not necessarily
identical, to query words – see
Matching Similar Words
, on page 5-8
Look for word sequences in a document: words that are near each other,
and in a particular order – see
Matching Word Sequences
, on page 5-9
Together, these query constructs provide considerable power to find what you
need, provided you learn to use them well.
The way a query expression is interpreted involves an operation similar to
how documents are indexed when they are archived. The text is parsed
(broken down) into
words
. The remaining characters are ignored, as
separators. A query expression is fundamentally composed of words, no
matter how complex it is.
For indexing and searching, a word need not belong to a natural language,
such as English. For example,
wt6_ht3
is a legitimate document word or query
word. Some query words even contain wildcards, as in
f??t
.
See Also
Matching Words
, on page 5-7, for information on words with wildcards.
Word Characters
When RISS examines a query expression to determine its words, some
characters are not included in query words, but are treated as word
separators. When a document is archived, indexing determines the document
words, available for searching, in the same way.