HP StorageWorks Reference Information Storage System V1.0 User Guide (May 2004)
RISS Concepts Chapter 1:
RISS Overview
HP StorageWorks Reference Information Storage System User Guide, April 2004 1-5
The exact set of file types that are considered loose office documents (that is,
whose contents are indexed) depends on the RISS configuration.
Note:
Email message formatting has no bearing on indexing. Only the
words you see in your email client are indexing candidates. Invis-
ible source-code words, such as HTML markup tags, are ignored.
Message MIME Types (Advanced Users)
An email message can contain message parts of possibly different MIME
(Multipurpose Internet Mail Extensions) Content-Types. The following
Content-Types are indexed and each corresponds to one of the loose office
document types:
• text/plain
• text/html
• application/msword
• application/vnd.ms-excel
• application/vnd.ms-powerpoint
• application/msaccess
• application/pdf
• application/ms-tnef
An email message that is entirely plain text, not MIME, is indexed. Also, if you
attach an email message to another email message, the attached email
message is not indexed.
Hidden text, including hidden columns in Excel and deleted text in Office
documents that may have been saved in a previous version using the Fast
Save feature, may be indexed due to the type of indexing technology used by
RISS.
See Also
•
Chapter 5,
Query Syntax and Matching
, for details on stop words and
just which document contents are indexed.