HP IAP Version 2.0 User Guide (November 2008)

Understanding searching and document indexing
You can search for any documents archived in your repository (or any other repositories to which you
have access), w
hether the documents are email messages or les. When you search for a d ocument,
your query is ch
ecked against an index of words that is updated each time a document is archived.
Indexing the co
ntents of a document involves cataloging the document words to p repare them for later
searching. Separators (such as punctuation) between words are ignored during indexing. Note that there
is a time delay from when les are archived to when they are indexed. Do cuments archived less than an
hour ago may or may not appear in query or search results depending on the system’s conguration.
You can search the contents of a document only if the contents have been indexed. You can search for
other kinds o
f les only by using external identifying information.
Indexed document types
In addition to email messages, the following les are indexed:
Plain text les
Rich text les (.rtf)
HTML (HyperText Markup Language) les
Files used by the following Microsoft Ofce programs: Word, Excel, PowerPoint, and Acc ess
PDF (Portable Document Format) les viewed with Adobe Acrobat Reader
Zip les
Embedded messages (RFC 822 messages)
NOTE:
Email message formatting has no bearing on indexing. O nly the words you see in your email client are
indexing candidates. Invisible source-code words, such as HTM L markup tags, a re ignored.
NOTE:
For zip les and embedded messages, the content inside the les is expanded and indexed.
We support indexing of MS Ofce les for MS Ofce 2007 and prior releases.
Message MIME t ypes (advanced users)
An email m essage can contain message parts of possibly different MIME (Multipurpose Internet Mail
Extensions) Content-Types. The following Content-Types are indexed and each corresponds to one of
the indexed document t ypes:
text/xml
text/plain
text/html
application/rtf
application/msword
application/vnd.ms-excel
application/vnd.ms-powerpoint
application/msaccess
application/pdf
application/zip
12
IAP overview