User Guide
Setting MIME Types 181
Setting MIME Types
You can use the MIME type criteria options -mimeinclude, -indmimeinclude,
-mimeexclude and -indmimeexclude to include or exclude MIME types.
Syntax restrictions
When you specify MIME type criteria, keep in mind the following restrictions.
Using the wildcard character (*)
The asterisk (*) wildcard character does not operate as a regular expression for the
value of the MIME type criteria. Instead you can only use it to replace the entire
MIME type or MIME sub-type.
For example, the following value is a valid substitute for text/html:
text/*
The following value is NOT a valid substitute for text/html:
text/h*
Multiple parameter values
When you specify a series of parameter values for a single instance of one of the
MIME type criteria, and you use quotes, you must enclose each separate parameter
value in single quotes.
For example:
-mimeinclude ’text/plain’ ’application/*’
If you enclose the entire sequence of parameter values,
-mimeinclude ’text/plain application/*’
the Verity Spider will consider the entire expression as a single value.
You can also use multiple instances of the MIME type criteria, each with a single
parameter value, where quotes are necessary only if you use the wildcard character
(*).
For example:
-mimeinclude text/plain
-mimeinclude ’application/*’.Setting MIME Types
MIME types and Web crawling
When you index a Web site, the Verity Spider evaluates your MIME Type criteria
against the "Content-Type" HTTP headers sent by the Web server hosting that Web
site. That Web server passes along MIME Type information based on its own internal
tables.