User Guide
Content Options 171
-indmimeexclude
Syntax: -indmimeexclude mime_1 [mime_n] ...
Specifies that only those MIME types which match the expressions be followed but
not indexed.
On Windows NT, you should include double quotes around the argument to protect
the special characters such as (*). On UNIX, you should use single quotes. Note that
this is only required when you run the indexing job from a command line. Quotes are
not necessary within a command file (-cmdfile).
Use this option to gather some documents, such as HTML tables of contents, to gain
access to other documents for indexing. The -mimeexclude option, on the other
hand, prevents specified documents from being followed at all. For the mime
variable, you can include the asterisk ( * ) wildcard for text strings. For example:
’text/*’
You cannot use the question mark ( ? ) wildcard, and the -regexp option does not
allow you to use regular expressions.
-indmimeinclude
Syntax: -indmimeinclude mime_1 [mime_n] ...
Specifies that only those MIME types which match the expressions be followed and
indexed.
The -mimeinclude option would not allow you to index desired documents if the
starting URL is not followed. For the mime variable, you can include the asterisk ( * )
wildcard for text strings. For example:
’text/*’
On Windows NT, you should include double quotes around the argument to protect
the special character (*). On UNIX, you should use single quotes. Note that this is
only required when you run the indexing job from a command line. Quotes are not
necessary within a command file (-cmdfile).
You cannot use the question mark ( ? ) wildcard, and the -regexp option does not
allow you to use regular expressions.
Example
If you want to index all Word documents at http://web.verity.com, you cannot use:
vspider -collection collname -style style_dir -start
http://web.verity.com -mimeinclude ’application/msword’
This is because the starting point does not match the -mimeinclude criteria. Now,
you can use -indmimeinclude to follow all documents (unless, of course, you have
specified any of the exclude options) and index only those documents that match
your criteria. Simply replace -mimeinclude with -indmimeinclude in the above
example.