User Guide

Content options
-mimeinclude
Syntax:
-mimeinclude mime_1 [mime_n] ...
Specifies MIME types to be included.
On Windows, enclose the argument in double quotation marks to protect special
characters, such as the asterisk (*). On UNIX, use single quotation marks. Quoting is required
only when you run the indexing job from a command line. Quotation marks are not necessary
within a command file (the -cmdfile option).
The default is to include all MIME types. For the mime variable, you can include the asterisk (*)
wildcard for text strings; for example:
'text/*'
You cannot use the question mark (?) wildcard with this option, and specifying the -regexp
option does not enable regular expressions for it.
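The asterisk-only matching described above can be illustrated with a minimal Python sketch. It is not Verity Spider's implementation: the function names star_match and included are hypothetical, and case-sensitive comparison against the whole MIME type is an assumption.

```python
import re

def star_match(pattern: str, mime_type: str) -> bool:
    """Return True if mime_type matches pattern; only '*' acts as a wildcard."""
    # Escape each literal piece so that characters such as '?' stay literal,
    # then join the pieces with '.*' to stand in for each asterisk.
    regex = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    return re.fullmatch(regex, mime_type) is not None

def included(mime_type: str, patterns: list[str]) -> bool:
    """With no patterns given, every MIME type is included (the default)."""
    return not patterns or any(star_match(p, mime_type) for p in patterns)
```

In this sketch, star_match("text/*", "text/html") is true, while star_match("text/?tml", "text/html") is false because the question mark is treated as a literal character.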
-mindocsize
Syntax:
-mindocsize integer
Specifies the minimum size, in kilobytes, for documents to be indexed. Any document smaller
than the value specified by the -mindocsize option is ignored.
The default is to index documents of any size.
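The size rule can be sketched in Python as follows. This is an illustration only: the function name passes_mindocsize is hypothetical, and treating one kilobyte as 1024 bytes is an assumption, because the guide does not define the unit more precisely.

```python
from typing import Optional

BYTES_PER_KB = 1024  # assumption: the guide does not say 1000 or 1024

def passes_mindocsize(size_bytes: int, mindocsize_kb: Optional[int] = None) -> bool:
    """Return True if a document of size_bytes would be indexed."""
    if mindocsize_kb is None:
        # Option not given: documents of any size are indexed.
        return True
    # Only documents smaller than the threshold are ignored.
    return size_bytes >= mindocsize_kb * BYTES_PER_KB
```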
-skip
Type: Web crawling only
Syntax:
-skip HTML_tag "exp"
Specifies that Verity Spider not index any HTML document that contains the text of exp within
the given HTML_tag. For multiple HTML_tag and exp combinations, use multiple instances of
the -skip option.
You can use wildcard expressions, where the asterisk (*) is for text strings and the question mark
(?) is for single characters; for example:
'/my_doc*/year199?'
On Windows, enclose the argument in double quotation marks to protect special
characters, such as the asterisk (*). On UNIX, use single quotation marks. Quoting is required
only when you run the indexing job from a command line. Quotation marks are not necessary
within a command file (the -cmdfile option).
If you use backslashes, you must double them so that they are properly escaped; for example:
C:\\test\\docs\\path
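If you generate arguments from a script, the doubling can be automated. The following Python sketch assumes only the rule stated above; the helper name escape_backslashes is hypothetical.

```python
def escape_backslashes(path: str) -> str:
    """Double every backslash so that it is properly escaped."""
    return path.replace("\\", "\\\\")

# A raw string keeps the single backslashes of the original Windows path.
print(escape_backslashes(r"C:\test\docs\path"))  # prints C:\\test\\docs\\path
```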
To use regular expressions, also specify the -regexp option.
Example 1
To skip all HTML documents that contain the word "personnel" in the Title element, use the
following:
-skip title "personnel"
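To make the behavior of Example 1 concrete, the following Python sketch approximates the check that -skip describes: it collects the text inside a given HTML tag and tests whether the wildcard expression occurs in it. It is not Verity Spider's implementation. The names TagText, wildcard_to_regex, and should_skip are hypothetical, and case-sensitive substring matching is an assumption.

```python
import re
from html.parser import HTMLParser

class TagText(HTMLParser):
    """Collect the text that appears inside one HTML tag."""

    def __init__(self, tag: str):
        super().__init__()
        self.tag = tag.lower()  # HTMLParser reports tag names in lowercase
        self.depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == self.tag and self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth:
            self.chunks.append(data)

def wildcard_to_regex(exp: str) -> str:
    """Translate '*' (text string) and '?' (single character) into a regex."""
    parts = []
    for ch in exp:
        if ch == "*":
            parts.append(".*")
        elif ch == "?":
            parts.append(".")
        else:
            parts.append(re.escape(ch))
    return "".join(parts)

def should_skip(html: str, tag: str, exp: str) -> bool:
    """Return True if the text inside tag contains a match for exp."""
    parser = TagText(tag)
    parser.feed(html)
    parser.close()
    text = "".join(parser.chunks)
    return re.search(wildcard_to_regex(exp), text, re.DOTALL) is not None
```

With this sketch, should_skip(page, "title", "personnel") is true for a page whose Title element contains "personnel", and false for a page that mentions the word only in its body.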