User Guide

Content options
Where the -exclude option prevents Verity Spider from even following anything that matches
the specified expressions, the -indexclude option allows Verity Spider to follow anything while
leaving out of the index only what matches the specified expressions.
For document types, use the -indmimeexclude option instead.
Note: When specifying a URL, you must use full, absolute paths using the same format as appears in
the HTML hyperlink. If the link is relative, you must change it to absolute to use it with -indexclude.
See also -regexp.
-indinclude
Syntax:
-indinclude exp_1 [exp_n] ...
Specifies that only those files and paths in URLs that match the expressions be followed and
indexed. If you use backslashes, you must double them so that they are properly escaped; for
example:
C:\\test\\docs\\path
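As a rough illustration of the doubling rule, the following Python sketch escapes a Windows path before it is used as an expression. The helper name double_backslashes is hypothetical and is not part of Verity Spider.

```python
def double_backslashes(path: str) -> str:
    """Return the path with every backslash doubled (hypothetical helper)."""
    return path.replace("\\", "\\\\")


# A raw string keeps the single backslashes of the original Windows path.
escaped = double_backslashes(r"C:\test\docs\path")
print(escaped)
```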
You can use wildcard expressions, in which the asterisk (*) matches any text string and the
question mark (?) matches any single character; for example:
'/my_doc*/year199?'
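The wildcard rules above can be approximated with a short Python sketch. It is only an illustration: the function names are hypothetical, and it assumes that an expression must match the whole string and that the asterisk may also match a slash, neither of which is confirmed here for Verity Spider itself.

```python
import re


def wildcard_to_regex(pattern: str) -> re.Pattern:
    """Translate * (any text string) and ? (any single character) to a regex."""
    parts = []
    for ch in pattern:
        if ch == "*":
            parts.append(".*")   # any run of characters, including none
        elif ch == "?":
            parts.append(".")    # exactly one character
        else:
            parts.append(re.escape(ch))  # everything else is literal
    return re.compile("".join(parts))


def matches(pattern: str, text: str) -> bool:
    """Assumption: the expression has to cover the entire string."""
    return wildcard_to_regex(pattern).fullmatch(text) is not None


# The sample paths below are made up for illustration.
for path in ("/my_docs/year1998", "/my_docs/year19", "/my_docs/year1998x"):
    print(path, matches("/my_doc*/year199?", path))
```

Under these assumptions, only the first sample path matches: the second has no character for the question mark to stand in for, and the third has one character too many.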
On Windows, enclose the argument in double-quotation marks to protect special characters,
such as the asterisk (*). On UNIX, use single-quotation marks. Quotation marks are required only
when you run the indexing job from a command line; they are not necessary within a command
file (the -cmdfile option).
To use regular expressions, also specify the -regexp option.
Where the -include option prevents Verity Spider from even following anything that does not
match the specified expressions, the -indinclude option allows Verity Spider to follow anything
while indexing only what matches the specified expressions.
Example
If you want to index all documents that include "search" in the URL at http://web.verity.com,
you cannot use the following:
vspider -collection collname -start http://web.verity.com
-include '*search*'
This command fails because the starting point, http://web.verity.com, does not match the
-include option criteria, so Verity Spider never follows it. Instead, use the -indinclude option
to follow all documents (unless you have specified any of the exclude options) and index only
those documents that match your criteria. Replace the -include option with the -indinclude
option in the preceding example:
vspider -collection collname -start http://web.verity.com
-indinclude '*search*'
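The difference between the two options can be sketched as a toy crawl in Python. This is a simplified model, not Verity Spider's implementation: the link graph, the page URLs under the starting point, and the crawl function are all hypothetical, and Python's fnmatchcase stands in for the spider's own wildcard matching.

```python
from collections import deque
from fnmatch import fnmatchcase

START = "http://web.verity.com"

# Hypothetical link graph: each page maps to the pages it links to.
LINKS = {
    START: [START + "/products", START + "/search/help"],
    START + "/products": [START + "/products/search.html"],
    START + "/search/help": [],
    START + "/products/search.html": [],
}


def crawl(start, include=None, indinclude=None):
    """Return the URLs that would be indexed in this simplified model."""
    indexed, seen, queue = [], {start}, deque([start])
    while queue:
        url = queue.popleft()
        # -include model: a non-matching URL is neither followed nor indexed.
        if include is not None and not fnmatchcase(url, include):
            continue
        # -indinclude model: every URL is followed, only matches are indexed.
        if indinclude is None or fnmatchcase(url, indinclude):
            indexed.append(url)
        for link in LINKS.get(url, []):
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return indexed


print(crawl(START, include="*search*"))     # starting point rejected, nothing indexed
print(crawl(START, indinclude="*search*"))  # both "search" pages are reached and indexed
```

In this model the -include run stops at the starting point, while the -indinclude run passes through the non-matching pages and indexes only the two URLs that contain "search".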
Note: When specifying a URL, you must use full, absolute paths using the same format that appears
in the HTML hyperlink. If the link is relative, you must change it to absolute to use it with the
-indinclude option.
See also -regexp.