User Guide

168 Chapter 8 Verity Spider
Content Options
-casesen
Details Makes processing case-sensitive by specifying that the spider process
separately keys that differ only in case. Use only for indexing UNIX servers.
-exclude
Syntax: -exclude exp_1 [exp_n] ...
Files, paths and URLs matching the specified expression(s) will not be followed. If
you use backslashes, you must double them so they are properly escaped. For
example:
C:\\test\\docs\\path
You can use wildcard expressions, where the asterisk ( * ) is for text strings and the
question mark ( ? ) is for single characters. For example:
’/my_doc*/year199?’
On Windows NT, you should include double quotes around the argument to protect
the special characters such as (*). On UNIX, you should use single quotes. Note that
this is only required when you run the indexing job from a command line. Quotes are
not necessary within a command file (
-cmdfile).
To use regular expressions, also specify the
-regexp option.
To specify a file, path or URL which you want followed but not indexed, use
-indexclude. For document types, use -mimeexclude instead. For example, specify
-mimeexclude application/pdf rather than
-exclude *.pdf.
Note
When specifying an URL, you must use full, absolute paths using the same format as
appears in the HTML hyperlink. If the link is relative, you must change it to absolute
to use it with -exclude.
See also -regexp.
-include
Only those files, paths and URLs which match the specified expression or
expressions will be followed. If you use backslashes, you must double them so they
are properly escaped. For example:
C:\\test\\docs\\path
You can use wildcard expressions, where the asterisk ( * ) is for text strings and the
question mark ( ? ) is for single characters. For example:
’/my_doc*/year199?’