User Guide
Verity Spider Syntax 149
Using a command file
If you want simpler reuse and archiving of your indexing commands, you should take
advantage of the abstraction offered by the
-cmdfile option. By using an ASCII text
file to store a task’s options, you also avoid the pitfall of using special characters in an
option’s parameter value.
For example, the
-processbif option requires the use of "!*" and therefore any task
using that option must also use the
-cmdfile option.
Command-line option reference
The following sections describe the Verity Spider V3.7 options. Note that option
names are case-sensitive.
-start
A starting point for an indexing job. You can specify multiple instances, or use
multiple values in a single instance.
When you execute an indexing job from a command-line and you do not use a
command file (with -cmdfile), you must URL-escape any special characters in the
starting point. To URL-escape a special character, use
"%hex-ASCII-character-number" in place of the character. For example, you would
use /time%26/ instead of /time&/. This allows the operating system to properly
process the command string.
In the event an indexing task halts, you can re-run the task as-is. The persistent store
for the specified collection is read and only those candidate URLs that are in the
queue but not yet processed are parsed. Candidate URLs correspond to URLs of the
following status as reported by vsdb:
cand, used, inse, upda, dele, fail
.
Note
By using -start with -refresh, you provide a starting point for Verity Spider and
therefore do not need to use at least one of -host, -domain, -nofollow or
-unlimited
For this repository type... The starting point is...
Web The URL or URLs from which the Verity Spider is to
begin indexing. Use other options such as -jumps to
control how far from the starting point Verity Spider
goes.
File system The starting directory or directories in which
the Verity Spider will start indexing. All subdirectories
beneath the starting point will be indexed unless you
use -pathlen, or any of the inclusion or exclusion
criteria.