User Guide
166 Chapter 8 Verity Spider
-pathlen
Syntax: -pathlen num_pathsegments
Limits indexing to the specified number of path segments in the URL or file system
path. The path length is determined as follows:
The host name and drive letter are not included. For example, neither
www.spider.com:80/ nor C:\ would be included in determining the path length.
All elements following the host name are included.
The actual filename, if present, is included. For example, /world.html would be
included in determining the path length.
Any directory paths between the host and the actual filename are included.
Example
For the following URL, the path length would be 4:
http://www.spider:80/comics/fun/funny/world.html
<-1-> <2> <-3-> <---4--->
For the following file system path, the path length would be 3:
C:\files\docs\datasheets
<-1-> <-2-> <---3--->
The default value is 100 path segments.
-refreshtime
Syntax: -refreshtime timeunits
Specifies that any documents which have been indexed since the timeunits value
began are not to be refreshed.
The syntax for timeunits is:
n day n hour n min n sec
Where n is a positive integer. Note that there must be spaces, and since the first three
letters of each time unit is parsed, you can use the singular or plural form.
If you specify:
-refreshtime 1 day 6 hours
Only those documents which were last indexed at least 30 hours and 1 second ago,
will be refreshed.
Note
This option is valid only with the -refresh option. When you use vsdb -recreate, the
last indexed date is cleared.