System information
158
CONFIGURING AND ADMINISTERING COLDFUSION 9
Using Verity Utilities
Last updated 2/21/2012
Purging a collection in the background
The following command purges the specified collection in the background:
mkvdk -purge -purgeback -collection path
Specifying persistent service
The following command runs the mkvdk command as a persistent process, so that servicing is performed repeatedly
after
num idle seconds:
mkvdk -persist -sleeptime num -collection path
Deleting a collection
The -purge option deletes all documents in a collection, but does not delete the collection itself. To delete a collection,
use operating system commands, such as the
rm command on UNIX, to remove the collection directory structure and
control files.
Optimization keywords
The following table describes the optimization keywords for the -optimize option:
Squeezing deleted documents
When a document is deleted from a collection, its space is not recovered. It is merely marked as deleted and not
available for subsequent searches. Squeezing actually removes deleted documents from the collection’s internal
documents table and word indexes, thus creating a smaller collection and reducing the collection’s disk space. A
smaller collection has a more efficient structure that makes searching slightly faster and uses slightly less memory.
Keyword Description
maxclean Performs the most comprehensive housekeeping possible, and removes out-of-date collection files. Adobe
recommends this optimization only when you are preparing an isolated collection for publication. When using
this type, if the collection is being searched, files sometimes get deleted too early, which can affect search results.
maxmerge Performs maximal merging on the partitions to create partitions that are as large as possible. This creates
partitions that can have up to 64000 documents in them.
readonly Marks the collection as read-only and unchanged after the function call is done. This is appropriate for CD-ROM
collections.
spanword Creates a spanning word list across all the collection’s partitions. A collection consists of numerous smaller units,
called partitions, each of which includes a word list. Optionally, a spanning word list can be built with an ngram
index.
ngramindex Builds an ngram index for the collection. An ngram index is designed to improve the search performance for
queries with the <TYPO> and <WILDCARD> operators. An ngram index cannot be built without a spanning word
list. You can build a spanning word list and ngram index in the same command, for example:
mkvdk -collection collname -optimize spanword -ngramindex
squeeze Squeezes deleted documents from the collection. Squeezing deleted documents recovers space in a collection,
and improves search performance. (For more information about squeeze, see
“Squeezing deleted documents” on
page 158.) Using this option invalidates the search results.
vdbopt Configures the collection’s Verity databases (VDBs). Each collection consists of smaller units called VDBs. This
keyword has the effect of linearizing the data in a VDB, and making the collection metadata contained in the VDB
more streamlined. It also lets the VDB grow to a much larger size.
tuneup Performs the same as combining the maxmerge, vdbopt, and spanword keywords.
publish Performs the same as all of the optimization types combined. Use this keyword to optimize the collection for the
best possible retrieval performance, such as for publication to a network on a server or on a CD-ROM.