User guide
Chapter 5, Creating Clients, Projects, Custodians, and Jobs
5-44 Ipro eCapture User Guide www.iprotech.com
Q1 2014 877-324-4776
• Filter Binary Unicode - Use a text selection algorithm to filter text
from binary files. The algorithm scans for sequences of single-byte,
UTF8, or Unicode in the file. This option is recommended for forensic
searches, especially when files may contain text in languages other than
English.
• Filter Binary - Extract plain text items from the binary files.
• Index Binary - Index all of the contents of binary files as single-byte
text.
• Skip Binary - Do not index binary files.
Hyphens
These settings determine how hyphens will be treated during an EDD search.
• Hyphens as spaces - Treats hyphens found in the files as spaces. For
example, a search for “first-class” will match incidences of “first class” in
the files being searched.
• Hyphens as searchable - Searches hyphens. For example, a search for
“first-class” will match only incidences of “first-class” in the files being
searched.
• Ignore Hyphens - Ignores hyphens entered in the search criteria. For
example, a search for “first-class” will match incidences of “firstclass” in
the files being searched.
• Index all three ways - Searches for all three possible treatments of
hyphens to ensure that matches are found regardless of which of these
three ways the search criteria is entered.
Parent/Child Text Handling
These options are used to specify how text of parent and child documents
should be handled during indexing and are specific to emails (Lotus Notes and
Outlook) and any edocs (non-emails) that contain embedded documents.
• Index child text with parent text - merges and indexes the text
of a child document with that of its parent.
• Separate child and parent text - indexes the text of a child doc-
ument separately from its parent. The following string is added as
an include filter: *.MSG *.MSG>*.body *.EML *.EML>*.body. This
occurs while indexing. Two documents will be produced in the
index for .EML and .MSG files. One is for the body and the other is










