User guide
Creating a Discovery Job
www.iprotech.com Ipro eCapture User Guide 5-37
877-324-4776 Q1 2014
If a node-level error on the PST is requeued after the Discovery Job is
complete, the source PST is copied again. The working copy is made
again in this instance only if the option is selected.
E-mail De-duplication
Method of gathering and creating the MD5Hash has changed for newly created
Projects. Hashing of e-mails now uses the GMT time to ensure proper de-dupli-
cation across time zones.
In most cases, MD5 hash values are calculated on the file itself. For more reli-
able de-duplication of emails though, it is required that de-duplication occur on
the information contained within it and not the file itself. There are many rea-
sons for this; the simplest is the fact that when an email is saved out of its
container (PST, NSF, etc) the file that is created contains information that
would change the hash value of the same email each time that the email was
saved out.
When an email is discovered within Ipro eCapture, it is assigned a hash value
based on fields chosen by the user. The values of these fields are concatenated
and the text is hashed. Select from the following email fields to generate the
hash value:
•Subject
•From/Author
• Attachment Count
• Body: From the Body Whitespace drop-down list, select either Include
(default) or Remove. Whitespace in the e-mail body could cause slight
differences between the same e-mails, which could result in different
hashes being generated. Remove - removes all whitespace between lines
of text in the e-mail body prior to hashing. Include - keeps the
whitespace.
• E-mail Date: The following message types use the specified date values:
Outlook: Sent Date, Lotus Notes: Posted Date, RFC822: Date, and
GroupWise: Delivered Date. See the section How Ipro eCapture Handles
Dates - Time Zones on page 5-20 for additional information.
•Attachment Names
•Recipients
•CC










