Introduction
In e-Discovery, excluding duplicates and non-inclusive emails from review populations is often essential to streamline the review process. The following saved search demonstrates how to identify a minimal set of unique documents using Reveal's front-end interface.
The Minimum Unique Set Saved Search
Data ingestion in Reveal involves both de-duplication and email threading processes. Results from email threading and loose document de-duplication are stored in separate fields. To identify the minimum set of documents, it is necessary to combine the information from both fields.
Combining Search Fields
The Minimum Unique Set search incorporates the following conditions:
1. BD EMT is Unique: true
Setting this field to "true" retrieves only "inclusive" emails and one copy of each unique attachment within the email thread. Note that if the same attachment appears in multiple email threads, it is returned once for each thread under this condition.
2. BD EMT ThreadID is empty AND (BD ExactDupStatus is Unique OR BD ExactDupStatus is Pivot)
Combining these two conditions retrieves one unique copy of each loose document. The condition "BD EMT ThreadID is empty" restricts this part of the search to loose documents, that is, documents that do not belong to any email thread.
3. Connect 1 and 2 above using the "OR" operator
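The combined logic can also be expressed as a single boolean test, which may help when checking the saved search against an exported document list. The Python sketch below is an illustration only and does not use any Reveal API. The dictionary keys (emt_is_unique, emt_thread_id, exact_dup_status), the document IDs, and the "Duplicate" status value are hypothetical stand-ins for the fields described above.

```python
from typing import Dict, List, Optional, Union

# Hypothetical record shape: keys are illustrative stand-ins for the
# Reveal fields "BD EMT is Unique", "BD EMT ThreadID" and "BD ExactDupStatus".
Doc = Dict[str, Union[str, bool, Optional[str]]]


def in_minimum_unique_set(doc: Doc) -> bool:
    """Return True if the document satisfies condition 1 OR condition 2."""
    # Condition 1: inclusive email or unique attachment within a thread.
    condition_1 = doc["emt_is_unique"] is True

    # Condition 2: loose document (no thread ID) that is Unique or Pivot.
    is_loose = not doc["emt_thread_id"]  # treats None and "" as empty
    condition_2 = is_loose and doc["exact_dup_status"] in {"Unique", "Pivot"}

    return condition_1 or condition_2


def minimum_unique_set(docs: List[Doc]) -> List[str]:
    """Return the IDs of the documents that belong to the minimum unique set."""
    return [str(d["doc_id"]) for d in docs if in_minimum_unique_set(d)]


# Made-up sample data; the "Duplicate" status value is an assumption.
sample_docs: List[Doc] = [
    {"doc_id": "E1", "emt_is_unique": True, "emt_thread_id": "T1", "exact_dup_status": "Unique"},      # inclusive email
    {"doc_id": "E2", "emt_is_unique": False, "emt_thread_id": "T1", "exact_dup_status": "Unique"},     # non-inclusive email
    {"doc_id": "A1", "emt_is_unique": True, "emt_thread_id": "T1", "exact_dup_status": "Duplicate"},   # unique attachment in thread
    {"doc_id": "L1", "emt_is_unique": False, "emt_thread_id": None, "exact_dup_status": "Pivot"},      # loose, pivot copy
    {"doc_id": "L2", "emt_is_unique": False, "emt_thread_id": None, "exact_dup_status": "Duplicate"},  # loose, duplicate copy
    {"doc_id": "L3", "emt_is_unique": False, "emt_thread_id": "", "exact_dup_status": "Unique"},       # loose, no duplicates
]

print(minimum_unique_set(sample_docs))  # ['E1', 'A1', 'L1', 'L3']
```

In this sample, E2 is dropped because it belongs to a thread but is not marked unique, and L2 is dropped because it is a loose duplicate. Without the "thread ID is empty" test in condition 2, E2 would be returned through its duplicate status even though it is a non-inclusive email.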
The following screenshot shows a saved search after combining all the conditions above: