Find Minimum Unique Set

Introduction

In e-Discovery, excluding duplicates and non-inclusive emails from review populations is often essential to streamline the review process. The following saved search demonstrates how to identify a minimal set of unique documents using Reveal's front-end interface.

The Minimum Unique Set Saved Search

Data ingestion in Reveal involves both de-duplication and email threading processes. Results from email threading and loose document de-duplication are stored in separate fields. To identify the minimum set of documents, it is necessary to combine the information from both fields.

Combining Search Fields

The Minimum Unique Set search incorporates the following conditions:

  1. BD EMT is Unique: true

    Setting this field to "true" retrieves only "inclusive" emails and one copy of each unique attachment within each email thread. Note that if the same attachment appears in multiple email threads, this condition returns it once for each thread.

  2. BD EMT ThreadID is empty AND BD ExactDupStatus is either Unique or Pivot

    Combining these two conditions retrieves one unique copy of each loose document. The condition "BD EMT ThreadID is empty" restricts this part of the search to loose documents.

  3. Connect conditions 1 and 2 above using the "OR" operator
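
The combined logic of the three steps above can be sketched in Python as a boolean filter. This is only an illustration of how the conditions nest, not Reveal's search syntax or API: the dictionary layout, the "id" key, the in_minimum_unique_set helper, and the "Duplicate" status value are assumptions made for the example. Only the three field names and the "Unique" and "Pivot" values come from the conditions listed above.

```python
# Illustrative sketch only: documents are modeled as plain dicts.
# The dict layout, the "id" key, and the "Duplicate" status value are
# assumptions for this example, not Reveal's export format.

def in_minimum_unique_set(doc):
    """Return True if doc satisfies (condition 1) OR (condition 2)."""
    # Condition 1: BD EMT is Unique: true
    thread_unique = doc.get("BD EMT is Unique") is True

    # Condition 2: BD EMT ThreadID is empty AND
    #              BD ExactDupStatus is Unique or Pivot
    is_loose = not doc.get("BD EMT ThreadID")  # empty string or missing
    exact_unique = doc.get("BD ExactDupStatus") in ("Unique", "Pivot")

    # Step 3: connect the two conditions with OR
    return thread_unique or (is_loose and exact_unique)


docs = [
    # Inclusive email in a thread: kept by condition 1
    {"id": 1, "BD EMT is Unique": True, "BD EMT ThreadID": "T1",
     "BD ExactDupStatus": "Unique"},
    # Non-inclusive email in a thread: fails both conditions
    {"id": 2, "BD EMT is Unique": False, "BD EMT ThreadID": "T1",
     "BD ExactDupStatus": "Unique"},
    # Loose document marked Pivot: kept by condition 2
    {"id": 3, "BD EMT is Unique": False, "BD EMT ThreadID": "",
     "BD ExactDupStatus": "Pivot"},
    # Loose duplicate (hypothetical status value): fails both conditions
    {"id": 4, "BD EMT is Unique": False, "BD EMT ThreadID": "",
     "BD ExactDupStatus": "Duplicate"},
    # Loose document with no thread ID at all: kept by condition 2
    {"id": 5, "BD EMT is Unique": False, "BD EMT ThreadID": None,
     "BD ExactDupStatus": "Unique"},
]

minimum_unique_set = [d["id"] for d in docs if in_minimum_unique_set(d)]
print(minimum_unique_set)
```

Note that the parentheses around condition 2 matter: the "AND" must be evaluated before the "OR". Otherwise a threaded, non-inclusive email whose exact-duplicate status is Unique (document 2 in the sketch) could be pulled into the result.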

The following screenshot shows a saved search after combining all the conditions above: