Hardware & Software Requirements
  • 19 Nov 2024
  • 10 Minutes to read

The goal of this document is to provide an overview of the minimum hardware and system requirements necessary to deploy a Reveal 11 solution.

This document is intended to be a relatively easy read, with any detailed technical specifications being listed in the specific Appendices or externally referenced documentation. Stated differently, this document will describe what needs to be deployed, along with how-many or how-large, but will not go into extensive technical details on any of the Reveal modules.

The following are suggested minimum hardware requirements; they can be expanded as your deployment scales.

| Quantity | Hardware | Notes |
|---|---|---|
| 1 | Kubernetes cluster | The cluster will require Windows and Linux nodes. Additional resources will be required as workloads grow and to make processes such as imaging, indexing, and AI builds execute faster or to keep up with increased usage. Kubernetes storage driver that can connect storage dynamically to pods. |
| 2 | MS SQL Server | One SQL server is used for Review, one SQL server is used for Processing/AI. Additional resources will be required as workloads grow and to make processes such as imaging, indexing, and AI builds execute faster or to keep up with increased usage. |
| 5 | Reveal processing agents | Additional resources will be required as workloads grow or to keep up with increased usage. |

Software Products

The combined solution consists of the following two modules, which work together to provide a cohesive eDiscovery solution:

  • Reveal 11 -

    Legal document Review solution to allow inspection and output of responsive documents with key capabilities of Assignments, Advanced Searching, Production, Redaction, etc., integrated with modern analytics designed to provide a tactical advantage for lawyers, investigators and knowledge workers who need insight within a communication-based repository. Incorporates data visualization and machine learning that augments the intelligence of subject matter experts, enabling them to dramatically increase the speed to valuable and actionable insights.

  • Reveal Processing -

    Enterprise-level rapid response solution that allows an organization to easily identify, reduce, analyze, and export collections of Electronically Stored Information (ESI) in hours, as opposed to the traditional Processing model, which can take days or potentially weeks. Supports over 100 languages and data export in any required format, including Australian protocols.

Components

Before diving into architectural recommendations, a high-level review of the various components involved in the Reveal solution may be helpful.

| Component | Usage Description |
|---|---|
| Reveal Database Server | Microsoft SQL Server used as the RDBMS repository for Windows-based Reveal modules. Needs to be MSSQL 2016 SP2 or later, either Standard or Enterprise Edition. |
| Reveal Web Server | Microsoft IIS Web Server used by the Reveal Review and Reveal AI modules. Must be version IIS V10 or later. |
| Network File Share | Standard Windows (SMB-based) file share used by all three Windows-based Reveal modules. Can be an existing file share within the environment or a new file share. Synergies exist for Reveal Processing and Reveal Review using the same network share, but the Reveal AI network share can be separated from the other Reveal share if desired. |
| Processing Discovery Manager | Thick client used by Reveal Processing to create Projects, Import/Export Data, and centrally manage Reveal Processing Agents. |
| Processing Discovery Agents | Background workers used by Reveal Processing to perform batch related activities such as Import, Export, OCR, etc. This component requires the use of Microsoft Outlook 64-bit. |
| Review Manager (aka OpsCenter) | Thick client used by Reveal Review to create Projects, Load Data, Index Data, and perform other administrative duties. |
| Review Application Services | Background Windows services used by Reveal Review for performing a variety of batch related activities such as Assignments, Bulk Tagging, Production, Production Export, etc. |
| Elasticsearch Nodes | Background Elasticsearch services used by Reveal Review and Processing for supporting searches, but also for storing the content of documents. Review 10.3 uses Elasticsearch 6.x; Review 10.4 uses Elasticsearch 7.x. |
| Reveal AI Service | Background Windows service used by Reveal AI for Modeling, Job Scheduling and Vector creation activities. |
| Reveal AI Processing / POS Tagger Services | Background Windows services used by Reveal AI for ingesting new data into the Reveal AI system. |
| Brainspace Application | Linux-based web server used by Brainspace solution for web end-users and API calls. Must be Linux version 7.6 or later. |
| Brainspace Analytics | Linux-based server used by Brainspace solution for ingesting new documents. Must be Linux version 7.6 or later. |
| Brainspace On-Demand Analytics | Linux-based server used by Brainspace solution for on-going (after ingestion) analytics activities. Must be Linux version 7.6 or later. |
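
The minimum versions listed for the components above can be checked against an inventory of installed versions. The following is a minimal sketch, not a Reveal tool: the `MINIMUM_VERSIONS` table, the tuple encoding of versions, and the function names are all assumptions made for illustration.

```python
# Minimum versions as stated in the component descriptions above.
# The tuple encoding is an assumption made for this sketch:
#   mssql -> (release year, service pack)
#   iis   -> (major version,)
#   linux -> (major, minor)
MINIMUM_VERSIONS = {
    "mssql": (2016, 2),  # MSSQL 2016 SP2 or later
    "iis": (10,),        # IIS V10 or later
    "linux": (7, 6),     # Linux version 7.6 or later
}


def meets_minimum(component: str, installed: tuple) -> bool:
    """Return True if the installed version is at or above the documented minimum."""
    return tuple(installed) >= MINIMUM_VERSIONS[component]


def check_inventory(inventory: dict) -> list:
    """Return the sorted names of components that fall below the minimum."""
    return sorted(
        name for name, version in inventory.items()
        if not meets_minimum(name, version)
    )


if __name__ == "__main__":
    # Hypothetical inventory: SQL Server 2016 SP1 is below the SP2 minimum.
    inventory = {"mssql": (2016, 1), "iis": (10,), "linux": (8, 4)}
    print(check_inventory(inventory))
```

Python compares tuples element by element, so a later release year such as `(2019, 0)` passes the MSSQL check even though it has no service pack.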

On-Premise Deployment - Medium Solution

In addition to offering software as a service (SaaS), Reveal supports customer cloud and on-premise installations. A cloud-based implementation provides highly scalable access to Reveal’s AWS cloud infrastructure and services. An on-prem (behind the firewall) implementation gives customers full control of the data infrastructure.

To assist in provisioning cloud or on-prem Reveal installations, the following are suggested minimum hardware requirements; they can be expanded as your deployment scales.

| Quantity | Server | CPU | Memory | Storage | OS Type |
|---|---|---|---|---|---|
| 2 | MS SQL | 16 | 128 | 2.5 TB | Windows |
| 5 | Reveal Processing Agents | 8 | 16 | 150 GB | Windows |
| 1 | Reveal AI K8s Node | 32 | 256 | 120 GB | Windows |
| 4 | Reveal Review Windows K8s Nodes | 8 | 64 | 120 GB | Windows |
| 4 | Reveal Review Linux K8s Nodes | 16 | 128 | 100 GB | Linux |
| 1 | Reveal Utilities K8s Node | 8 | 64 | 100 GB | Linux |
| 3 | Brainspace Servers K8s Nodes | 16 | 128 | 100 GB | Linux |
| 6 | Elasticsearch Data K8s Nodes | 4 | 32 | 1 TB | Linux |
| 1 | Elasticsearch Coordinator K8s Node | 4 | 32 | 100 GB | Linux |
| 2 | Reveal Management Load Machines | 4 | 32 | 100 GB | Windows |
| 1 | File Server | | | 25 TB | |
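
For capacity planning, it can help to total the per-server figures above across their quantities. The sketch below does that arithmetic under stated assumptions: memory values are taken to be in GB, 1 TB is treated as 1000 GB, and the File Server row is assumed to specify storage only. The `ROWS` list and `totals` function are illustrative names, not part of any Reveal tooling.

```python
# Rows copied from the table above:
# (quantity, server, CPU, memory, storage in GB).
# Assumptions: memory is in GB, 1 TB = 1000 GB, and the File Server
# row specifies storage only (CPU and memory entered as 0).
ROWS = [
    (2, "MS SQL", 16, 128, 2500),
    (5, "Reveal Processing Agents", 8, 16, 150),
    (1, "Reveal AI K8s Node", 32, 256, 120),
    (4, "Reveal Review Windows K8s Nodes", 8, 64, 120),
    (4, "Reveal Review Linux K8s Nodes", 16, 128, 100),
    (1, "Reveal Utilities K8s Node", 8, 64, 100),
    (3, "Brainspace Servers K8s Nodes", 16, 128, 100),
    (6, "Elasticsearch Data K8s Nodes", 4, 32, 1000),
    (1, "Elasticsearch Coordinator K8s Node", 4, 32, 100),
    (2, "Reveal Management Load Machines", 4, 32, 100),
    (1, "File Server", 0, 0, 25000),
]


def totals(rows):
    """Return (total CPU, total memory, total storage in GB) across all rows."""
    cpu = sum(qty * c for qty, _, c, _, _ in rows)
    memory = sum(qty * m for qty, _, _, m, _ in rows)
    storage_gb = sum(qty * s for qty, _, _, _, s in rows)
    return cpu, memory, storage_gb


if __name__ == "__main__":
    cpu, memory, storage_gb = totals(ROWS)
    print(f"CPU: {cpu}, memory: {memory} GB, storage: {storage_gb} GB")
```

Under these assumptions the medium solution totals 292 CPUs, 2096 GB of memory, and 38450 GB (about 38.5 TB) of storage.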


Other deployment considerations, regardless of size:

  • Reveal will provide Reveal software and licensing. Client is responsible for Microsoft and other environmental licensing.

  • Reveal can assist with remote deployment if remote control is provided through Zoom or other screen sharing technology.

  • The following is additional software to be installed to support the Reveal Platform:

    • Microsoft .NET Framework 4.7.1

    • Office 2016 64-bit with Outlook default mail profile

    • Lotus Notes 9.0.1 or later with default mail profile

    • SQL Server Management Studio Express (Optional)

    • Security – Antivirus

  • Apply your corporate policy to create backups for files, VMs, and SQL.

  • If Dell hardware is used, Reveal recommends purchasing Dell Pro Support to assist with any hardware issues.

As an alternative to self-provisioning a mobile solution, Reveal IT Partner George Jon (GJ) offers an appliance, or Kit, solution that can be shipped and deployed onsite. White Glove Service is an optional feature for delivery and installation. This service involves a GJ Engineer personally delivering the Kit to the client site for installation, testing, and admin/backend training anywhere in the world. The cost for this service is $2,500 USD / day plus travel expenses for the Single Server.

If a Kit requires shipping outside of the United States, GJ will need to know both the country where the device is purchased and final destination for shipping/tax purposes and to provide an accurate delivery timeline.

Reveal Cloud AI (Optional)

The Cloud AI platform of Reveal is essentially a middleware layer that handles communication with AWS for cloud analytic functions. At the time of writing, only integration with AWS analytics is supported.

| Qty | OS | Purpose | CPU | RAM | Storage | Additional Comments |
|---|---|---|---|---|---|---|
| 1 | Linux 7.6 or later | Orchestrator, Transcription, Archive Wrk | 8 | 32GB | 2TB | This server should have as much free space as the largest image labeling plus the largest transcription job you intend to run times 2. |
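
The free-space guidance above can be turned into a quick calculation. The sketch below assumes one reading of that sentence, namely that the sum of the two largest jobs is doubled; the function names and the example job sizes are hypothetical.

```python
def required_free_space_gb(largest_image_labeling_gb: float,
                           largest_transcription_gb: float) -> float:
    """Free space suggested for the Cloud AI server, in GB.

    Assumed interpretation of the guidance:
    (largest image labeling job + largest transcription job) * 2.
    """
    return 2 * (largest_image_labeling_gb + largest_transcription_gb)


def fits(storage_gb: float,
         largest_image_labeling_gb: float,
         largest_transcription_gb: float) -> bool:
    """Return True if the given storage covers the suggested free space."""
    needed = required_free_space_gb(largest_image_labeling_gb,
                                    largest_transcription_gb)
    return storage_gb >= needed


if __name__ == "__main__":
    # Hypothetical job sizes: 300 GB of images, 200 GB of audio.
    print(required_free_space_gb(300, 200))  # 2 * (300 + 200)
    # 2TB from the table, treated here as 2000 GB.
    print(fits(2000, 300, 200))
```

If the intended reading is instead that only the transcription job is doubled, the requirement would be smaller, so this version errs on the side of more free space.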

  • An AWS account should be provisioned.

  • Reveal AWS includes the following additional AI-powered services:

    • Image Labeling (Amazon Rekognition)

    • Translation (Amazon Translate)

    • Transcription (Amazon Transcribe)

Important

You must verify that these services are available within your AWS region. To do so, see the AWS Services Availability page.

Frequently Asked Questions

  1. Question: Are these the only options available?

    Answer: No, not at all. The architectures described in this document illustrate minimum guidelines for likely deployment scenarios, but discussions with Reveal technicians for what-if scenarios are definitely encouraged as every organization's eDiscovery needs will vary greatly based on document complexity, number of graphics versus text in various documents, and the desired workflow within your eDiscovery operations.

  2. Question: Can virtual or physical systems be used?

    Answer: Yes, either can be used. Because eDiscovery usage is rarely static, deployment onto virtual infrastructure is strongly recommended, both for short-term spikes and for ongoing growth in usage.

  3. Question: Some scenarios list throughput estimates. Are those guaranteed results?

    Answer: Unfortunately, no. While those estimates are derived based on real-world usage of the Reveal-Brainspace solutions, numerous factors exist (document complexity, storage speed, etc.) which will impact overall throughput. Also, timeframe expectations vary between organizations. In most scenarios, throughput can be increased with either more robust systems or greater quantities of some components. Please feel free to discuss possible areas to improve throughput for certain activities with Reveal-Brainspace technicians during initial planning or deployment as well as over the long-run.

  4. Question: Is Microsoft Outlook required on each Reveal Processing Agent?

    Answer: Yes, as of now, this is a technical requirement.

  5. Question: Is Microsoft Office required on each Reveal Processing Discovery Manager system?

    Answer: No, this is not a technical requirement, but many users find that having MSOffice installed on these systems can assist with QC and troubleshooting activities.

  6. Question: Do all components involved with the Reveal solution have to remain online 24x7?

    Answer: No, in many scenarios, idle systems can be taken offline when not actively in use. An example of this would include Processing Agents when no Processing is actively being performed.

  7. Question: Is Elasticsearch only used for Review searching?

    Answer: No, the Reveal Review product stores the actual text from documents in Elasticsearch. Elasticsearch needs to be online for almost all Reviewing activities.

  8. Question: What integrations exist between the Reveal products?

    Answer: Reveal Processing can export directly to Reveal Review, although the "Uploader" feature in the Reveal Review web front-end is currently available only in the Reveal SaaS offering.

  9. Question: Both the Processing Agent and the Elasticsearch components stress the use of high-speed storage. Is this required?

    Answer: No, but it is strongly recommended. Both of these components are very I/O intensive, so having local, high-speed storage can greatly reduce the duration of many activities.

  10. Question: Some of the resource requirements are significant; can we start smaller than what is listed and expand later as activity increases?

    Answer: Yes, but keep in mind that adjustments in allocated resources may not be deemed necessary until in-progress activities are underway, often with a pending deadline, and adjustments to some resources (typically CPU and Memory) may require an outage which can be difficult to schedule if large batch jobs are executing. The minimum resource allocations described in this document target eventual usage scenarios, but feel free to start smaller, and adjust upward when necessary, always keeping a close eye on resource consumption and live throughput.

  11. Question: The documented Reveal solution appears to be heavily reliant upon Microsoft products; is this a requirement?

    Answer: Yes. While some components such as Elasticsearch and the SMB Network Share can be deployed on other Operating Systems, the basis of the Reveal solution makes specific use of functionality within the Microsoft stack: IIS, MSSQL and .NET/C#.

  12. Question: The documented Brainspace solution appears to be heavily reliant upon Linux operating systems; is this a requirement?

    Answer: Yes. The Brainspace solution is deployed solely on Linux systems.

  13. Question: Can SAN or NAS storage technologies be utilized in the solution?

    Answer: Yes, these are very common. Just keep in mind that during high-load scenarios (Importing, Indexing, Production, etc.), the speed of storage will be a critical factor in the duration of those activities.

  14. Question: Why are so many Reveal Processing Agents recommended?

    Answer: Currently, the Reveal Processing Agent software can perform various tasks, but not at the same time. Multiple Processing Agents help ensure that no activities will be queued waiting for another activity to complete.

  15. Question: Storage requirements for the MSSQL systems are high. Is this required?

    Answer: Best practice is to use the full recovery model in the MSSQL environment. Storage allocations can be trimmed slightly if using the simple recovery model.

  16. Question: Do the Discovery Manager and Review Manager thick clients have to reside on their own systems?

    Answer: No. Typically, customers will deploy Discovery Manager on its own System, with Review Manager either being deployed on the same system as Discovery Manager or one of the Review Application Servers.

  17. Question: How should backups be configured for Reveal Solutions?

    Answer: Best practice is to use a different backup strategy for each functional area. For example, SQL data should be in Full-Recovery mode and use native SQL backups, while Elasticsearch should use its built-in backup/restore functionality through Curator. Brainspace, file servers, and virtual machines would use a virtualization-aware backup product.

  18. Question: How many external IP addresses are necessary if we need external reviewers?

    Answer: If using a proxy or load balancer, one IP address would be necessary. However, additional IP addresses may be necessary for multiple web servers or for publishing multiple services such as Review, Brainspace, and Keycloak authentication services.
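
As an illustration of the answer to question 14, the effect of agent count on queueing can be estimated with a small simulation. This is a simplified sketch, not a model of the actual Reveal Processing scheduler: it assumes each agent runs one task at a time, tasks are taken in order by whichever agent frees up first, and the task durations are hypothetical.

```python
import heapq


def finish_time(task_hours, agent_count):
    """Hours until all tasks finish when each agent runs one task at a time.

    Tasks are assigned in list order to the agent that becomes free first.
    """
    if agent_count < 1:
        raise ValueError("at least one agent is required")
    # Min-heap of the times at which each agent next becomes free.
    free_at = [0.0] * agent_count
    heapq.heapify(free_at)
    finish = 0.0
    for hours in task_hours:
        start = heapq.heappop(free_at)  # earliest available agent
        end = start + hours
        finish = max(finish, end)
        heapq.heappush(free_at, end)
    return finish


if __name__ == "__main__":
    # Hypothetical durations in hours for five queued jobs
    # (for example imports, exports, and OCR runs).
    tasks = [4, 3, 2, 2, 1]
    for agents in (1, 2, 5):
        print(agents, "agent(s):", finish_time(tasks, agents), "hours")
```

With these example durations, one agent needs 12 hours, two agents need 6 hours, and five agents finish in 4 hours because no task waits in a queue.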

