- 19 Nov 2024
- 10 Minutes to read
- Print
- DarkLight
- PDF
Hardware & Software Requirements
- Updated on 19 Nov 2024
- 10 Minutes to read
- Print
- DarkLight
- PDF
The goal of this document is to provide an overview of the minimum hardware and system requirements necessary to deploy a Reveal 11 solution.
This document is intended to be a relatively easy read, with any detailed technical specifications being listed in the specific Appendices or externally referenced documentation. Stated differently, this document will describe what needs to be deployed, along with how-many or how-large, but will not go into extensive technical details on any of the Reveal modules.
The following are suggested minimum hardware requirements, and can grow with you as you scale.
Quantity | Hardware | Notes |
1 | Kubernetes cluster | The cluster will require Windows and Linux nodes. |
2 | MS SQL Server | One SQL server is used for Review, one SQL server is used for Processing/AI. |
5 | Reveal processing agents | Additional resources will be required as workloads grow or to keep up with increased usage. |
Software Products
The combined solution is comprised of the following two modules which work together to provide a cohesive eDiscovery solution:
Reveal 11 -
Legal document Review solution to allow inspection and output of responsive documents with key capabilities of Assignments, Advanced Searching, Production, Redaction, etc., integrated with modern analytics designed to provide a tactical advantage for lawyers, investigators and knowledge workers who need insight within a communication-based repository. Incorporates data visualization and machine learning that augments the intelligence of subject matter experts, enabling them to dramatically increase the speed to valuable and actionable insights.
Reveal Processing -
Enterprise-level rapid response solution, which allows an organization to easily identify, reduce, analyze, and export collections of Electronically Stored Information (ESI) in hours as opposed to the traditional Processing model which can be days or potentially weeks. Supports over 100 languages and data export in any required format, including Australian protocols.
Components
Before diving into architectural recommendations, a high-level review of the various components involved in the Reveal solution may be helpful.
Component | Usage Description |
---|---|
Reveal Database Server | Microsoft SQL Server used as the RDBMS repository for Windows-based Reveal modules. Needs to be MSSQL 2016 SP2 or later, either Standard or Enterprise Edition |
Reveal Web Server | Microsoft IIS Web Server used by the Reveal Review and Reveal AI modules. Must be version IIS V10 or later |
Network File Share | Standard Windows (SMB-based) file share used by all three Windows-based Reveal modules. Can be an existing file share within the environment or a new file share. Synergies exist for Reveal Processing and Reveal Review using the same network share, but the Reveal AI network share can be separated from the other Reveal share if desired. |
Processing Discovery Manager | Thick client used by Reveal Processing to create Projects, Import/Export Data, and centrally manage Reveal Processing Agents. |
Processing Discovery Agents | Background workers used by Reveal Processing to perform batch related activities such as Import, Export, OCR, etc. This component requires the use of Microsoft Outlook 64-bit. |
Review Manager (aka OpsCenter) | Thick client used by Reveal Review to create Projects, Load Data, Index Data, and perform other administrative duties. |
Review Application Services | Background Windows services used by Reveal Review for performing a variety of batch related activities such as Assignments, Bulk Tagging, Production, Production Export, etc. |
Elasticsearch Nodes | Background Elasticsearch services used by Reveal Review and Processing for supporting searches, but also for storing the content of documents. Review 10.3 uses Elasticsearch 6.x; Review 10.4 uses Elasticsearch 7.x. |
Reveal AI Service | Background Windows service used by Reveal AI for Modeling, Job Scheduling and Vector creation activities. |
Reveal AI Processing / POS Tagger Services | Background Windows services used by Reveal AI for ingesting new data into the Reveal AI system. |
Brainspace Application | Linux-based web server used by Brainspace solution for web end-users and API calls. Must be Linux version 7.6 or later. |
Brainspace Analytics | Linux-based server used by Brainspace solution for ingesting new documents. Must be Linux version 7.6 or later. |
Brainspace On-Demand Analytics | Linux-based server used by Brainspace solution for on-going (after ingestion) analytics activities. Must be Linux version 7.6 or later. |
On-Premise Deployment - Medium Solution
In addition to offering software as a service (Saas), Reveal supports customer cloud and on-premise installations. Cloud-based implementation facilitates high scalable access to Reveal’s AWS cloud infrastructure and services. On-prem (behind the firewall) implementation affords customers full control of the data infrastructure.
To assist in provisioning cloud or on-prem Reveal installations, the following are suggested minimum hardware requirements, and can grow with you as you scale.
Quantity | Server | CPU | Memory | Storage | OS Type |
2 | MS SQL | 16 | 128 | 2.5 Tb | Windows |
5 | Reveal Processing Agents | 8 | 16 | 150 Gb | Windows |
1 | Reveal AI K8s Node | 32 | 256 | 120 Gb | Windows |
4 | Reveal Review Windows K8s Nodes | 8 | 64 | 120Gb | Windows |
4 | Reveal Review Linux K8s Nodes | 16 | 128 | 100 Gb | Linux |
1 | Reveal Utilities K8s Node | 8 | 64 | 100 Gb | Linux |
3 | Brainspace Servers K8s Nodes | 16 | 128 | 100 Gb | Linux |
6 | Elasticsearch Data K8s Nodes | 4 | 32 | 1 Tb | Linux |
1 | Elasticsearch Coordinator K8s Node | 4 | 32 | 100 Gb | Linux |
2 | Reveal Management Load Machines | 4 | 32 | 100 Gb | Windows |
1 | File Server | 25Tb |
Other Deployment considerations regardless of size:
Reveal will provide Reveal software and licensing. Client is responsible for Microsoft and other environmental licensing.
Reveal can assist with remote deployment if remote control is provided through Zoom or other screen sharing technology.
The following is additional software to be installed to support the Reveal Platform:
Microsoft .NET Framework 4.7.1
Office 2016 64-bit with Outlook default mail profile
Lotus Notes 9.0.1 or later with default mail profile
SQL Server Management Studio Express (Optional)
Security – Antivirus
Apply your corporate policy to create backups for file, VMs and SQL.
Reveal recommends that if Dell HW is leveraged, Dell Pro Support is purchased to assist with any hardware issues.
As an alternate approach to self-provisioning a mobile solution, Reveal IT Partner George Jon offers an appliance or Kit solution that can be shipped and deployed onsite. White Glove Service is an optional feature for delivery and installation. This service involves a GJ Engineer personally delivering the Kit to client site for installation, testing and admin/backend training anywhere in the world. The cost for this service is $2,500 USD / day plus travel expenses for the Single Server.
If a Kit requires shipping outside of the United States, GJ will need to know both the country where the device is purchased and final destination for shipping/tax purposes and to provide an accurate delivery timeline.
Reveal Cloud AI (Optional)
The Cloud AI platform of Reveal is essentially a middleware layer that handles communication with AWS for cloud analytic functions. At time of writing we support integration with AWS analytics only.
Qty | OS | Purpose | CPU | RAM | Storage | Additional Comments |
---|---|---|---|---|---|---|
1 | Linux 7.6 or later | Orchestrator Transcription Archive Wrk | 8 | 32GB | 2TB | This server should have as much free space as the largest image labeling plus the largest transcription job you intend to run times 2. |
An AWS account should be provisioned.
Reveal AWS includes the following additional AI powered services:
Image Labeling (Amazon Rekognition)
Translation (Amazon Translate)
Transcription (Amazon Transcribe)
Important
You must verify that these services are available within your AWS region. To verify these services are available, please see AWS Services Availability link.
The Cloud AI system uses an S3 bucket for staging purposes. This is necessary as AWS services require data to be stored locally in S3 for analysis to be performed.
Information about availability of AWS functions can be found here:
https://docs.aws.amazon.com/general/latest/gr/aws-service-information.htmlBrowse to the desired service and check the API availability in your target region. Below is a list of the services we use:
https://docs.aws.amazon.com/general/latest/gr/rekognition.html https://docs.aws.amazon.com/general/latest/gr/comprehend.html https://docs.aws.amazon.com/general/latest/gr/transcribe.html https://docs.aws.amazon.com/general/latest/gr/translate-service.html https://docs.aws.amazon.com/general/latest/gr/s3.html https://docs.aws.amazon.com/general/latest/gr/ec2-service.html
Frequently Asked Questions
Question: Are these the only options available?
Answer: No, not at all. The architectures described in this document illustrate minimum guidelines for likely deployment scenarios, but discussions with Reveal technicians for what-if scenarios are definitely encouraged as every organization's eDiscovery needs will vary greatly based on document complexity, number of graphics versus text in various documents, and the desired workflow within your eDiscovery operations.
Question: Can Virtual or Physical system be used?
Answer: Yes, but since eDiscovery needs rarely involve static usage, deployment onto Virtual Infrastructure is strongly recommended both for short-term spikes as well as ongoing growth of usage.
Question: Some scenarios list throughput estimates, are those guaranteed results?
Answer: Unfortunately, no. While those estimates are derived based on real-world usage of the Reveal-Brainspace solutions, numerous factors exist (document complexity, storage speed, etc.) which will impact overall throughput. Also, timeframe expectations vary between organizations. In most scenarios, throughput can be increased with either more robust systems or greater quantities of some components. Please feel free to discuss possible areas to improve throughput for certain activities with Reveal-Brainspace technicians during initial planning or deployment as well as over the long-run.
Question: Is Microsoft Outlook required on each Reveal Processing Agent?
Answer: Yes, as of now, this is a technical requirement.
Question: Is Microsoft Office required on each Reveal Processing Discovery Manager system?
Answer: No, this is not a technical requirement, but many users find that having MSOffice installed on these systems can assist with QC and troubleshooting activities.
Question: Do all components involved with the Reveal solution have to remain online 24x7?
Answer: No, in many scenarios, idle systems can be taken offline when not actively in use. An example of this would include Processing Agents when no Processing is actively being performed.
Question: Is Elasticsearch only used for Review searching?
Answer: No, the Reveal Review product stores the actual text from documents in Elasticsearch. Elasticsearch needs to be online for almost all Reviewing activities.
Question: What integrations exist between the Reveal products?
Answer: Reveal Processing can directly export to Reveal Review although the "Uploader" feature in the Reveal Review web front-end is only available currently in the Reveal SaaS offering.
Question: Both the Processing Agent and the Elasticsearch components stress the use of high-speed storage. Is this required?
Answer: No, but it is strongly recommended. Both of these components are very I/O intensive, so having local, high-speed storage can greatly reduce the duration of many activities.
Question: Some of the resource requirements are significant; can we start smaller than what is listed and expand later as activity increases?
Answer: Yes, but keep in mind that adjustments in allocated resources may not be deemed necessary until in-progress activities are underway, often with a pending deadline, and adjustments to some resources (typically CPU and Memory) may require an outage which can be difficult to schedule if large batch jobs are executing. The minimum resource allocations described in this document target eventual usage scenarios, but feel free to start smaller, and adjust upward when necessary, always keeping a close eye on resource consumption and live throughput.
Question: The documented Reveal solution appears to be heavily reliant upon Microsoft products; is this a requirement?
Answer: Yes. While some components such as Elasticsearch and the SMB Network Share can be deployed on other Operating Systems, the basis of the Reveal solution makes specific use of functionality within the Microsoft stack: IIS, MSSQL and .NET/C#.
Question: The documented Brainspace solution appears to be heavily reliant upon Linux operating systems; is this a requirement?
Answer: Yes. The Brainspace solution is deployed solely on Linux systems.
Question: Can SAN or NAS storage technologies be utilized in the solution?
Answer: Yes, these are very common. Just keep in mind that during high-load scenarios (Importing, Indexing, Production, etc.), the speed of storage will be a critical factor in the duration of those activities.
Question: Why are so many Reveal Processing Agents recommended?
Answer: Currently, the Reveal Processing Agent software can perform various tasks, but not at the same time. Multiple Processing Agents help ensure that no activities will be queued waiting for another activity to complete.
Question: Storage requirements for MSSQL system are high, is this required?
Answer: Best Practices are to utilize full-recovery model in the MSSQL environment. Storage allocations can be trimmed slightly if using simple-recovery model.
Question: Do the Discovery Manager and Review Manager thick clients have to reside on their own systems?
Answer: No. Typically, customers will deploy Discovery Manager on its own System, with Review Manager either being deployed on the same system as Discovery Manager or one of the Review Application Servers.
Question: How should backups be configured for Reveal Solutions?
Answer: Best Practices leverage differing backup strategies for each functional area. For example, SQL data should be in Full-Recovery mode and leverage Native SQL backups, while Elasticsearch should leverage built in backup/restore functionality through Curator. Brainspace, file servers, and virtual machines would leverage a form of Virtualization aware backup products.
Question: How many external IP addresses are necessary if we need external reviewers?
Answer: If using a proxy or load balancer one IP would be necessary. However, Additional IP addresses may be necessary for multiple web servers or publishing multiple services such as Review, Brainspace, and keycloak authentication services.