---
title: "Supervised Learning Overview"
slug: "supervised-learning-overview"
updated: 2024-12-06T15:13:33Z
published: 2024-12-06T15:13:33Z
canonical: "docs.revealdata.com/supervised-learning-overview"
---

# Supervised Learning Overview

## Introduction

*Supervised learning*, also known as supervised machine learning, is a subcategory of machine learning and artificial intelligence. It is the process of using manually coded document examples to train Classifiers (algorithms) to automatically classify data or predict whether a document is relevant, privileged, etc.

During training, each classifier generates predictive scores that indicate how closely a document matches the classifier's subject matter. Classifiers may be binary (*Positive / Negative* or *Relevant / Not Relevant*), based on reviewer application of mutually exclusive AI tags, or may categorize data by issues defined in a multi-select AI tag. Training results in the creation of an *AI Model*, which may be saved to an AI Model Library and reused in similar matters.

## Process

The process begins with creating an AI Tag in Reveal. When you create an AI Tag, the system automatically creates a corresponding *Classifier*. A Classifier is a representation of the AI Model during the training process. As you code documents with your AI Tag, those coding decisions are used to train the Classifier. After a training round completes, the Classifier's AI Model generates Predictive Scores for each document.

Here is a graphical representation of the Supervised Learning workflow.

![](https://cdn.us.document360.io/3e21d801-ca9f-4c51-93db-9cbd32741f3d/Images/Documentation/image-81K3QI20.png)

Before creating a classifier, you must first create a tag in Reveal and flag it as ***Prediction Enabled***. (See [***How to Create & Manage Tags***](/reveal/docs/create-manage-tags) for details on how to do this.) Typically, the tag would be a *mutually exclusive* type and its choices would be ***Positive / Negative*** or ***Responsive / Non-Responsive***. You can add additional choices to the tag that will appear as radio buttons for use in Reveal, for example ***Further Review Required*** or ***Tech Issue***, but these will not be used in AI classification.

This type of classifier is typically applied to identifying relevant data in the project, and may also be used to implement one of two review strategies:

- **Prioritized Review** pushes documents that rounds of training have automatically classified as relevant to be reviewed first. This strategy seeks to expedite assignment of documents scored as probably relevant.
- **Recommended Coding Review** goes one step further by applying classifier coding decisions to unreviewed documents. This use case is often applied to the production of documents to a requesting third party. Target metrics for coding decisions, as reported under View Classification, are established and applied to compare and validate these decisions against benchmarks set by subject matter experts.
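The idea behind Prioritized Review can be illustrated with a minimal Python sketch. This is not Reveal's implementation: the `item_id`, `score`, and `tag` field names, the 0-100 score scale, and the `prioritized_queue` helper are all hypothetical, chosen only to show unreviewed documents being ordered by predictive score.

```python
def prioritized_queue(documents, batch_size=2):
    """Return the IDs of the highest-scoring documents that have no tag yet.

    documents: list of dicts with hypothetical keys "item_id", "score"
    (assumed 0-100 predictive score) and "tag" (None when unreviewed).
    """
    unreviewed = [d for d in documents if d["tag"] is None]
    # Highest predictive score first, so likely-relevant documents are reviewed first.
    unreviewed.sort(key=lambda d: d["score"], reverse=True)
    return [d["item_id"] for d in unreviewed[:batch_size]]


# Hypothetical project data; document 103 is already coded and is skipped.
documents = [
    {"item_id": 101, "score": 35, "tag": None},
    {"item_id": 102, "score": 92, "tag": None},
    {"item_id": 103, "score": 88, "tag": "Positive"},
    {"item_id": 104, "score": 71, "tag": None},
    {"item_id": 105, "score": 12, "tag": None},
]
next_batch = prioritized_queue(documents)
```

Here `next_batch` contains the two unreviewed documents with the highest scores, which is the ordering a prioritized review queue aims for.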

Another type of tag that may be used in creating a classifier is a *multi-select* type, which offers a checkbox for each choice so that one or many may be selected. This might be used to classify issues that appear in documents under review. Here Reveal will create a classifier for each choice under the tag that is flagged as ***Prediction Enabled***.

![Supervised-01](https://cdn.us.document360.io/3e21d801-ca9f-4c51-93db-9cbd32741f3d/Images/Documentation/Supervised-01.png)

Once the connected AI tag is created, click **Supervised Learning** to open the **Classifiers** screen.

## View and Edit Classifier

![Supervised-00](https://cdn.us.document360.io/3e21d801-ca9f-4c51-93db-9cbd32741f3d/Images/Documentation/Supervised-00.png)

The Classifiers screen will display all Classifier models currently associated with the Project on cards, with current status and the option to view or edit their details or create a new Classifier.

Classifier cards provide a summary of the model's status:

- **Name:** The short name of the classifier; should label its intended function.
- **Composition method:** Whether the model is set for ***Diverse Active*** (ongoing assessment and automatic update) or ***Diverse High Scoring*** (user-provided training by tagging).
- **Rounds:** How many sets of documents have been provided for evaluation so far, or PRE-TRAINING for new classifiers.
- **Progress:** How many of the provided documents have been coded.
- **Status:** The model's state of readiness for use; hover the pointer over the (?) information mark for more detail.

As of Reveal’s February 2024 release, the classifier screen may also be shown in List view. Clicking the classifier name opens the **View Details** screen, and the **Edit Classifier** button is at the right of the row. Note too the addition of a *Composition* field explaining the mode of batching used and its effect on the training of the classifier.

![Supervised-02](https://cdn.us.document360.io/3e21d801-ca9f-4c51-93db-9cbd32741f3d/Images/Documentation/Supervised-02.png)

As noted above, embedded in the Classifier Name is a link to **View Details**. Within this screen is a Classifier navigation panel that currently has two sections, **PROGRESS** (for the status of the classifier’s training) and **MODEL INSIGHTS** (for reports on features underlying Reveal’s analytics). **PROGRESS** has two entries, **Tagging & Scoring** and **Control Set**.

### View Classifier – Tagging & Scoring

![Supervised-03](https://cdn.us.document360.io/3e21d801-ca9f-4c51-93db-9cbd32741f3d/Images/Documentation/Supervised-03.png)

The details screen shows the results and status of training as discussed in the card summary above, including:

- Training type
- Status
- Number of Documents Tagged
  - Number Positive
  - Number Negative

The chart above shows the predicted score distribution for documents not yet tagged when the details screen is first opened. When ***Hide Not Tagged*** is toggled on, the chart (shown below) displays **Tagging & Scoring** across the project, with documents as coded (e.g., *Positive* / *Negative* for **Responsiveness**) graphed against the AI model's predicted Responsiveness score. See [***How to Evaluate a Classifier***](/reveal/docs/evaluate-a-classifier) for further details on this and the following classifier reports.

![Supervised-04](https://cdn.us.document360.io/3e21d801-ca9f-4c51-93db-9cbd32741f3d/Images/Documentation/Supervised-04.png)

### View Classifier – Control Set

Control Sets are used to establish target metrics for the classifier. These benchmarks, set on sample data by subject matter experts, allow review managers to compare and validate classifier predictive scores. The standard metrics are:

- **Precision** – the percentage of documents the classifier scores as positive that reviewers also tagged as positive.
- **Recall** – the percentage of documents reviewers tagged as positive that the classifier also scores as positive.
- **F1** – the harmonic mean of Precision and Recall.
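The three metrics follow the standard definitions and can be worked through in a short Python sketch. The document IDs and the `precision_recall_f1` helper below are hypothetical, and the sketch assumes that "predicted positive" means a document scored at or above whatever threshold is in use; it illustrates the arithmetic only, not how Reveal computes its reports.

```python
def precision_recall_f1(tagged_positive, predicted_positive):
    """Compute precision, recall and F1 from two sets of document IDs.

    tagged_positive: documents reviewers tagged Positive in the control set.
    predicted_positive: documents the classifier scored as positive.
    """
    true_positives = len(tagged_positive & predicted_positive)
    precision = true_positives / len(predicted_positive) if predicted_positive else 0.0
    recall = true_positives / len(tagged_positive) if tagged_positive else 0.0
    total = precision + recall
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical control set: reviewers tagged four documents Positive,
# and the classifier scored five documents as positive.
tagged_positive = {"DOC-001", "DOC-002", "DOC-003", "DOC-004"}
predicted_positive = {"DOC-002", "DOC-003", "DOC-004", "DOC-005", "DOC-006"}
precision, recall, f1 = precision_recall_f1(tagged_positive, predicted_positive)
```

In this example three documents appear in both sets, so precision is 3/5 = 60%, recall is 3/4 = 75%, and F1 is about 66.7%.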

### View Classifier – Training Statistics

The Training Statistics screen, added in Reveal's February 2024 release, provides a graphical and numeric report on each training round as compared with editable Control Set targets.

![](https://cdn.us.document360.io/3e21d801-ca9f-4c51-93db-9cbd32741f3d/Images/Documentation/image-WNPFQFFR.png)

### View Classifier – Model Insights – Features

**Features** reports keywords, entities and other values that have contributed to the current classifier’s model and the weight accorded each feature.

![](https://cdn.us.document360.io/3e21d801-ca9f-4c51-93db-9cbd32741f3d/Images/Documentation/image-4L898YDO.png)

### View Classifier – Model Insights – Document Score

**Document Score** applies **MODEL INSIGHTS** to an individual document selected using its **ItemID** control number.

![](https://cdn.us.document360.io/3e21d801-ca9f-4c51-93db-9cbd32741f3d/Images/Documentation/image-ZPYDMG3G.png)

### View Classifier – Model Insights – Reviewer Agreement

**Reviewer Agreement** shows, for each training round, how often the classifier and human reviewers agreed. Positive, Negative and Overall agreement rates are charted as percentages for each cycle. The Positive Score Threshold may be set higher than the 60% shown in the illustration below, and a later round may be selected as a starting point.

![](https://cdn.us.document360.io/3e21d801-ca9f-4c51-93db-9cbd32741f3d/Images/Documentation/image-0VGTOVK7.png)
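One plausible way to arrive at per-round agreement rates is sketched below in Python. The exact calculation Reveal uses is not described here, so the sketch rests on stated assumptions: scores are on a 0-100 scale, a score at or above the Positive Score Threshold counts as a positive call by the classifier, and the `agreement_by_round` helper and sample records are hypothetical.

```python
from collections import defaultdict


def agreement_by_round(records, threshold=60):
    """Return {round: (positive %, negative %, overall %)} agreement rates.

    records: iterable of (round_number, score, reviewer_tag) tuples, where
    score is on an assumed 0-100 scale and reviewer_tag is "Positive" or
    "Negative". A score at or above the threshold counts as a positive call.
    """
    counts = defaultdict(lambda: {"pos": 0, "pos_agree": 0, "neg": 0, "neg_agree": 0})
    for round_number, score, reviewer_tag in records:
        predicted_positive = score >= threshold
        c = counts[round_number]
        if reviewer_tag == "Positive":
            c["pos"] += 1
            c["pos_agree"] += int(predicted_positive)
        else:
            c["neg"] += 1
            c["neg_agree"] += int(not predicted_positive)

    def percent(agree, total):
        return 100.0 * agree / total if total else 0.0

    return {
        round_number: (
            percent(c["pos_agree"], c["pos"]),
            percent(c["neg_agree"], c["neg"]),
            percent(c["pos_agree"] + c["neg_agree"], c["pos"] + c["neg"]),
        )
        for round_number, c in sorted(counts.items())
    }


# Hypothetical coded documents from two training rounds.
records = [
    (1, 80, "Positive"), (1, 40, "Positive"), (1, 30, "Negative"), (1, 70, "Negative"),
    (2, 90, "Positive"), (2, 65, "Positive"), (2, 20, "Negative"), (2, 61, "Negative"),
]
rates = agreement_by_round(records)
```

Passing a larger `threshold` value mirrors raising the Positive Score Threshold: fewer documents count as positive calls, which shifts the positive and negative agreement rates.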

### View Classifier – Model Insights – Feature History

**Feature History** compares the top-ranking 500 features from the latest round with how they ranked in a previous round. The table may be downloaded to CSV format.

![](https://cdn.us.document360.io/3e21d801-ca9f-4c51-93db-9cbd32741f3d/Images/Documentation/image-0868JBIP.png)
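The round-over-round comparison can be pictured with a small Python sketch. It assumes features are ranked by weight in descending order, which may not match how Reveal ranks them; the `feature_history` helper, feature names, and weights are hypothetical.

```python
def feature_history(latest, previous, top_n=500):
    """Compare feature ranks between two rounds.

    latest, previous: dicts mapping feature name to weight for each round.
    Returns rows of (feature, latest_rank, previous_rank, change) for the
    top_n features of the latest round. A positive change means the feature
    moved up; previous_rank and change are None for features that did not
    appear in the previous round.
    """
    def ranks(weights):
        # Assumption: rank 1 is the feature with the largest weight.
        ordered = sorted(weights, key=lambda feature: weights[feature], reverse=True)
        return {feature: position + 1 for position, feature in enumerate(ordered)}

    latest_ranks = ranks(latest)
    previous_ranks = ranks(previous)
    rows = []
    for feature, rank in sorted(latest_ranks.items(), key=lambda item: item[1])[:top_n]:
        previous_rank = previous_ranks.get(feature)
        change = previous_rank - rank if previous_rank is not None else None
        rows.append((feature, rank, previous_rank, change))
    return rows


# Hypothetical feature weights from two training rounds.
latest = {"merger": 0.9, "invoice": 0.7, "lunch": 0.1}
previous = {"invoice": 0.8, "merger": 0.5}
rows = feature_history(latest, previous)
```

In this example "merger" rises from rank 2 to rank 1, "invoice" falls from rank 1 to rank 2, and "lunch" is new in the latest round.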

## Edit Classifier

Clicking the **Configure** gear icon in the upper right corner of the card opens the **Edit Classifier** screen, where its configuration may be viewed and updated. See [***How to Build & Configure a Classifier***](/reveal/docs/build-configure-a-classifier) for details on these settings.

## Related

- [Evaluate an AI Model](/evaluate-an-ai-model.md)
- [Apply AI Model from Library](/apply-ai-model-from-library.md)
- [Create a Control Set](/create-a-control-set.md)
- [Email Threading Overview](/email-threading-overview.md)
- [Add AI Model to Library](/add-ai-model-to-library.md)
