Indexing Errors


Investigating errors

You can investigate the cause of indexing errors in several locations. Click any of the links below to open the article that covers that feature.

Common Errors

The table below lists common indexing errors and their descriptions.

Error

Description

Convert* Empty

Conversion of the native file succeeded, but it produced no output text.

Convert* Failures

An error occurred while extracting text and creating HTML from the native file. These files are typically corrupt or encrypted.

Empty Source

The file is empty, so it has no content to extract or convert.

File contains no usable text

The file has no searchable text, for example a non-text-searchable PDF or an image-based file.

For PDF-based files, re-OCRing from the Review Grid may extract additional text.

File Conversion is not supported

The source file was not found during indexing.

File type is not supported

A typical error when running the Spreadsheet View index on non-Excel file types.

Best practice is to isolate spreadsheet files and run the Spreadsheet View index. See Generate Native PDF and Spreadsheet Views for more information.

Insert Failures

There was a failure when attempting to add the document’s text to the Elastic Index.

Missing

The native was not found.

Not Defined

The path to the OCR text or native file is missing. For third-party loads, check the load file paths.

Not Supported

An error occurred while extracting text and creating HTML from an unsupported file type.

Too large

The file is larger than the size limits set within the Text Set settings. It’s recommended to download and view these files natively.

Unknown Exception: 4

Rarely seen. It is usually linked to the Extracted Text Set index when the document contains no indexable text.

* “Convert” is the process of extracting text and rendering HTML from the native file.
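Several of the errors in the table above (Not Defined, Missing, Empty Source, Too large) relate to file paths and file sizes, so they can often be spotted before indexing. The following is a minimal Python sketch of such a pre-check, not part of the product. The function names `precheck` and `precheck_all` and the `MAX_BYTES` value are hypothetical; the actual size limit is whatever is configured in the Text Set settings, and the labels returned only mirror the table for readability.

```python
import os

# Hypothetical limit for illustration only; use the value from your Text Set settings.
MAX_BYTES = 50 * 1024 * 1024


def precheck(path, max_bytes=MAX_BYTES):
    """Return the table label a file would likely trigger, or None if no issue is found."""
    if not path or not path.strip():
        return "Not Defined"   # no path was supplied, e.g. a blank load file field
    if not os.path.isfile(path):
        return "Missing"       # a path was supplied but no file exists there
    size = os.path.getsize(path)
    if size == 0:
        return "Empty Source"  # 0 KB file, nothing to extract or convert
    if size > max_bytes:
        return "Too large"     # exceeds the assumed size limit
    return None


def precheck_all(paths, max_bytes=MAX_BYTES):
    """Return (path, label) pairs for every path that fails the pre-check."""
    flagged = []
    for path in paths:
        label = precheck(path, max_bytes)
        if label is not None:
            flagged.append((path, label))
    return flagged
```

Running `precheck_all` over the native and text paths taken from a third-party load file gives a short list of documents to fix before the index job is submitted. Conversion-related errors, such as corrupt or encrypted files, cannot be detected this way.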

Common Alerts

The table below lists common alerts and their descriptions.

Alert

Description

Annotations removed during bulk re-image

If annotations were applied to the Image view, a bulk re-image will overwrite them when the settings below are checked at the time the image job is submitted.

(Screenshot: document overwrite settings, with the available overwrite options highlighted.)

Document contains no indexable text

The document exists in the project, but no searchable text could be extracted during indexing. Common reasons include non-text-searchable PDFs, image-based files, 0 KB files, corruption, or encryption.

Source file not found during indexing

The source file could not be located or accessed during indexing, so its text could not be extracted for searching. The most common reason is an incorrect file path.

Source file was too large to index

The file is larger than the size limits set within the Text Set settings. It’s recommended to download and view these files natively.
