Overview: Enterprise Streaming Discovery

A single, optimized job type included in eCapture is named Enterprise Streaming Discovery. Several characteristics make Streaming Discovery unique:

Streaming Discovery is particularly useful for large data sets. For example, portions from a large data set may have just finished the Discovery phase, while other portions from this same large data set were already pushed to the review tool. Unlike the traditional method, there is no need to wait until the entire data set completes a single phase before it can move on to the next phase. This former method "chunked" the data, thereby requiring stops and starts at each phase and delaying the review process.

The Enterprise Streaming Worker, used for Streaming Discovery, is built into eCapture’s existing framework. Streaming Worker uses in-memory processing, multithreading, and newer technology to take advantage of the powerful computing capability built into newer, high-end computers. At the same time, Streaming Worker can run on existing hardware, including virtual computers.

Streaming Discovery Workflow

The Streaming Discovery Workflow is as follows:

  1. After a data set (mailstores, PSTs, etc.) is loaded , its files are extracted to disk and distributed to the Streaming Workers.
  2. The files then enter the Discovery phase, which encompasses both discovery and data extraction.
  3. The completed document families are retrieved and sent through the basic filtering phase before they are queued for exporting.
  4. Exports are driven by the existing eCapture Export Series method. Data may be pushed to Eclipse, Relativity, and to disk if needed. An Export Interval value is set to determine how often a Streaming Discovery Job will pull document families for Export.
  5. The following diagram demonstrates the workflow process for Streaming Discovery.

Streaming Discovery Job Right-Click Menu Options

When you right-click a Streaming Discovery Job, a context menu displays showing the following options:

Convert Legacy Cases to Streaming Discovery Cases

eCapture allows you to convert a legacy case (which includes Standard Discovery, Data Extract, and Process jobs) to a Streaming Discovery case, which provides greater efficiency and improved performance when running jobs. You can launch this conversion process by using the Case Hash Conversion option, available from a right-click context menu in the eCapture Controller. This option rehashes previously imported emails and stores the generated hash values and item IDs in a new table, allowing Streaming de-duplication. For more information about this task, see Convert Legacy Cases to Streaming Discovery Cases.


Related Topics

Introduction to eCapture

Overview: Enterprise Integration

Create a Streaming Discovery Job

Modify Streaming Discovery Jobs