Understand Analytics Index Fields

Fields used for index analysis are listed in the following table. These fields can be created at the time you configure indexes. Fields are required unless otherwise noted.

field

description

IA Rep Doc

When an index is built, the system groups documents into “clusters.” These are separate from the clusters used by the concept wheel and exclusive to the index. Each cluster is centered on a document that we call the representative document. All documents in the cluster are similar to the representative document to within the cutoff you specify.

IA Doc Treatment

How the document was classified by the analytics engine. A document is classified as one of three types:

  • Normal – Most documents will be classified as Normal. This means the document was composed primarily of non-numeric text and was long enough to be clustered in the index.

  • Short – Documents that do not contain enough significant words.

  • Binary – Documents containing an abundance of numbers, a shortage of white space, unusual punctuation, or some other criteria that identifies the document as something other than normal prose.

IA Sig Words

Each document treated as Normal will display the keywords for that document’s cluster. These are the words that contributed the most to making the documents in the cluster conceptually similar to the representative document. The output is ordered with the words that contributed the most coming first.

All keywords are contained in the representative document, but the keywords won't necessarily be contained in all documents in the cluster.

 

Related Topics

Create Email Full Thread Search

Create Inclusive Email Thread Search

Create Near Duplicate Search Excluding Email Thread