Intro to Predictive Coding: Overview & Interpretation of Terminology, June 2014
Chapter Two
9 Technologies and What They Contribute to Predictive Coding
1. Latent Semantic Analysis. A mathematical approach that
seeks to summarize the meaning of words by looking at the
documents that share those words. LSA builds up a
mathematical model of how words are related to documents
and lets users take advantage of these computed relations to
categorize documents.
2. Probabilistic Latent Semantic Analysis. A second
mathematical approach that seeks to summarize the meaning
of words by looking at the documents that share those words.
Where LSA relies on matrix algebra, PLSA builds a statistical
model that treats each document as a mixture of underlying
topics and estimates how likely each word is under each topic.
Users can take advantage of these computed relations to
categorize documents.
3. Support Vector Machine. A mathematical approach that
seeks to find a line that separates responsive from
nonresponsive documents so that, ideally, all of the
responsive documents are on one side of the line and all of
the nonresponsive ones are on the other side. Among the lines
that achieve this, it prefers the one that leaves the widest
gap between the two groups.
4. Nearest Neighbor Classifier. A classification system that
categorizes documents by finding an already classified example
that is very similar (near) to the document being considered. It