Intro to Predictive Coding: Overview & Interpretation of Terminology June 2014 | Page 31
Glossary
Often refers to a simple random sample, in which
each item in the population has an equal chance of
being selected in the sample.
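As an illustration of the equal-chance property, the following minimal sketch draws a simple random sample from a list of document IDs using Python's standard library. The function name `simple_random_sample`, the ID range, and the sample size are assumptions made for this example only; they are not part of any particular review platform.

```python
import random


def simple_random_sample(population, k, seed=None):
    """Draw k distinct items; every item has an equal chance of selection.

    `seed` is optional and only makes the draw reproducible.
    """
    rng = random.Random(seed)
    # random.Random.sample selects without replacement.
    return rng.sample(population, k)


# Hypothetical population: 1,000 document IDs.
doc_ids = list(range(1, 1001))
sample = simple_random_sample(doc_ids, 50, seed=42)
print(len(sample), "documents selected for review")
```

Fixing the seed makes the sample repeatable, which may be useful when a sampling protocol has to be documented or reproduced later.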
Seed set – a collection of pre-categorized documents used for the initial training of a
predictive coding system.
Support vector machine (SVM) – a machine learning approach used for categorizing data. The
goal of the SVM is to learn the boundaries that
separate two or more classes of objects.
Given a set of already categorized training
examples, an SVM training algorithm identifies the
differences between the examples of each training
category and can then apply similar criteria to
distinguish future examples.
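The idea of learning a boundary from categorized examples can be sketched with a small linear SVM written in plain Python. This is a simplified illustration, not how any specific predictive coding product works: it assumes each document has already been reduced to two numeric features, uses labels +1 (for example, responsive) and -1 (non-responsive), and trains by sub-gradient descent on the regularized hinge loss. The function names, the toy data, and the parameter values (`lr`, `lam`, `epochs`) are all assumptions chosen for the example.

```python
def train_linear_svm(examples, labels, lr=0.01, lam=0.01, epochs=200):
    """Fit weights w and bias b for a linear soft-margin SVM.

    Minimizes lam/2 * ||w||^2 + hinge loss by per-example
    sub-gradient descent. Labels must be +1 or -1.
    """
    w = [0.0] * len(examples[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(examples, labels):
            score = sum(wi * xi for wi, xi in zip(w, x)) + b
            # Regularization step: shrink the weights slightly.
            w = [wi * (1 - lr * lam) for wi in w]
            # Hinge-loss step: only examples inside the margin
            # (or misclassified) move the boundary.
            if y * score < 1:
                w = [wi + lr * y * xi for wi, xi in zip(w, x)]
                b += lr * y
    return w, b


def predict(w, b, x):
    """Return +1 or -1 depending on which side of the boundary x falls."""
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if score >= 0 else -1


# Hypothetical, already categorized training examples.
train_x = [(2, 2), (3, 3), (3, 1), (-2, -2), (-3, -1), (-1, -3)]
train_y = [1, 1, 1, -1, -1, -1]

w, b = train_linear_svm(train_x, train_y)
train_predictions = [predict(w, b, x) for x in train_x]
new_predictions = [predict(w, b, (4, 4)), predict(w, b, (-4, -4))]
print(train_predictions, new_predictions)
```

Real systems typically work with thousands of text-derived features and rely on optimized SVM libraries, but the principle is the same: the training examples determine a boundary, and new documents are categorized by the side on which they fall.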
TAR – Technology-Assisted Review. Any of a
number of approaches that use technology,
usually computer technology, to facilitate the
review of documents for discovery. See CAR.