Intro to Predictive Coding: Overview & Interpretation of Terminology June 2014 | Page 31

Glossary Often refers to a simple random sample, in which each item in the population has an equal chance of being selected in the sample. Seed set – a collection of pre-categorized documents that is used as the initial training for a predictive coding system. Support vector machine (SVM) – a machinelearning approach used for categorizing data. The goal of the SVM is to learn the boundaries that separate two or more classes of objects. Given a set of already categorized training examples, an SVM training algorithm identifies the differences between the examples of each training category and can then apply similar criteria to distinguishing future examples. TAR – Technology Assisted Review. Any of a number of technologies that use technology, usually computer technology, to facilitate the review of documents for discovery. See CAR.