Intro to Predictive Coding: Overview & Interpretation of Terminology June 2014 | Page 14
If the answer to at least one of these five questions is yes, then
there is one more question to consider.
• Does your collection contain more than about 5,000 text
documents?
Predictive coding does not require a large set of documents,
but its value tends to grow disproportionately as the size of the
document collection grows, because the effort typically
required to train a system either stays roughly constant or grows
more slowly than the document collection itself. Small
collections can require almost the same level of training effort
as large collections do.
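
The scaling argument can be illustrated with a little arithmetic. The sketch below is only an illustration: the fixed training set of 2,000 documents and the three collection sizes are hypothetical figures chosen for the example, not numbers from this paper, and the function name training_share is likewise invented here. It assumes the simplest case described above, where training effort does not grow with the collection.

```python
def training_share(collection_size: int, training_docs: int) -> float:
    """Return the fraction of a collection reviewed by hand for training."""
    # A training set cannot be larger than the collection it is drawn from.
    reviewed = min(training_docs, collection_size)
    return reviewed / collection_size


# Hypothetical assumption: training effort stays fixed at 2,000 documents.
TRAINING_DOCS = 2_000

if __name__ == "__main__":
    for size in (5_000, 50_000, 500_000):
        share = training_share(size, TRAINING_DOCS)
        print(f"{size:>7,} documents: {share:.1%} reviewed for training")
```

Under these assumed figures, the same training effort covers 40% of a 5,000-document collection but well under 1% of a 500,000-document collection, which is why the relative payoff rises with collection size.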