In statistics, “n” denotes the size of a dataset, typically a sample, in terms of the number of observations or records.
Daily Archives: April 12, 2016
Week #17 – Corpus
A corpus is a body of documents to be used in a text mining task. Some corpuses are standard public collections of documents that are commonly used to benchmark and tune new text mining algorithms. More typically, the corpus is a body of documents for a specific text mining task – e.g. a set ofContinue reading “Week #17 – Corpus”