April 12, 2016 - Statistics.com: Data Science, Analytics & Statistics Courses

Week #18 – n

In statistics, “n” denotes the size of a dataset, typically a sample, in terms of the number of observations or records.

A corpus is a body of documents to be used in a text mining task. Some corpuses are standard public collections of documents that are commonly used to benchmark and tune new text mining algorithms. More typically, the corpus is a body of documents for a specific text mining task – e.g. a set ofContinue reading “Week #17 – Corpus”