Statistical Word of the Week

Jul 16, 2013
facebook LinkedIn twitter

Week # 29 - Training data

Also called the training sample, training set, calibration sample.  The context is predictive modeling (also called supervised data mining) -  where you have data with multiple predictor variables and a single known outcome or target variable.

The training data are a subset of all the data that you have available, and are used to fit various models. The models are then applied to another subset(s) of the same data and predicted values of the outcome variable are calculated.  The predicted values are then compared to the actual values, and measures of model performance are calculated and the models are compared.

Promoting better understanding of statistics throughout the world.

The Institute for Statistics Education offers an extensive glossary of statistical terms, available to all for reference and research. We will provide a statistical term every week, delivered directly to your inbox. To improve your own statistical knowledge, sign up here.

Improve my Statistical Knowledge!
Please enter first name.
Please enter last name.
Please enter valid E-mail.
Send me a "Stats Word of the Week".

Rather not have more email?  Bookmark our "Stats Word of the Week" page.

Want to be
notified of future
course offerings?
Please enter first name.
Please enter last name.
Please enter valid E-mail.

What our students say:

© Statistics.com 2004-2014