logo.gif The leading source for professional development COURSES in statistics
 ÖÐÎÄ Course Login
Home > Resources > Glossary

Statistical Glossary

Classification and Regression Trees (CART):

Classification and regression trees (CART) is a set of techniques for classification and prediction. The technique is aimed at finding a rule(s) which could predict the value of a dependent variable Math image from known values of Math image explanatory variables Math image (predictors). The predictor variables Math image may be a mixture of categorical and continuous variables.

The initial data represent a set of objects with known values of the dependent variable Math image and predictors Math image . CART builds trees - i.e. formulates simple if/then rules for recursive partitioning (splitting) of all the objects into smaller subgroups. Each such step may give rise to new "branches". The goal of this process is to maximize homogeneity of the values of the dependent variable Math image in the various subgrops.

All the CART techniques are essentially non-parametric - they do not rely on any particular assumptions about the type of dependence of the dependent variable Math image on predictors Math image (in contrast to various regression techniques) and about statistical properties of the data. This is an essential practical advantage for the cases when apriori information about the data is limited.

There are two main approaches in CART - classification trees (used to predict the class or category of records) and regression trees , (used to predict a continuous value).

Also see: on classification trees on regression trees , CHAID .

And the short course Introduction to Data Mining

Browse Other Glossary Entries