Regression analysis provides a "best-fit" mathematical equation for the relationship between the dependent variable (response) and independent variable(s) (covariates).
There are two major classes of regression - parametric and non-parametric. Parametric regression requires choice of the regression equation with one or a greater number of unknown parameters. Linear regression, in which a linear relationship between the dependent variable and independent variables is posited, is an example. The aim of parametric regression is to find the values of these parameters which provide the best fit to the data. The number of parameters is usually much smaller than the number of data points. In contrast, the <non-parametric regression> requires no such a choice of the regression equation.
Want to learn more about this topic?
Statistics.com offers over 100 courses in statistics from introductory to advanced level. Most are 4 weeks long and take place online in series of weekly lessons and assignments, requiring about 15 hours/week. Participate at your convenience; there are no set times when you must to be online. Ask questions and exchange comments with the instructor and other students on a private discussion board throughout the course.
The aim of this course is to provide an easy introduction to inference and association through a series of practical applications, based on the resampling/simulation approach. Once you have completed this course you will be able to test hypotheses and compute confidence intervals regarding proportions or means, computer correlations and fit simple linear regressions.
This course provides an easy introduction to ANOVA and multiple linear regression through a series of practical applications.
In this course you will learn how multiple linear regression models are derived, use software to implement them, learn what assumptions underlie the models, learn how to test whether your data meet those assumptions and what can be done when those assumptions are not met, and develop strategies for building and understanding useful models.
This course provides a solid introduction to the ideas and techniques of statistical modeling.
This course covers the analysis of data gathered in surveys.
The topics covered in this course include prediction intervals, tolerance intervals, calibration intervals, measurement error, accelerated life testing, measurement system appraisal, reliability and lifetime testing.
Rule induction is an important component of data mining, and this course covers two main styles of generating rules.
This course begins by developing basic concepts such as hyperplanes, features spaces and kernels; then it covers the development of support vector machines (SVM's).