Flexible, affordable statistics education.
Designed to help you master the software you need to enhance your skills and the practical experience you need to get ahead.
Designed to help you master the software you need to enhance your skills and the practical experience you need to get ahead.

Smoothing with P-splines (Using R)
taught by Brian Marx
and Paul Eilers
Splines are combinations of different functions that are used to describe and model data differentially in a smooth fashion over different ranges. In this course, you will learn how to use R software to develop splines for data smoothing.
Medical and social science researchers, data miners, environmental analysts; any researcher who must develop statistical models with "messy" data.
Dates:Add $50 service fee if you require a prior invoice, or if you need to submit a purchase order or voucher, pay by wire transfer or EFT, or refund and reprocess a prior payment. Please use this printed registration form, for these and other special orders.
Courses may fill up at any time and registrations are processed in the order in which they are received. Your registration will be confirmed for the first available course date, unless you specify otherwise. Multiple course registrations may be entitled to tuition discounts; read more.
Smoothing with P-splines (Using R)
taught by Brian Marx
and Paul Eilers
The linear model is ubiquitous in classical statistics, yet real-life data rarely follow a purely linear pattern. In fact, real-life data often do not follow a pattern that is well-described by a single simple function of any sort. Splines are combinations of different functions that are used to describe and model data differentially in a smooth fashion over different ranges.
P-splines are especially popular (over 500 citations for the instructors' original article in Statistical Science introducing P-splines) because they are widely applicable and effective. In this course, you will learn how to use R software for data smoothing via P-splines - a combination of regression on a B-spline basis (basis splines) and difference penalties (on the B-spline coefficients).
Smoothing is presented as an everyday tool for data analysis and statistics. The course starts with univariate P-splines smoothing using B-splines and penalties, then extend such tools into the generalized linear model framework, specifically through Poisson and binomial smoothing. This leads to a natural development in density estimation. The optimal amount of smoothness, via the choice of a positive tuning parameter, is suggested through both cross-validation and AIC. We further extend P-spline modeling, demonstrating its adaptive ability to accommodate rich data structure, including: varying coefficient models, generalized additive models, and full two-dimensional smoothing using tensor product P-splines. We close with some extensions to high dimensional regression, where the regressors are in the form of signals or curves.
You will learn how to balance the competing demands of fidelity to the data and smoothness, and how to optimize the smoothing. The final session of the course covers multidimensional smoothing.
This course is a core requirement or elective in the following Program(s) in Analytics and Statistical Studies (PASS):
Prerequisite(s):
HOMEWORK:
Homework in this course consists of guided data analysis problems using software and guided data modeling problems using software.
This course takes place over the internet at the Institute for 4 weeks. During each course week, you participate at times of your own choosing - there are no set times when you must be online. Course participants will be given access to a private discussion board. In class discussions led by the instructor, you can post questions, seek clarification, and interact with your fellow students and the instructor.
The course typically requires 15 hours per week. At the beginning of each week, you receive the relevant material, in addition to answers to exercises from the previous session. During the week, you are expected to go over the course materials, work through exercises, and submit answers. Discussion among participants is encouraged. The instructor will provide answers and comments, and at the end of the week, you will receive individual feedback on your homework answers.
The text will be provided.
Software:We emphasize the use of modern softwar and we provide functions for R/S-Plus. The text and R-code will be provided. You must have a copy of R for the course. Click Here for information on obtaining a free copy.
Smoothing with P-splines (Using R)
taught by Brian Marx
and Paul Eilers