logo.gif The source for online courses
in statistics
 ÖÐÎÄ Course Login
Home > Our Courses >



Modeling in R

Dr. Sudha Purohit

Aim of Course:

In this course you will learn how to use R to build statistical models and use them to analyze data. Multiple regression is covered first followed by logistic regression. The generalized linear model is then introduced and shown to include multiple regression and logistic regression as special cases. The Poisson model for count data will be introduced and the concept of overdispersion described. You will then learn how to analyse longitudinal data, first using relatively straightforward graphics and simple inferential approaches. This will be followed by describing mixed-effects models and the generalized estimating approach for such data. The emphasis in the course is how to use R to fit the models listed and how to interpret the R output, rather than the theoretical background of the models. Consequently some knowledge of linear models is required (statistics.com has courses in all of them).

Who Should Take This Course:

Anyone who is familiar with R and wants to learn how to use it to build and use statistical models. Important: the course will cover a variety of techniques, and it is expected that most participants will be interested in learning how to use R for some, but not all of them. If you expect to cover all the techniques as you take the course, you should budget a bit more time than the 10-15 hours suggested.

For those enrolled in a Program of Advanced Statistical Studies, this is a required or elective course in the following Programs:

  • Biostatistics (epidemiology) - elective

Course Program:

The course is structured as follows:

SESSION 1: Linear Regression, Logistic Regression
  • Multiple linear regression with R
  • Simple examples, dummy explanatory variables, interpreting regression coefficients. Finding a parsimonious model.
SESSION 2: The generalized model with R.
  • Logistic regression with R.
  • The need for a different model when the response variable is binary. The logistic transform and fitting the model to some simple examples. Deviance residuals.
  • Multiple regression and logistic regression as special cases of the generalized linear model.
  • The Poisson model for count data.
  • The problem of overdispersion.
SESSION 3: Analysing longitudinal data using R.
  • Examples of longitudinal data.
  • Simple graphics for longitudinal data and simple inference using the summary measure approach.
  • The 'long form' of longitudinal data.
  • Models for longitudinal data when independence of the repeated measurements is assumed.
  • Mixed-effects models for longitudinal data.
SESSION 4:
  • Modeling the correlational structure of the repeated measurements.
  • The generalized estimating equation approach for non-normal response variables in longitudinal data.
  • The dropout problem.

The Instructor:

Dr. Sudha Purohit is visiting Lecturer in Statistics at the University of Pune and, before her retirement in 2000, was Head of the Department of Statistics at A. G. College, Pune, India. She is a co-author of three books, Life-Time Data: Statistical Models and Methods, Introduction to Biometry, and (with Dr. Shailaja Deshmukh) Microarray Data: Statistical Analysis Using R. She is a coauthor (jointly with Prof.Shailaja Deshmukh and Dr. Sharad Gore) of Statistics Using R. Her areas of interest are survival analysis, reliability, programming with R and analysis of microarray data. She has published a number of research papers in various peer-reviewed journals.

Organization of the Course:

The course takes place over the internet, at statistics.com. During each course week, you participate at times of your own choosing - there are no set times when you must be online. Course participants will be given access to a private discussion board. In class discussions led by the instructor, you can post questions, seek clarification, and interact with your fellow students and the instructor. The course is scheduled to take place over 4 weeks, and typically requires 15 hours per week. At the beginning of each week, you receive the relevant material, in addition to answers to exercises from the previous session. During the week, you are expected to go over the course materials and work through exercises. Discussion among participants is encouraged. The instructor will provide answers and comments.

Certificates and Grades:

You may be interested only in learning the material presented, and not be concerned with grades or certificates. Or you may be enrolled in a statistics.com Program in Advanced Statistical Studies that requires demonstration of proficiency in the subject, in which case your work will be assessed for purposes of issuing a grade. Or you may require only a "Certificate of Course Completion," along with professional development credit in the form of Continuing Education Units (CEU's). As you begin the class, you will be asked to specify your category.

Credit:

This course offers continuing education units (CEU's). For those successfully completing the course (generally this means marks of 50% or better on the homework), 5.0 CEU's and a certificate will be issued by statistics.com, upon request.

Dates:

Aug. 20 - Sep. 17, 2010
Click here to be notified of future course offerings.

Participants gain access to the online materials on the first day of the course, and typically spend about 15 hours per week (at their convenience). You retain full access to course materials, including discussion board, for two weeks after the course closing date.

Level:

Advanced/Intermediate

Prerequisite:

The equivalent of Introduction to R - Statistical Analysis.

Note that some coursework in statistical modeling, even if is only regression, will help you get more out of this course, or alternatively, will deepen the knowledge you gain from this course. Statistics.com has a variety of courses in modeling. See the section listed here.

Course Text:

Course materials will be provided by the instructor.

Software:

Students must have access to R. For information on obtaining a copy of R, please Click Here.

Registration:

Register Online - $499
Register Online (academic) - $399 (you must be affiliated with a college, university or high school)

Add $50 service fee if you require a prior invoice, or if you need to submit a purchase order or voucher, pay by wire transfer or EFT, or refund and reprocess a prior payment. Please use this printed registration form, for these and other special orders.

Consider registering for this course together with two other R courses or Modeling courses as part of our special 3 course package registration for tuition savings.

Note: Courses may fill up at any time and registrations are processed in the order in which they are received. When a course is marked "full" above your registration will be applied to the next available course date.