Modeling in R

taught by Institute Staff

Aim of Course:

In this online course, “Modeling in R,” you will learn how to use R to build statistical models and use them to analyze data. Multiple regression is covered first, followed by logistic regression. The generalized linear model is then introduced and shown to include multiple regression and logistic regression as special cases. The Poisson model for count data is introduced next, along with the concept of overdispersion. You will then learn how to analyze longitudinal data, first using relatively straightforward graphics and simple inferential approaches, and then using mixed-effects models and the generalized estimating equation approach. The emphasis in the course is on how to use R to fit the models listed and how to interpret the R output, rather than on the theoretical background of the models. Consequently, some knowledge of linear models is required (the Institute offers courses on all of these models).

See also the important note on the course Requirements tab.

Course Program:

WEEK 1: Linear Regression, Logistic Regression

  • Multiple linear regression with R
  • Simple examples, dummy explanatory variables, interpreting regression coefficients; finding a parsimonious model
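
As a rough illustration of the Week 1 topics, the sketch below fits a multiple regression with one dummy explanatory variable and then looks for a more parsimonious model. It is not taken from the course materials: the built-in `mtcars` data set, the variable choices, and the use of AIC-based backward selection via `step()` are all assumptions made for the example.

```r
# Illustrative only: mtcars ships with base R and is not course material.
# Recode the 0/1 transmission indicator as a factor so R creates a dummy variable.
cars <- transform(mtcars, am = factor(am, labels = c("automatic", "manual")))

# Multiple regression with two numeric predictors and one dummy predictor.
full_fit <- lm(mpg ~ wt + hp + am, data = cars)

# Each coefficient is the expected change in mpg for a one-unit change in that
# predictor with the others held fixed; "ammanual" compares manual to automatic.
print(coef(summary(full_fit)))

# One possible route to a parsimonious model: backward selection by AIC.
small_fit <- step(full_fit, direction = "backward", trace = 0)
print(formula(small_fit))
```

Backward selection is only one of several ways to simplify a model; the course may well use a different strategy.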

WEEK 2: Generalized Linear Models With R

  • Logistic regression with R
  • The need for a different model when the response variable is binary, the logistic transform and fitting the model to some simple examples, deviance residuals
  • Multiple regression and logistic regression as special cases of the generalized linear model
  • The Poisson model for count data
  • The problem of overdispersion
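
The Week 2 topics can be sketched with base R's `glm()`. The example below is an illustration under assumptions, not course material: the built-in `mtcars` and `warpbreaks` data sets, the chosen predictors, and the Pearson-based dispersion estimate are picked only to show the mechanics.

```r
# Illustrative only: mtcars and warpbreaks ship with base R.

# Logistic regression for a binary response (am: 0 = automatic, 1 = manual).
logit_fit <- glm(am ~ wt, family = binomial(link = "logit"), data = mtcars)
dev_res <- residuals(logit_fit, type = "deviance")  # deviance residuals

# Multiple regression as a special case of the GLM: a gaussian family with
# identity link reproduces the lm() coefficients.
gauss_fit <- glm(mpg ~ wt + hp, family = gaussian, data = mtcars)
lm_fit <- lm(mpg ~ wt + hp, data = mtcars)
print(all.equal(coef(gauss_fit), coef(lm_fit), tolerance = 1e-6))

# Poisson model for count data (number of warp breaks per loom).
pois_fit <- glm(breaks ~ wool + tension, family = poisson, data = warpbreaks)

# Rough overdispersion check: Pearson chi-square divided by residual degrees
# of freedom. Values well above 1 suggest overdispersion.
dispersion <- sum(residuals(pois_fit, type = "pearson")^2) / df.residual(pois_fit)
print(dispersion)

# One common response: a quasi-Poisson fit, which rescales the standard errors.
quasi_fit <- glm(breaks ~ wool + tension, family = quasipoisson, data = warpbreaks)
print(coef(summary(quasi_fit)))
```

The quasi-Poisson fit leaves the coefficient estimates unchanged and only widens the standard errors; other remedies, such as a negative binomial model, need packages outside base R.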

WEEK 3: Analyzing Longitudinal Data Using R

  • Examples of longitudinal data
  • Simple graphics for longitudinal data and simple inference using the summary measure approach
  • The 'long form' of longitudinal data
  • Mixed-effects models for longitudinal data
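
The Week 3 workflow might look roughly like the sketch below: reshape wide data into the long form, apply a summary measure analysis, then fit a random-intercept mixed-effects model. Everything here is an assumption for illustration: the data are simulated (not real and not from the course), the variable names are hypothetical, and `nlme::lme()` is only one of several R functions that fit such models.

```r
library(nlme)  # recommended package distributed with R

# Simulated wide-format data: one row per subject, one column per time point.
set.seed(1)
n <- 30
subject_effect <- rnorm(n, sd = 2)  # hypothetical subject-level variation
wide <- data.frame(id = factor(seq_len(n)),
                   group = factor(rep(c("control", "treated"), each = n / 2)))
for (t in 0:3) {
  wide[[paste0("y.", t)]] <- 10 + 1.5 * t + subject_effect + rnorm(n)
}

# The 'long form': one row per subject per time point.
long <- reshape(wide, direction = "long",
                varying = paste0("y.", 0:3), v.names = "y",
                timevar = "time", times = 0:3, idvar = "id")

# Summary measure approach: reduce each subject to a single number (here the
# mean response) and compare groups with a simple two-sample test.
subj_mean <- aggregate(y ~ id + group, data = long, FUN = mean)
print(t.test(y ~ group, data = subj_mean))

# Mixed-effects model with a random intercept for each subject.
fit <- lme(y ~ time + group, random = ~ 1 | id, data = long)
print(fixef(fit))
```

The random intercept lets each subject have an individual baseline level, which accounts for the correlation among repeated measurements on the same subject.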

WEEK 4: Generalized Estimating Equations

  • Modeling the correlational structure of the repeated measurements
  • The generalized estimating equation approach for non-normal response variables in longitudinal data
  • The dropout problem


Homework in this course consists of guided data analysis problems using software and guided data modeling problems using software.

In addition to assigned readings, this course also has practice exercises, and the instructor's expert write-ups on important concepts.

Who Should Take This Course:

Anyone who is familiar with R and wants to learn how to use it to build and use statistical models.

IMPORTANT: The course covers a variety of techniques at different levels, to meet the needs of different groups of users. Those with minimal-to-moderate statistics preparation will want to spend time on the more extensive presentation of linear regression, and should not attempt to complete all of the more advanced segments on other methods. Those with more experience in statistics may not need as much time in the early stages, and will be better able to work with the more advanced segments. The goal is to provide guidance in using R to implement various modeling procedures, not to provide conceptual development of the statistical methods. Most of the modeling techniques described here are covered in separate courses at the Institute. If you take this course first, you will probably not gain a full understanding of the more advanced techniques, but you will be better positioned, software-wise, to implement them when and if you take those courses. If you take the other courses first, you will have a better understanding of the concepts behind the techniques before tackling them in R, but will be less prepared software-wise when you take the conceptual courses. Either approach will work, but each has its own costs and benefits.

While we do not require additional specific courses as prerequisites, some familiarity with statistical modeling is needed. The Institute has a variety of courses in modeling. See also the "Important" note in "Who Should Take This Course" above.
Course Text:
Course materials will be provided by the instructor.
Students must have access to R, which can be downloaded free of charge from the Comprehensive R Archive Network (CRAN).


June 21, 2019 to July 19, 2019
June 19, 2020 to July 17, 2020

Course Fee: $589

We have flexible policies for transferring to another course or withdrawing if necessary (a modest fee applies).

First-time students and academics may qualify for an introductory offer on select courses. Those with an academic affiliation may be eligible for a discount at checkout.

Add a $50 service fee if you require a prior invoice, need to submit a purchase order or voucher, pay by wire transfer or EFT, or need a prior payment refunded and reprocessed. Please use the printed registration form for these and other special orders.

Courses may fill up at any time and registrations are processed in the order in which they are received. Your registration will be confirmed for the first available course date, unless you specify otherwise.

The Institute for Statistics Education is certified to operate by the State Council of Higher Education in Virginia (SCHEV).

Contact Us
Have a question about a course before you register? Call us. We're here for you. (571) 281-8817 or ourcourses (at)
