Flexible, affordable statistics education.

Designed to help you master the software you need to enhance your skills and the practical experience you need to get ahead.

Modeling in R

taught by Sudha Purohit


Brief Description:

This course will show you how to use R to create statistical models and use them to analyze data.

Instructor(s):
Level: Advanced/Intermediate

Who Should Take This Course:

Anyone who is familiar with R and wants to learn how to use it to build and use statistical models.

Important:   the course will cover a variety of techniques and at different levels, to meet the needs of different groups of users.  Those with minimal-to-moderate statistics preparation will want to spend time on the more extensive presentation of linear regression, and not attempt to complete all the more advanced segments on other methods.  Those with more experience in statistics may not require as much time in the early stages, but will be better able to work with the more advanced segments.  The goal is to provide guidance in using R to implement various modeling procedures, and not to provide a conceptual development of the statistical methods.  Most of the modeling techniques described here have their own statistics.com courses.   If you take this course first, you will probably not gain a full understanding of the more advanced techniques, but you will be better positioned, software-wise, to implement them when and if you take those courses.  If you take the other courses first, you will have a better understanding of the concepts behind the techniques before tackling them in R, but will be less prepared software-wise when you take the conceptual courses.  Either approach will work, but each has its own costs and benefits.

Dates:
August 31, 2012 to September 28, 2012
modelingr Click here to be reminded of future sessions of this course.

Modeling in R

taught by Sudha Purohit

Enter your email address and submit:
ajax loader

Thank you for your submission.


Registration:
Please read the syllabus tab, noting the prerequisites, text and software requirements.

Register Online -$499
Register Online -$399 (you must be affiliated with a college, university or high school)

Add $50 service fee if you require a prior invoice, or if you need to submit a purchase order or voucher, pay by wire transfer or EFT, or refund and reprocess a prior payment. Please use this printed registration form, for these and other special orders.

Courses may fill up at any time and registrations are processed in the order in which they are received. Your registration will be confirmed for the first available course date, unless you specify otherwise. Multiple course registrations may be entitled to tuition discounts; read more.


Share This : facebook LinkedIn twitter

Modeling in R

taught by Sudha Purohit



Aim of Course:

In this course you will learn how to use R to build statistical models and use them to analyze data. Multiple regression is covered first followed by logistic regression. The generalized linear model is then introduced and shown to include multiple regression and logistic regression as special cases. The Poisson model for count data will be introduced and the concept of overdispersion described. You will then learn how to analyse longitudinal data, first using relatively straightforward graphics and simple inferential approaches. This will be followed by describing mixed-effects models and the generalized estimating approach for such data. The emphasis in the course is how to use R to fit the models listed and how to interpret the R output, rather than the theoretical background of the models. Consequently some knowledge of linear models is required (statistics.com has courses in all of them).

See also the "Important" note in "Who Should Take This Course."

Prerequisite(s):

While we do not require additional specific courses as prerequisites, some familiarity with statistical modeling is needed. Statistics.com has a variety of courses in modeling. See also the "Important" note in "Who Should Take This Course" (in the "Dates" tab).


Course Program:

SESSION 1: Linear Regression, Logistic Regression

  • Multiple linear regression with R
  • Simple examples, dummy explanatory variables, interpreting regression coefficients; finding a parsimonious model

SESSION 2: The Generalized Model With R

  • Logistic regression with R
  • The need for a different model when the response variable is binary, the logistic transform and fitting the model to some simple examples, deviance residuals
  • Multiple regression and logistic regression as special cases of the generalized linear model
  • The Poisson model for count data.
  • The problem of overdispersion


SESSION 3: Analysing Longitudinal Data Using R

  • Examples of longitudinal data
  • Simple graphics for longitudinal data and simple inference using the summary measure approach
  • The 'long form' of longitudinal data
  • Models for longitudinal data when independence of the repeated measurements is assumed
  • Mixed-effects models for longitudinal data

SESSION 4:  Generalized Estimating Equations

  • Modeling the correlational structure of the repeated measurements
  • The generalized estimating equation approach for non-normal response variables in longitudinal data
  • The dropout problem


HOMEWORK:

Homework in this course consists of guided data analysis problems using software and guided data modeling problems using software.

Organization of the Course:

This course takes place over the internet at the Institute for 4 weeks. During each course week, you participate at times of your own choosing - there are no set times when you must be online. Course participants will be given access to a private discussion board. In class discussions led by the instructor, you can post questions, seek clarification, and interact with your fellow students and the instructor.

The course typically requires 15 hours per week. At the beginning of each week, you receive the relevant material, in addition to answers to exercises from the previous session. During the week, you are expected to go over the course materials, work through exercises, and submit answers. Discussion among participants is encouraged. The instructor will provide answers and comments, and at the end of the week, you will receive individual feedback on your homework answers.


Credit:
Students come to the Institute for a variety of reasons. As you begin the course, you will be asked to specify your category:
  1. You may be interested only in learning the material presented, and not be concerned with grades or a record of completion.
  2. You may be enrolled in PASS (Programs in Analytics and Statistical Studies) that requires demonstration of proficiency in the subject, in which case your work will be assessed for a grade.
  3. You may require a "Record of Course Completion," along with professional development credit in the form of Continuing Education Units (CEU's).  For those successfully completing the course, 5.0 CEU's and a record of course completion will be issued by The Institute, upon request.

Course Text:

Course materials will be provided by the instructor.

Software:

Students must have access to R. For information on obtaining a copy of R, please click here.

Register Now

Yes, I want to register for:

Modeling in R

taught by Sudha Purohit



Instructor(s):
Dates:
August 31, 2012 to September 28, 2012
Course Fee: $499
Academic Rate: $399

Before registering, please read the syllabus tab, noting the prerequisites, text and software requirements. When you click the register button, you will be taken to our secure transaction page.

I am affiliated with an academic institution
I am not affiliated with an academic institution


Want to be notified of future course offering?


Enter your email address here:

What our students say:

"I really enjoyed this course and like the instructor. The discussion board provides a valuable venue to discuss questions and clarify doubts. The instructor's feedback is prompt and helpful. I not only got my questions answered but also learned a lot from other's questions."
R. Yang
Purdue University
"As a SAS programmer , I believe I will continue to use SAS for data manipulation but will utilize R when running models and exploring large data sets."
J. Kezik
Yale University
"Over the last two summers, I've taken the two stats courses meant to prepare future AP Stats teachers. Those courses were invaluable to me and gave me the confidence I needed to tackle a difficult subject."
K. Behrend
© Statistics.com 2004-2012