# Logistic Regression

Instructor(s):

Dates:

• September 04, 2015 to October 02, 2015
• March 11, 2016 to April 08, 2016
• September 02, 2016 to September 30, 2016

# Logistic Regression, taught by Joseph Hilbe

Aim of Course:

Logistic regression is one of the most commonly used statistical techniques. It is used with data in which there is a binary (success-failure) outcome (response) variable, or where the outcome takes the form of a binomial proportion. As in linear regression, one estimates the relationship between predictor variables and an outcome variable. In logistic regression, however, one estimates the probability that the outcome variable assumes a certain value, rather than estimating the value itself. This online course will cover the functional form of the logistic model and how to interpret model coefficients. The concepts of "odds" and "odds ratio" are examined, as well as "risk ratio" and the difference between the two statistics. Our emphasis is on model construction, interpretation, and goodness of fit. Exercises include hands-on computer problems.
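As a rough illustration of the ideas named above (not taken from the course materials), the following Python sketch computes an odds ratio and a risk ratio from a 2x2 table and shows how a logistic model with one binary predictor maps its coefficients back to probabilities. The counts and the function names are hypothetical, chosen only for illustration.

```python
import math


def logistic(eta):
    """Inverse logit: map a linear predictor to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-eta))


def odds_ratio(a, b, c, d):
    """Odds ratio for a 2x2 table.

    a = exposed with outcome,   b = exposed without outcome,
    c = unexposed with outcome, d = unexposed without outcome.
    """
    return (a * d) / (b * c)


def risk_ratio(a, b, c, d):
    """Risk ratio for the same 2x2 layout: ratio of outcome probabilities."""
    return (a / (a + b)) / (c / (c + d))


# Hypothetical counts, for illustration only.
a, b, c, d = 30, 70, 10, 90

or_ = odds_ratio(a, b, c, d)
rr = risk_ratio(a, b, c, d)

# For a logistic model with a single binary predictor, the intercept is the
# log-odds of the outcome in the unexposed group and the slope is log(OR).
beta0 = math.log(c / d)
beta1 = math.log(or_)

print(f"odds ratio = {or_:.3f}")
print(f"risk ratio = {rr:.3f}")
print(f"P(outcome | unexposed) = {logistic(beta0):.3f}")
print(f"P(outcome | exposed)   = {logistic(beta0 + beta1):.3f}")
```

With these hypothetical counts the odds ratio is about 3.86 while the risk ratio is 3.0, which illustrates that the two statistics diverge when the outcome is not rare.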

Course Program:

## WEEK 1: Basic Terminology and Concepts

• Software for modeling logistic regression: Stata, R, SAS, SPSS, other
• History of the logistic model
• Concepts related to logistic regression
• 2x2, 2xn models of odds and risk ratios
• Fitting algorithms

## WEEK 2: Logistic Model Construction

• Derivation of the binary logistic model
• Model-building strategies
• Link tests, partial residual plots
• Standard errors: scaling, bootstrap, jackknife, robust
• Interpreting odds ratios as risk ratios - criteria
• Stepwise methods, missing values, constrained coefficients, etc.
• Construction and interpretation of interactions

## WEEK 3: Analysis, Fit, and Interpretation of the Logistic Model

• Goodness of fit tests
• Information criterion tests
• Residual analysis
• Validation models

## WEEK 4: Binomial Logistic Regression and Overdispersion

• The meaning and types of overdispersion
• Simulations: detecting apparent vs real overdispersion
• Methods of handling real overdispersion

HOMEWORK:

Homework in this course consists of short-answer questions to test concepts, guided data analysis problems using software, guided data modeling problems using software, and an end-of-course data modeling project.

In addition to assigned readings, this course also has example software code and supplemental readings available online.

Course Fee: \$629

Tuition Savings: When you register online for 3 or more courses, \$200 is automatically deducted from the total tuition. (This offer cannot be combined and is only applicable to courses of 3 weeks or longer.)

Add a \$50 service fee if you require a prior invoice, or if you need to submit a purchase order or voucher, pay by wire transfer or EFT, or refund and reprocess a prior payment. Please use the printed registration form for these and other special orders.

Courses may fill up at any time and registrations are processed in the order in which they are received. Your registration will be confirmed for the first available course date, unless you specify otherwise.

Who Should Take This Course:

Medical researchers, epidemiologists, forensic statisticians, environmental scientists, actuaries, data miners, industrial statisticians, sports statisticians, and those working in fisheries, to name a few, will all find this course useful. It is an essential course for anyone who needs to model data with binary or categorical outcomes, and who needs to estimate probabilities of given outcomes based on predictor variables.

Level:

Intermediate

Prerequisite:
These are listed for your benefit so that you can determine for yourself whether you have the needed background, whether from taking the listed courses or from other experience.

If you are unclear as to whether you have mastered the above requirements, try these placement tests.

Organization of the Course:

This course takes place online at the Institute for 4 weeks. During each course week, you participate at times of your own choosing - there are no set times when you must be online. Course participants will be given access to a private discussion board. In class discussions led by the instructor, you can post questions, seek clarification, and interact with your fellow students and the instructor.

At the beginning of each week, you receive the relevant material, in addition to answers to exercises from the previous session. During the week, you are expected to go over the course materials, work through exercises, and submit answers. Discussion among participants is encouraged. The instructor will provide answers and comments, and at the end of the week, you will receive individual feedback on your homework answers.

Time Requirement: about 15 hours per week, at times of your choosing.

Credit:
Students come to the Institute for a variety of reasons. As you begin the course, you will be asked to specify your category:
1. You may be interested only in learning the material presented, and not be concerned with grades or a record of completion.
2. You may be enrolled in PASS (Programs in Analytics and Statistical Studies) that requires demonstration of proficiency in the subject, in which case your work will be assessed for a grade.
3. You may require a "Record of Course Completion," along with professional development credit in the form of Continuing Education Units (CEUs). For those successfully completing the course, 5.0 CEUs and a record of course completion will be issued by The Institute, upon request.

Course Text:

The course text is Logistic Regression Models by Joseph Hilbe, which you can order online from the publisher, CRC Press.

Software:

Course participants may use any software that is capable of doing logistic regression. The instructor is most familiar with Stata, and the methods covered in this course will primarily be illustrated in Stata. Nearly all Stata commands, however, have corresponding R code at the end of each chapter. Click Here for information on obtaining a free (or nominal cost) copy of various software packages for use during the course.

Stata: The instructor is familiar with Stata and the illustrations and assignments are fully integrated with Stata. If you are undecided about which software to use, Stata, which is relatively easy to learn and use, is a safe choice.

R: R-language solutions to assignments will be provided in this course, and R code is provided at the end of the chapters in the text, duplicating nearly all Stata examples used in the text. R code and tutorial help from the instructor or TA will be available but limited. If you want to use R with this course, you should have some prior experience and facility with it. If you wish to use R but do not have current expertise in it, you should consider taking one of our introductory R courses before taking this one.

SAS: The instructor and TA can offer limited assistance with SAS in this course. If you want to use SAS with this course, you should have some prior experience and facility with it. If you wish to use SAS, but do not have current expertise in it, you should consider taking an introductory course or courses from SAS Institute or elsewhere.

SPSS: The instructor can offer limited assistance with SPSS, but there is no TA support. While SPSS is easier to use than R or SAS for the purposes of this course, we nonetheless recommend that if you want to use SPSS with this course, you should have some prior experience and facility with it. If you wish to use SPSS but do not have current expertise in it, you should consider taking an introductory course or courses from SPSS.
