Missing Data

Aim of Course:

Data sets often have missing values.  Missing data is a particular problem in multivariate modeling.  If the analyst must discard an entire record because the value for one variable is missing, valuable information is lost.  It is better to find a way to keep the record, adjust for the missing value(s), and let the analysis proceed.

This online course, "Missing Data," teaches the basics of handling missing data, including evaluation of types and patterns of missing data, strategies for analysis of data sets with item missing data, and imputation of missing data, with an emphasis on multiple imputation.  Example applications are presented using simple random sample and complex sample design data sets in SAS and Stata, with additional examples in other software packages such as IVEware.  Homework exercises for each class session, along with a final project, are included.

Course Program:

WEEK 1: Overview of Missing Data

  • Missing Data: An Overview 
    • Causes of Missing Data
    • Types of Missing Data
    • Missing Data Patterns
    • Issues for Analysis
    • Assumptions and Implications for Analysis and Imputation
    • Analytic Approaches for Data Sets with Missing Data 
      • Conventional and Novel
      • Introduction to Multiple Imputation Process
    • Software Options
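
As a rough illustration of the "Missing Data Patterns" topic above, the sketch below tabulates which combinations of variables are missing across records and how many records would survive complete-case analysis. It is not taken from the course materials, which use SAS, Stata and IVEware; the function names, the sample records, and the convention that `None` marks a missing value are all assumptions made for this example.

```python
from collections import Counter


def missing_patterns(records, variables):
    """Count how often each missingness pattern occurs.

    A pattern is a tuple of booleans, one per variable, where True
    means the value is missing (assumed here to be None or absent).
    """
    return Counter(
        tuple(rec.get(var) is None for var in variables) for rec in records
    )


def complete_case_fraction(records, variables):
    """Fraction of records with no missing value on any listed variable."""
    if not records:
        raise ValueError("records must not be empty")
    patterns = missing_patterns(records, variables)
    all_observed = tuple(False for _ in variables)
    return patterns[all_observed] / len(records)


# Hypothetical example data: two variables, four records.
records = [
    {"age": 30, "income": 50},
    {"age": None, "income": 40},
    {"age": 25, "income": None},
    {"age": 41, "income": 62},
]
patterns = missing_patterns(records, ["age", "income"])
kept = complete_case_fraction(records, ["age", "income"])
```

In this made-up data only half of the records are complete on both variables, which is the kind of information loss that motivates imputation rather than discarding records.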

WEEK 2: Overview of the Multiple Imputation Process

  • Overview of Multiple Imputation and the Three Step Process of MI
  • Details of Three Step Process 
    • Imputation/Variance Estimation for MI
    • Analysis of Imputed Data Sets
    • Combining Results from Imputation and Analysis of Imputed Data Sets
  • Selection of an Imputation Method
  • Overview of Combined Results
  • Application: Multiple Imputation of Continuous and Categorical Variables 
    • What method to use for imputation?
    • How to specify method in common software?
    • Discussion of Output from the Multiple Imputation Process
      • Key output from imputation, analysis of imputed data sets, and combined results
    • How is the variability of the MI process incorporated into the output?
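
The third step of the process outlined above, combining results across imputed data sets, is commonly done with the pooling rules attributed to Rubin. The sketch below shows one way those rules can be written; it assumes each of the m imputed data sets has already been analyzed to give a point estimate and its squared standard error. The function name `pool_estimates` and the numbers in the example are hypothetical, and the course itself works in SAS and Stata rather than Python.

```python
from statistics import mean, variance


def pool_estimates(estimates, variances):
    """Combine per-imputation results into one estimate and total variance.

    estimates: point estimates, one per imputed data set
    variances: squared standard errors, one per imputed data set
    Returns (pooled_estimate, total_variance, within, between).
    """
    m = len(estimates)
    if m < 2 or m != len(variances):
        raise ValueError("need at least two imputations with matching lengths")

    pooled = mean(estimates)       # average of the m point estimates
    within = mean(variances)       # average within-imputation variance
    between = variance(estimates)  # sample variance of estimates (divisor m - 1)

    # The between-imputation term carries the extra uncertainty due to
    # the missing values; (1 + 1/m) adjusts for using a finite m.
    total = within + (1 + 1 / m) * between
    return pooled, total, within, between


# Hypothetical results from m = 3 imputed data sets.
pooled, total, within, between = pool_estimates(
    [1.0, 2.0, 3.0], [0.5, 0.5, 0.5]
)
```

The between-imputation component is how the variability of the imputation process enters the final standard error: if every imputed data set gave the same estimate, the total variance would reduce to the within-imputation average.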

WEEK 3: Detailed Multiple Imputation Examples

  • Imputation of Continuous Variables (Each example demonstrates the full three step MI Process) 
    • Consideration of imputation methods for continuous variables
    • Inclusion of categorical predictor variables
    • Application: continuous variable imputation with NCS-R data set  
      • How to correctly analyze data set derived from complex sample design
      • Discussion of output from each of the three steps
      • Diagnostic tools
  • Imputation of Categorical Variables  (Each example demonstrates the full three step MI Process)
    • Consideration of imputation methods for categorical variables
    • Inclusion of continuous and categorical predictor variables
    • Application: categorical variable imputation with NCS-R data set  
      • How to correctly analyze data set derived from complex sample design
      • Discussion of output from each of the three steps
      • Diagnostic tools

WEEK 4: Detailed Multiple Imputation Examples and Additional Topics in Missing Data

  • Detailed MI Examples, continued  
    • Imputation of Categorical and Continuous Variables with non-monotone missing data pattern
      • Application: Option 1: Use of sequential regression in Stata and SAS, comparison of results
      • Application: Option 2: Create monotone missing data pattern with subsequent use of appropriate imputation method given monotone missing data, compare how this might change selection of imputation method
    • Additional Topics in Missing Data 
      • Frequently Asked Questions about Missing Data and Imputation
      • Handling missing data in longitudinal data sets
      • Non-ignorable missing data 
        • What is non-ignorable missing data?
        • How to handle missing data that is non-ignorable?
      • Staying current in research and software developments in handling missing data
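
The Week 4 applications distinguish monotone from non-monotone missing data patterns. As a minimal sketch of that distinction, the function below checks whether a data set is monotone for a given variable ordering, meaning that once a variable is missing for a record, every later variable in the ordering is missing for that record too. The function name, the ordering argument, and the use of `None` for missing values are assumptions made for this example, not part of the course software.

```python
def is_monotone(records, ordered_vars):
    """Return True if missingness is monotone under the given ordering.

    For each record, walk the variables in order; after the first
    missing value (assumed to be None or absent), any observed value
    breaks the monotone pattern.
    """
    for rec in records:
        seen_missing = False
        for var in ordered_vars:
            missing = rec.get(var) is None
            if seen_missing and not missing:
                return False
            seen_missing = seen_missing or missing
    return True


# Hypothetical records: dropout-style missingness is monotone ...
monotone_data = [
    {"a": 1, "b": 2, "c": 3},
    {"a": 1, "b": 2, "c": None},
    {"a": 1, "b": None, "c": None},
]
# ... while a gap followed by observed values is not.
non_monotone_data = [
    {"a": None, "b": 2, "c": 3},
]
monotone_ok = is_monotone(monotone_data, ["a", "b", "c"])
non_monotone_ok = is_monotone(non_monotone_data, ["a", "b", "c"])
```

Note that the result depends on the ordering supplied: a data set that fails under one ordering of the variables may pass under another.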


The homework in this course consists of short answer questions to test concepts and guided data analysis problems using software.

This course also has supplemental readings available online.

Who Should Take This Course:
Any statistical analyst will benefit from this course.

You should also be confident with topics in regression, as covered in our course

Course Text:
Missing Data by Paul D. Allison, University of Pennsylvania © 2001, 104 pages, SAGE Publications, Inc, Series: Quantitative Applications in the Social Sciences, Volume 136.
Homework assignments will involve the use of standard statistical software. SAS, Stata, R and IVEware can be used for all the assignments, and other software packages such as Solas, Sudaan and SPSS are suitable for most but not all assignments. The instructor is familiar with SAS, Stata and IVEware and can answer questions about those packages. The teaching assistant can provide support for R.


To be scheduled.

Course Fee: $589

We have flexible policies to transfer to another course, or withdraw if necessary (modest fee applies)

First-time students and academics may qualify for an introductory offer on select courses. Those with an academic affiliation may be eligible for a discount at checkout.

This course may be scheduled on a contract basis.  Please contact ourcourses@statistics.com to arrange.

Add a $50 service fee if you require a prior invoice, need to submit a purchase order or voucher, pay by wire transfer or EFT, or need to refund and reprocess a prior payment. Please use the printed registration form for these and other special orders.

Courses may fill up at any time and registrations are processed in the order in which they are received. Your registration will be confirmed for the first available course date, unless you specify otherwise.

The Institute for Statistics Education is certified to operate by the State Council of Higher Education in Virginia (SCHEV).

Contact Us
Have a question about a course before you register? Call us. We're here for you. (571) 281-8817 or ourcourses (at) statistics.com
