logo.gif The source for online courses
in statistics
 ÖÐÎÄ Course Login
Home > Credit > Programs >
PROGRAM IN ADVANCED STATISTICAL STUDIES

Data Mining

Program Registration Fee: $495, plus individual course enrollment fees.

About this Program: This program is aimed at analysts in a variety of fields who need to develop useful information from large datasets. It includes prediction and classification (will a prospective customer purchase, will a loan default, will a customer abandon a subscription service, etc.), clustering (used in customer segmentation), text mining (making classifications or predictions on the basis of automated review of documents), and more.

To Apply for this Program: click here

Prerequisite: You must have taken the equivalent of statistics.com's courses Introduction to Statistics 1, 2, and 3.

Estimated Core Course Tuition: $1995
Estimated Electives Course Tuition: $1532
Program Registration Fee: $495
Estimated Total Cost: $4022

The individual courses are payable as you register for each course or may be prepaid. Tuition and fees do not include the purchase of required course texts.


Core Courses (All required)

Introduction to Data Mining
This course covers the two core paradigms that account for most business applications of data mining: classification and prediction. The course includes hands-on work with XLMiner, a data-mining add-in for Excel.
Tuition: $399 (5 CEUs)
Next dates available:
- September 10, 2010

Logistic Regression
Logistic regression extends ordinary least squares (OLS) methods to model data with binary (yes/no, success/failure) outcomes. Rather than directly estimating the value of the outcome, logistic regression allows you to estimate the probability of a success or failure.
Tuition: $399 (5 CEUs)
Next dates available:
- September 10, 2010

Regression Analysis
In this course you will learn how multiple linear regression models are derived, use software to implement them, learn what assumptions underlie the models, learn how to test whether your data meet those assumptions and what can be done when those assumptions are not met, and develop strategies for building and understanding useful models.
Tuition: $399 (5 CEUs)
Next dates available:
- October 08, 2010

Data Mining: Unsupervised Techniques
This course covers key unsupervised learning techniques - association rules, principal components analysis, and clustering. The course will include an integration of supervised and unsupervised learning techniques.
Tuition: $399 (5 CEUs)
Next dates available:
- October 15, 2010

Interactive Data Visualization
This course covers the principles of the visual display of data, both for presentation and analysis.
Tuition: $399 (5 CEUs)
Next dates available:
- October 29, 2010

Elective Courses (4 required)

Natural Language Processing
This course is designed to give you an introduction to the algorithms, techniques and software used in natural language processing (NLP).
Tuition: $399 (5 CEUs)
Next dates available:
- September 03, 2010

Bootstrap Methods
This course covers the basic theory and application of the bootstrap family of procedures, with the emphasis on applications.
Tuition: $399 (5 CEUs)
Next dates available:
- September 24, 2010

Introduction to Quantitative Risk Analysis
This course will cover the most important principles, techniques and tools used in modeling in Quantitative Risk Analysis.
Tuition: $399 (5 CEUs)
Next dates available:
- October 01, 2010

Advanced Logistic Regression
After taking this course, participants will be able to specify, implement and interpret the output of a variety of advanced logistic regression models not covered in the first course, "Logistic Regression."
Tuition: $399 (5 CEUs)
Next dates available:
- October 15, 2010

Statistical Analysis of Microarray Data with R
This course will acquaint you with the process of microarray data mining from beginning to end. You will learn how to how to preprocess the data, estimate gene expression patterns, cluster genes to detect interesting gene expression patterns, and classify experiments (subjects) based on gene expression patterns. Illustrations of the statistical issues involved at the various stages of the analysis will use real data sets from DNA microarray experiments.
Tuition: $399 (6.25 CEUs)
Next dates available:
- October 22, 2010

Cluster Analysis
This course will teach you how to use various cluster analysis methods to identify possible clusters in multivariate data. Methods discussed include hierarchical clustering, k-means clustering, two-step clustering, and normal mixture models for continuous variables.
Tuition: $399 (5 CEUs)
Next dates available:
- November 05, 2010

Spatial Statistics With Geographic Information Systems
Spatial statistical analysis uses methods adapted from conventional statistics to address problems in which spatial location is the most important explanatory variable. This course will explain and give examples of the analysis that can be conducted in a geographic information system such as ArcGIS or Mapinfo.
Tuition: $399 (5 CEUs)
Next dates available:
- November 12, 2010

Missing Data Analysis
This course will cover the theory and practice of two modern methods of handling missing data: maximum likelihood and multiple imputation.
Tuition: $399 (5 CEUs)
Next dates available:
- November 19, 2010

Introduction to Support Vector Machines in R
This course begins by developing basic concepts such as hyperplanes, features spaces and kernels; then it covers the development of support vector machines (SVM's).
Tuition: $399 (5 CEUs)
Next dates available:
- November 19, 2010

Matrix Algebra Review
This course will provide the basics of vector and matrix algebra and operations necessary to understand multivariate statistical methods, including the notions of the matrix inverse, generalized inverse and eigenvalues and eigenvectors. After successfully completing this course, you will be able to use and understand vector and matrix operations and equations, find and use a matrix inverse, and use and understand the eigenset of a symmetric matrix.
Tuition: $299 (3.75 CEUs)
Next dates available:
- December 10, 2010

Decision Trees and Rule-Based Segmentation
Rule induction is an important component of data mining, and this course covers two main styles of generating rules.
Tuition: $399 (5 CEUs)
Next dates available:
- January 21, 2011

Introduction to Resampling Methods
The course introduces the basic concepts and methods of resampling methods, including bootstrap procedures and permutation (randomization) tests, with little or no complex theory or confusing notation.
Tuition: $299 (3.75 CEUs)
Next dates available:
- February 25, 2011

Text Mining
This course will introduce the essential techniques of text mining, understood here as the extension of data mining's standard predictive methods to unstructured text.
Tuition: $399 (5 CEUs)
Next dates available:
- June 17, 2011