Flexible, affordable statistics education.

Designed to help you master the software you need to enhance your skills and the practical experience you need to get ahead.

Data Mining in R - Learning with Case Studies

taught by Luis Torgo


Brief Description:

The main goal of this course is to teach users how to perform data mining tasks using R.

Instructor(s):
Level: Intermediate

Who Should Take This Course:

R users who want to learn how to apply R to data mining.  Data mining analysts in search of new tools.  Students in statistics.com's PASS program in Data Mining seeking an affordable data mining tool.  Note that working in R will be more involved than using a specially designed interface for data mining, such as those found in major commercial data mining programs.

Dates:
June 29, 2012 to July 27, 2012
datamining-r Click here to be reminded of future sessions of this course.

Data Mining in R - Learning with Case Studies

taught by Luis Torgo

Enter your email address and submit:
ajax loader

Thank you for your submission.


Registration:
Please read the syllabus tab, noting the prerequisites, text and software requirements.

Register Online -$499
Register Online -$399 (you must be affiliated with a college, university or high school)

Add $50 service fee if you require a prior invoice, or if you need to submit a purchase order or voucher, pay by wire transfer or EFT, or refund and reprocess a prior payment. Please use this printed registration form, for these and other special orders.

Courses may fill up at any time and registrations are processed in the order in which they are received. Your registration will be confirmed for the first available course date, unless you specify otherwise. Multiple course registrations may be entitled to tuition discounts; read more.


Share This : facebook LinkedIn twitter

Data Mining in R - Learning with Case Studies

taught by Luis Torgo



Aim of Course:

The main goal of this course is to teach users how to perform data mining tasks using R. The course follows a learn by doing it strategy, where data mining topics are introduced as needed when addressing a series of real world data mining case studies.

This course is a core requirement or elective in the following Program(s) in Analytics and Statistical Studies (PASS):

Prerequisite(s):
  1. Knowledge of the R programming language - the equivalent of either Introduction to R - Data Handling or Introduction to R - Statistical Analysis.
  2. Introduction to Predictive Modeling, or equivalent data mining experience.

Course Program:

SESSION 1: Predicting Algae Blooms (Case Study 1)

  • Descriptive statistics
  • Data visualization
  • Strategies to handle unknown variable values
  • Regression tasks
  • Evaluation metrics for regression tasks

SESSION 2: Predicting Algae Blooms (Continuation of Case Study 1)

  • Multiple linear regression
  • Regression trees
  • Model selection/comparison through k-fold cross-validation

SESSION 3:  Detecting Fraudulent Transactions (Case Study 2)

  • Clustering methods
  • Classification methods
  • Imbalanced class distributions and methods for handling this type of
  • problems
  • Naive Bayes classifiers
  • Precision/recall and precision/recall curves

SESSION 4: Classifying Microarray Samples (Case Study 3)

  • Feature selection methods for problems with a very large number of
  • predictors
  • Random forests
  • k-Nearest neighbors


HOMEWORK:

Homework in this course consists of short answer questions to test concepts and guided data analysis problems using software.

Organization of the Course:

This course takes place over the internet at the Institute for 4 weeks. During each course week, you participate at times of your own choosing - there are no set times when you must be online. Course participants will be given access to a private discussion board. In class discussions led by the instructor, you can post questions, seek clarification, and interact with your fellow students and the instructor.

The course typically requires 15 hours per week. At the beginning of each week, you receive the relevant material, in addition to answers to exercises from the previous session. During the week, you are expected to go over the course materials, work through exercises, and submit answers. Discussion among participants is encouraged. The instructor will provide answers and comments, and at the end of the week, you will receive individual feedback on your homework answers.


Credit:
Students come to the Institute for a variety of reasons. As you begin the course, you will be asked to specify your category:
  1. You may be interested only in learning the material presented, and not be concerned with grades or a record of completion.
  2. You may be enrolled in PASS (Programs in Analytics and Statistical Studies) that requires demonstration of proficiency in the subject, in which case your work will be assessed for a grade.
  3. You may require a "Record of Course Completion," along with professional development credit in the form of Continuing Education Units (CEU's).  For those successfully completing the course, 5.0 CEU's and a record of course completion will be issued by The Institute, upon request.

Course Text:

The course text is Data Mining with R: Learning with Case Studies, by Luis Torgo, which you can order from CRC Press, or by using this form. CRC Press typically gives students a generous discount when students order the text using the above form (not by ordering the text online).

PLEASE ORDER YOUR COPY IN TIME FOR THE COURSE STARTING DATE.

Software:

You must have a copy of R for the course. Click here for information on obtaining a free copy. After installing R in your computer you must also install several R add-on packages. Instructions for this installation will be provided as needed.

Register Now

Yes, I want to register for:

Data Mining in R - Learning with Case Studies

taught by Luis Torgo



Instructor(s):
Dates:
June 29, 2012 to July 27, 2012
Course Fee: $499
Academic Rate: $399

Before registering, please read the syllabus tab, noting the prerequisites, text and software requirements. When you click the register button, you will be taken to our secure transaction page.

I am affiliated with an academic institution
I am not affiliated with an academic institution


Want to be notified of future course offering?


Enter your email address here:

What our students say:

“I took the course to get starting using R, thus I think this will help with my use of statistics in the future.   I really think these online courses are great."
P. Koefoed
University of Copenhagen
"I really enjoyed this course and like the instructor. The discussion board provides a valuable venue to discuss questions and clarify doubts. The instructor's feedback is prompt and helpful. I not only got my questions answered but also learned a lot from other's questions."
R. Yang
Purdue University
"The course was an interesting and delightful excursion into data mining techniques; I thoroughly enjoyed seeing the concepts come to life in the examples. It was a great course."
B. Griffin
University of South Dakota
"This course could serve as a model in the field."
G. Vidmar
Biostatistician, University of Ljubljana
© Statistics.com 2004-2012