Machine Learning with Weka

Aim of Course:

Weka is a powerful, open-source machine learning tool. Users can import data and train any of its many built-in algorithms to build classification or regression models. This class is a hands-on tutorial that teaches students how to use the Weka platform. We will cover the basics of machine learning, including how to choose the right algorithms for your data, and then learn how to format data and import it into Weka, how to build models, and how to analyze and interpret the results.

In this course, the focus is on learning the Weka tool, in contrast to other courses where the focus is on a more detailed study of the data mining methods.

Anticipated learning outcomes - you will be able to:

  • Navigate Weka, read in data and work with appropriate data formats
  • Use Weka to implement basic data mining algorithms
    • Trees
    • Rule Systems
    • Bayesian Networks
    • Neural Networks
  • Apply appropriate algorithms to classification and regression
  • Assess model results with appropriate metrics
  • Use Weka to classify documents



This course may be taken individually (one-off) or as part of a certificate program.

Course Program:

Week 1: Machine Learning and Weka Basics

  • What are Machine Learning and Weka?
    • Machine learning fundamentals
    • Core algorithm types
      • Trees
      • Rule Systems
      • Bayesian models
      • Neural Networks
    • Weka basics
      • Explorer/Experimenter
      • File types
      • Interface elements
  • Classification
    • What is classification? What are classes? 
    • How classification works, high level
    • Classifying data in Weka, including the major Weka features
  • Regression 
    • What is regression? 
    • Which algorithms will work for regression?
    • Running regressions in Weka

Week 2: Creating Datasets for Weka

  • Creating ARFF files
    • Formatting data for use in Weka
    • Data types
    • Class enumeration
  • Features and feature types
    • What are features?
    • What are the major feature types?
    • What features work with regression? How do we handle non-numeric features?
    • Filtering algorithms based on feature-type in Weka
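
As a rough illustration of the ARFF topics listed for Week 2, the sketch below writes a tiny dataset in ARFF layout: a `@relation` line, one `@attribute` line per feature (numeric, or nominal with its allowed values enumerated in braces), and a `@data` section of comma-separated rows. It uses Python only to generate the text; the course itself works in the Weka interface. The helper `to_arff` and the toy weather data are hypothetical and exist only for this example.

```python
def to_arff(relation, attributes, rows):
    """Render a dataset as ARFF text.

    attributes: list of (name, kind) pairs, where kind is either the
    string "numeric" or a list of allowed nominal values.
    rows: list of value lists, in the same order as attributes.
    """
    lines = ["@relation " + relation, ""]
    for name, kind in attributes:
        if isinstance(kind, list):
            # Nominal attribute: enumerate every allowed value in braces.
            lines.append("@attribute " + name + " {" + ",".join(kind) + "}")
        else:
            lines.append("@attribute " + name + " " + kind)
    lines += ["", "@data"]
    for row in rows:
        lines.append(",".join(str(value) for value in row))
    return "\n".join(lines) + "\n"


# Hypothetical toy data, made up for this sketch.
attributes = [
    ("temperature", "numeric"),
    ("outlook", ["sunny", "overcast", "rainy"]),
    ("play", ["yes", "no"]),  # class attribute, placed last by convention
]
rows = [
    [85, "sunny", "no"],
    [83, "overcast", "yes"],
    [70, "rainy", "yes"],
]

arff_text = to_arff("weather_toy", attributes, rows)
print(arff_text)
```

Saving the printed text to a file with an `.arff` extension gives a file in the layout Weka reads. The class attribute `play` is a nominal attribute whose values are the classes, which is what "class enumeration" refers to here.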

Week 3: Interpreting Results

  • Interpreting and Refining Results 
    • Accuracy
    • Precision, recall, F1 scores
    • Confusion Matrices
  • Class Balancing
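
To make the Week 3 metrics concrete, here is a minimal sketch that computes accuracy, precision, recall, and F1 from the four cells of a two-class confusion matrix. The function name `binary_metrics` and the counts are hypothetical; the counts were chosen to describe an imbalanced dataset in which the positive class is rare.

```python
def binary_metrics(tp, fp, fn, tn):
    """Compute common metrics from a 2x2 confusion matrix.

    tp/fp/fn/tn are counts of true positives, false positives,
    false negatives, and true negatives.
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Hypothetical counts: 10 actual positives and 90 actual negatives.
metrics = binary_metrics(tp=5, fp=5, fn=5, tn=85)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

With these counts, accuracy is 0.90 while precision, recall, and F1 are each 0.50. That gap is one reason accuracy alone can mislead on imbalanced classes, and why class balancing appears alongside these metrics.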

Week 4: More Advanced Weka Features

  • Document classification
    • What are word vectors?
    • Converting text files to word vectors
  • Saving and importing modified ARFF files
  • Visual exploration of features
  • Meta-algorithms
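
The word-vector idea from the document classification topic can be sketched in a few lines: each document becomes a list of word counts over a shared vocabulary, so that text can be handled like any other numeric dataset. Weka provides its own filter for this step (`StringToWordVector`); the code below is not that filter, only an illustration of the simplest count-based variant. The functions `tokenize` and `word_vectors` and the two sample documents are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


def word_vectors(documents):
    """Return (vocabulary, vectors), one count vector per document."""
    # The vocabulary is every distinct word across all documents, sorted.
    vocabulary = sorted({word for doc in documents for word in tokenize(doc)})
    vectors = []
    for doc in documents:
        counts = Counter(tokenize(doc))
        # A Counter returns 0 for words that do not occur in this document.
        vectors.append([counts[word] for word in vocabulary])
    return vocabulary, vectors


# Hypothetical sample documents.
docs = ["Weka builds trees", "Trees and rules and trees"]
vocab, vecs = word_vectors(docs)
print(vocab)
for vec in vecs:
    print(vec)
```

Each position in a vector corresponds to one vocabulary word, so every document ends up with the same number of features regardless of its length.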

Who Should Take This Course:
Data scientists who want to get up to speed quickly with standard data mining and text mining methods using an open-source tool, and who do not want to go the programming route (R or Python).
Prerequisite: Familiarity with introductory statistics, including regression.
Course Text:
All materials will be provided online.
Software: Weka (open source, free)


February 07, 2020 to March 06, 2020
February 12, 2021 to March 12, 2021

Course Fee: $549

We have flexible policies for transferring to another course or withdrawing if necessary (a modest fee applies).

First-time student or academic? An introductory offer is available on select courses. If you have an academic affiliation, you may be eligible for a discount at checkout.

Add a $50 service fee if you require a prior invoice, need to submit a purchase order or voucher, pay by wire transfer or EFT, or need to refund and reprocess a prior payment. Please use the printed registration form for these and other special orders.

Courses may fill up at any time and registrations are processed in the order in which they are received. Your registration will be confirmed for the first available course date, unless you specify otherwise.

The Institute for Statistics Education is certified to operate by the State Council of Higher Education in Virginia (SCHEV).

Contact Us
Have a question about a course before you register? Call us. We're here for you. (571) 281-8817 or ourcourses (at)
