Predictive Analytics 1 with Python – Machine Learning Tools
This course introduces the basic paradigm for predictive modeling: classification and prediction.
Overview
In this course you will be introduced to basic concepts in predictive analytics, also called predictive modeling, the most prevalent form of data mining. You will cover two core paradigms that account for most business applications of predictive modeling: classification and prediction. You will also study commonly used machine learning techniques and learn how to combine models to obtain optimal results. This course includes hands-on work with Python, a free software environment with statistical computing capabilities.
- Introductory, Intermediate
- 4 Weeks
- Expert Instructor
- Tuiton-Back Guarantee
- 100% Online
- TA Support
Learning Outcomes
At the conclusion of this course you will be able to visualize and explore data, provide an assessment basis for predictive models, and choose appropriate performance measures. You will become familiar with common algorithms including k-nearest-neighbor, Naive Bayes, Classification and Regression Trees, as well as ensemble models.
- Visualize and explore data to better understand relationships among variables
- Organize the predictive modeling task and data flow
- Develop machine learning models with the KNN, Naive Bayes and CART algorithms using Python’s scikit-learn
- Assess the performance of these models with holdout data
- Apply predictive models to generate predictions for new data
- Use Python’s sci-kit learn package to implement the models in the course
Who Should Take This Course
Marketing and IT managers, financial analysts and risk managers, accountants, data analysts, data scientists, forecasters. This course is especially useful if you want to understand what predictive modeling might do for your organization, undertake pilots with minimum setup costs, manage predictive modeling projects, or work with consultants or technical experts involved with ongoing predictive modeling deployments.
Our Instructors
Mr. Kuber Deokar
Mr. Kuber Deokar holds a Masters degree in Statistics from University of Pune, India, where he also taught undergraduate statistics. Mr. Deokar holds the position of Instructional Operations Supervisor at Statistics.com. He is responsible for coordination of Statistics.com online courses, and ensures seamless interactions between the management team, course instructors, teaching assistants, and students. He also serves as the senior teaching assistant and shares instructional responsibilities for several courses, and handles consultancy assignments, working from our office in Pune, India.
Course Syllabus
Week 1
Preparation
- What is supervised learning
- Data partitioning and holdout samples
- Choosing variables (features)
- Handling missing data
- Visualization and exploration
Week 2
Classification and Prediction
- Assessing classification models
- Confusion matrix
- Misclassification costs
- Lift
- Assessing prediction models
- Common metrics
- K-Nearest-Neighbors (KNN)
- Measuring distance
- Choosing k
- Generating classifications and predictions
Week 3
Bayesian Classifiers; CART
- Full Bayes classifier
- Naive Bayes classifier
- Classification and Regression Trees (CART)
- Growing the tree
- Avoiding overfit – pruning
- Using trees for classifications and predictions
Week 4
Ensembles
- Combine multiple algorithms
- Improve results
Class Dates
2023
Instructors: Mr. Kuber Deokar
2024
Instructors: Mr. Kuber Deokar
Instructors: Mr. Kuber Deokar
Instructors: Mr. Kuber Deokar
2025
Instructors: Mr. Kuber Deokar
Instructors: Mr. Kuber Deokar
Instructors: Mr. Kuber Deokar
Prerequisites
You should be familiar with Python, as covered in our Python for Analytics course.
The Statistics.com courses have helped me a lot, pushing me to the limit and making me learn much more than I expected I could. The knowledge I gained I could immediately leverage in my job … then eventually led to landing a job in my dream company – Amazon.
Karolis Urbonas
This program has been a life and work game changer for me. Within 2 weeks of taking this class, I was able to produce far more than I ever had before.
Susan Kamp
The material covered in the Analytics for Data Science Certificate will be indispensable in my work. I can’t wait to take other courses. Great work!
Stephen McAllister
I learned more in the past 6 weeks than I did taking a full semester of statistics in college, and 10 weeks of statistics in graduate school. Seriously.
Amir Aminimanizani
This is the best online course I have ever taken. Very well prepared. Covers a lot of real-life problems. Good job, thank you very much!
Elena Rose
The more courses I take at Statistics.com, the more appreciation I have for the smart approach, quality of instructors, assistants, admin and program. Well done!
Leonardo Nagata
This course greatly benefited me because I am interested in working in AI. It has given me solid foundational knowledge…After completing this last course, I feel I have gained valuable skills that will enhance my employability in Data Science, opening up diverse career opportunities.
Richard Jackson
Frequently Asked Questions
-
What is your satisfaction guarantee and how does it work?
-
Can I transfer or withdraw from a course?
-
Who are the instructors at Statistics.com?
Visit our knowledge base and learn more.
Register For This Course
Predictive Analytics 1 with Python – Machine Learning Tools
Additional Information
Homework
Homework in this course consists of short answer questions to test concepts, guided data analysis problems using software, and end of course data modeling project. Note: There will be a mid-week discussion exercise in the first week of the course.
In addition to assigned readings, this course also has supplemental video lectures, and an end of course data modeling project.
Course Text
The required text for this course is Data Mining for Business Analytics: Concepts, Techniques, and Applications in Python, by Shmueli, Bruce, Gedeck, and Patel. This same text is also used in the follow on courses: “Predictive Analytics 2 – Neural Nets and Regression – with Python” and “Predictive Analytics 3 – Dimension Reduction, Clustering and Association Rules – with Python”
Software
The course includes hands-on work with Python, a free software environment with statistical computing capabilities.
Supplemental Information
Take a 10-question quiz on analytics: Test Yourself
Whatch our preview of this course:
Watch this video by Dr. Shmueli on “Data Mining in a Nutshell”.
Register For This Course
Predictive Analytics 1 with Python – Machine Learning Tools