In this course you will continue work from Predictive Analytics 1, and be introduced to additional techniques in predictive analytics, also called predictive modeling, the most prevalent form of data mining. Predictive modeling takes data where a variable of interest (a target variable) is known and develops a model that relates this variable to a series of predictor variables, also called features. Four modeling techniques will be used: linear regression, logistic regression, discriminant analysis and neural networks. The course includes hands-on work with Python, a free software environment with capabilities for statistical computing.
Dr. Peter Gedeck
Peter Gedeck is at the forefront of the use of data science in drug discovery. He is a Senior Data Scientist at Collaborative Drug Discovery, which offers the pharmaceutical industry cloud-based software to manage the huge amount of data involved in the drug discovery process. Drug discovery involves the exploration and testing of huge numbers of molecule combinations, and much of that testing takes place analytically, hence the need for robust software to handle the data and provide a framework for analyzing it. Peter's specialty is the development of machine learning algorithms to predict biological and physicochemical properties of drug candidates. Prior to this, he worked for twenty years as a computational chemist in drug discovery at Novartis in the United Kingdom, Switzerland, and Singapore. His research interests include the application of statistical and machine learning methods to problems in drug discovery. clinical research and meta-analysis. Peter is also a co-author of Data Mining for Business Analytics - Using Python, continuing this best-selling Wiley series, which serves as the foundation for Statistics.com's Predictive Analytics series of courses. Galit Shmueli and Inbal Yahav, fellow instructors at Statistics.com, are also co-authors in this book series, as is Peter Bruce, the founder of Statistics.com.