The Curious Statistician

Sep 30, 2011

Popular Mistakes in Data Mining

John Elder's presentations on common data mining mistakes are a must-see if you have any experience or plans in the data mining arena.
John Elder's presentations on common data mining mistakes are a must-see if you have any experience or plans in the data mining arena.  One example I recall in particular - a neural net that performed brilliantly in distinguishing the presence or absence of tanks in photographs.  The only problem?  The tank photos were taken on sunny days, the non-tank photos on "puffy-cloud" days.  The algorithm was focusing on the clouds.  John has an online course with many such examples, and much more to share, based on years of experience with industry, at statistics.com - Oct. 28 - Nov. 14.  See http://www.statistics.com/dmmistakes .

Comments


Leave a Comment

Add a Review of this item
Comment Title:
Your Name:
Your Email Address:
Notify me of new comments to this page:
Notify me of future course offerings:
Additional Comments:

Want to be notified of future course offering?


Enter your email address here:

What our students say:

“I realized one of my work projects would benefit from deeper statistical analysis, including functions I had a good background in and knew at one time, but I needed to dust the cobwebs off and catch up to changes in the field.” Douglas D. Reimel, Jr.

Douglas Reimel

"I’ve increased my exposure in my department and profession because I have experience with a number of data analysis approaches. I’ve been asked to give guest lectures in other classes on statistical methods and different strategies, and I was asked to present at a national conference." Todd Lewis, Ph.D., Associate ProfessorDepartment of Counseling and Educational DevelopmentSchool of EducationUniversity of North Carolina at Greensboro

Todd Lewis

"We’re trying to make it easier for patients to get their prosthetic arms to do exactly what they want them to do. I’ve applied what I’ve learned through my statistics.com courses, such as Baysian statistics, computing techniques, biostatistics, clinical trials, analysis and sensitivity software, bioavailability, probability distributions, data mining, and designing experiments to map brain impulses to muscle movement, which ultimate...

Patricia Shewokis

It took me a long time to find just the right program that provides the right mix of applied and theory, but I found the right one at statistics.com. My staff emerges from your training ready to make an impact on the company. Joseph SommaDirector, Market IntelligenceIndependent Health

Joseph Somma

"Traditionally, reports are designed to summarize data, but they can only tell you what happened. I'm applying data mining algorithms I've learned in my Statistics.com coursework to ask why something happened." Susan StranburgSoftware Developer

Susan Stranburg

"My courses help me look at more complex problems using different approaches to show more interesting aspects of conditions, beyond just tables and charts, more than just sampling or descriptive statistics." Cristobal BazanUnited Nations agency

Cristobal Bazan

I hear IT people complaining that they’re always needing to learn new technology because things in their field evolve and change quickly. The same thing is true in analytics. New techniques are developing rapidly. Robert Wood Director, Advanced Analytics Group, Merkle

Robert Wood

© Statistics.com 2004-2012