Skip to content

Explore Courses | Elder Research | Contact | LMS Login

Statistics.com: Data Science, Analytics & Statistics Courses
  • Curriculum
    • Curriculum
    • About Us
    • Testimonials
    • Management Team
    • Faculty Search
    • Teach With Us
    • Credit & Credentialing
  • Courses
    • Explore Courses
    • Course Calendar
    • About Our Courses
    • Course Tour
    • Test Yourself!
  • Mastery Series
    • Mastery Series Program
    • Bayesian Statistics
    • Business Analytics
    • Healthcare Analytics
    • Marketing Analytics
    • Operations Research
    • Predictive Analytics
    • Python for Analytics
    • R Programming
    • Rasch & IRT
    • Spatial Statistics
    • Statistical Modeling
    • Survey Statistics
    • Text Mining and Analytics
  • Certificates
    • Certificate Program
    • Analytics for Data Science
    • Biostatistics
    • Programming for Data Science – R (Novice)
    • Programming for Data Science – R (Experienced)
    • Programming for Data Science – Python (Novice)
    • Programming for Data Science – Python (Experienced)
    • Social Science
  • Degrees
    • Degree Programs
    • Computational Data Analytics Certificate of Graduate Study from Rowan University
    • Health Data Management Certificate of Graduate Study from Rowan University
    • Data Science Analytics Master’s Degree from Thomas Edison State University (TESU)
    • Data Science Analytics Bachelor’s Degree – TESU
    • Mathematics with Predictive Modeling Emphasis BS from Bellevue University
  • Enterprise
    • Organizations
    • Higher Education
  • Resources
    • Blog
    • FAQs & Knowledge Base
    • Glossary
    • Site Map
    • Statistical Symbols
    • Weekly Brief Newsletter Signup
    • Word of the Week
Menu Close
  • Curriculum
    • Curriculum
    • About Us
    • Testimonials
    • Management Team
    • Faculty Search
    • Teach With Us
    • Credit & Credentialing
  • Courses
    • Explore Courses
    • Course Calendar
    • About Our Courses
    • Course Tour
    • Test Yourself!
  • Mastery Series
    • Mastery Series Program
    • Bayesian Statistics
    • Business Analytics
    • Healthcare Analytics
    • Marketing Analytics
    • Operations Research
    • Predictive Analytics
    • Python for Analytics
    • R Programming
    • Rasch & IRT
    • Spatial Statistics
    • Statistical Modeling
    • Survey Statistics
    • Text Mining and Analytics
  • Certificates
    • Certificate Program
    • Analytics for Data Science
    • Biostatistics
    • Programming for Data Science – R (Novice)
    • Programming for Data Science – R (Experienced)
    • Programming for Data Science – Python (Novice)
    • Programming for Data Science – Python (Experienced)
    • Social Science
  • Degrees
    • Degree Programs
    • Computational Data Analytics Certificate of Graduate Study from Rowan University
    • Health Data Management Certificate of Graduate Study from Rowan University
    • Data Science Analytics Master’s Degree from Thomas Edison State University (TESU)
    • Data Science Analytics Bachelor’s Degree – TESU
    • Mathematics with Predictive Modeling Emphasis BS from Bellevue University
  • Enterprise
    • Organizations
    • Higher Education
  • Resources
    • Blog
    • FAQs & Knowledge Base
    • Glossary
    • Site Map
    • Statistical Symbols
    • Weekly Brief Newsletter Signup
    • Word of the Week

Blog

Industry Spotlight: The IRS is Watching You

  • April 12, 2019
  • , 9:28 pm

The IRS (U.S. Internal Revenue Service) has been using computers to choose tax returns for audit since 1962. Early on, the selection was rule-based, but the IRS turned to statistical modeling in 1969, using the oldest predictive analytics model in the toolbox – discriminant analysis. Discriminant analysis, a linear classification technique, was first proposed by Ronald Fisher in 1936. Computer scientists think of discriminant analysis as quaint and old-fashioned, that is, if they think of it at all. However, it it has the merit of being computationally economical (hence fast), and works well with smaller datasets.

According to statistician Amir Aczel, the IRS was still using discriminant analysis in 1995. He published a book that claimed to reverse engineer the IRS discriminant function, and provide definitive guidelines about ratios (e.g. deductions to income) that would guarantee or avoid an audit.

The use of more advanced machine learning methods got a boost in 2011 with the creation of the Office of Compliance Analytics. Now, the selection of returns for audit relies primarily on predictive models whose “rules” are internally-generated and may not even be visible to the IRS analysts.

On the data side, the IRS now has access to much more data than was used in the simple days of discriminant analysis, which relied solely on data in returns and forms filed with the IRS. Now the IRS monitors Facebook, Twitter and Instagram for patterns that might alert them to tax fraud by individual taxpayers. Washington State professors Kimberly A. Houser and Debra Sanders report in their paper, The Use of Big Data Analytics by the IRS: Efficient Solutions or the End of Privacy as We Know It?, that the IRS now tracks over 1 million attributes, or predictor variables, on individual taxpayers, drawn from numerous sources, including, in addition to the social media sources noted above, individual email traffic.

One new area where the IRS has been particularly vigilant is crypto-currencies. Last year, the IRS reported on its successful proceedings against Coinbase, which is the largest domestic crypto-currency exchange. While fewer than 1000 taxpayers reported crypto-currency gains over the 2013-2015 period, the IRS thinks that millions should have done so. The agency added the Coinbase taxpayer data, which it seized, to its other big data resources.

The efficacy of big data analytics for predicting tax fraud can be seen in the following statistic: while its enforcement budget has continuously declined over time, last year the IRS reported that it caught more than 400% more tax fraud cases and recovered 1000% more in taxes than in the prior year.

REFERENCES:

  • http://www.jetlaw.org/wp-content/uploads/2017/04/Houser-Sanders_Final.pdf
  • http://cavqm.blogspot.com/2011/07/reverse-engineering-irs-dif-score.html
  • https://www.smartdatacollective.com/can-predictive-analytics-prevent-tax-evasion/
  • https://www.freemanlaw-pllc.com/the-irs-and-big-data-the-future-of-fighting-tax-fraud/

Subscribe to the Blog

You have Successfully Subscribed!

By submitting your information, you agree to receive email communications from statistics.com. All information submitted is subject to our privacy policy. You may opt out of receiving communications at any time.

Categories

Recent Posts

  • March 9: Statistics and Data Science in Practice March 7, 2021
  • Feb 23: Statistics and Data Science in Practice March 5, 2021
  • Word of the Week – Ruin Theory March 4, 2021

About Statistics.com

Statistics.com offers academic and professional education in statistics, analytics, and data science at beginner, intermediate, and advanced levels of instruction. Statistics.com is a part of Elder Research, a data science consultancy with 25 years of experience in data analytics.

Latest Blogs

  • March 9: Statistics and Data Science in Practice
    March 7, 2021/
    0 Comments
  • Feb 23: Statistics and Data Science in Practice
    March 5, 2021/
    0 Comments
  • Word of the Week – Ruin Theory
    March 4, 2021/
    0 Comments

Social Networks

Linkedin-in
Twitter
Facebook-f
Youtube

Contact

The Institute for Statistics Education
4075 Wilson Blvd, 8th Floor
Arlington, VA 22203
(571) 281-8817

ourcourses@statistics.com

© Copyright 2021 - Statistics.com, LLC | All Rights Reserved | Privacy Policy | Terms of Use

By continuing to use this website, you consent to the use of cookies in accordance with our Cookie Policy.

Accept