Carl McBride Ellis, PhD’s Post

Predictive analytics (tabular data) | Author of "The Orange Book of Machine Learning"

☛ Statisticians torture their data to fit the model.
☛ Data scientists torture their models to fit the data.

#datascience #machinelearning

Jose Luis Hidalgo

It's (mostly) not my fault that AI will probably kill us all.

5d

And if you torture them hard enough they will reveal their secrets and you might learn something valuable, but torture them too much and they will tell you anything you want to hear. (Apologies for the macabre comment, but the analogy was too good to pass up 😅)

Stephan Kolassa

Data Science Expert at SAP Switzerland AG

5d

Management tortures data scientists to fit their preconceptions.

Alonso Zambrano Valero

Behavioral Data Scientist | Segmentation, Lift Analysis & Machine Learning for Commercial Strategy | Insurance, Banking & Consumer Goods | Python, SQL, Power BI | Statistical Validation, ETL & Data Governance.

5d

Nowadays, at least when working with real-world data, you have to do both, if we can even call it torture. This is where the principle of Occam’s Razor comes into play: start with the simplest explanation. I usually begin with statistical models to understand the data and then build on that knowledge using a machine learning model or an ensemble. Cheers.

Stephen Mack

Analytics / Facilitator Decision Management Professional. MB ENTP who gets the big picture

4d

Carl McBride Ellis, PhD, ☛ Statisticians torture their data to fit the model. ☛ Data scientists torture their models to fit the data. Humorous and clever comparison. When I was an undergraduate Chemistry major way before personal computers, a grad student had posted 10 tips for conducting research on his lab door. One related to your observation was: "First plot the curve and then add the data points." My favorite tip was: "Don't just pray for miracles, depend on them."

Mitchell Maltenfort

Statistician at Children's Hospital of Philadelphia

4d

Is it terrible to add “AI practitioners torture everyone”?

Lisandro De la Torre, PhD

Data Science | PhD in Electrical Engineering | Python | R | SQL | MATLAB | Ansys HFSS | CST Microwave Studio | Electromagnetic

5d

Both are extreme points of view. In the second case, you get overfitting; in the first, we could also think of a kind of overfitting, but in the opposite direction. It’s difficult to reach equilibrium, avoiding situations that could be described as “overfitting methodologies”.
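
The two extremes described in the comment above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the thread: the synthetic noisy sine data, the sample sizes, and the polynomial degrees 1, 3, and 12. A too-rigid model (degree 1) is forced onto the data, a very flexible one (degree 12) bends to the training points, and comparing training error with held-out error shows where each stands.

```python
import numpy as np
from numpy.polynomial import Polynomial

# Hypothetical synthetic data: a noisy sine wave (an assumption for illustration).
rng = np.random.default_rng(0)

def make_data(n):
    x = rng.uniform(0.0, 1.0, n)
    y = np.sin(2 * np.pi * x) + rng.normal(0.0, 0.2, n)
    return x, y

x_train, y_train = make_data(25)
x_test, y_test = make_data(200)

def mse(model, x, y):
    """Mean squared error of a fitted polynomial on (x, y)."""
    return float(np.mean((model(x) - y) ** 2))

# degree -> (training MSE, held-out MSE)
results = {}
for degree in (1, 3, 12):
    # Least-squares polynomial fit; Polynomial.fit rescales x internally.
    model = Polynomial.fit(x_train, y_train, degree)
    results[degree] = (mse(model, x_train, y_train), mse(model, x_test, y_test))
    print(f"degree {degree:2d}: train MSE = {results[degree][0]:.4f}, "
          f"test MSE = {results[degree][1]:.4f}")
```

Training error can only go down as the degree grows, so it cannot reveal the over-flexible case on its own; the held-out error is the number to watch when looking for the equilibrium the comment mentions.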

Niels Erik Andersen

Advisor, doer, and experienced board member. Making manufacturers more profitable and sustainable.

5d

And both should likely look up from their data and their models and observe the real world and get context.

Mikhail Mikushin

Data Analyst | BI Analyst | SQL | Power BI | Looker Studio | Python

5d

This world is full of pain.

You gotta do both: low bias, low variance.
