[OC] These two scatter plots have identical correlation coefficients. Here's why one looks so much tighter.

Both datasets: r = 0.70. Same correlation coefficient. But one looks noticeably more clustered around the regression line.

The difference is purely in the standard deviations – not the strength of the relationship. Because Pearson's r converts everything into standard units before measuring, it's blind to how physically spread out the data is. Smaller SDs → visually compact plot → same r.

It's a surprisingly easy trap. Your eyes read the raw coordinate space. r operates in standardized space. Those two views can look totally different.

I put this exact question to ChatGPT (with Thinking Mode) as a test – it fell for it too. Made a short video breaking down the full explanation here: https://youtu.be/GA7DQcc-ouo

by Jazzlike_History89

View 1 Comment

1 Comment

DeliberateDendrite on May 1, 2026 8:02 am

Reminds me of Francis Anscombe’s work. I made a figure consisting of four different datasets that when fitted resulted in the same intercept, slope and correlation coefficient, when a line was fit through them.

You must be logged in to post a comment.

[OC] These two scatter plots have identical correlation coefficients. Here’s why one looks so much tighter.

1 Comment