Correlation vs Causation
Ice cream sales and drowning rates are strongly correlated. Eating ice cream does not cause drowning. Understanding why — and how to think carefully about correlation and causation — is one of the most practically valuable skills in statistics.
In this lesson
1 What Correlation Means
Correlation measures the strength and direction of a linear relationship between two variables. A correlation coefficient (r) ranges from −1 to +1. A correlation of +1 means the variables increase together perfectly. −1 means when one increases the other decreases perfectly. 0 means no linear relationship.
Correlation simply describes a pattern in data. Two variables are correlated if knowing one tells you something about the other. Shoe size and reading ability are correlated in children — not because shoes cause reading, but because both increase with age.
r close to +1: strong positive correlation (both increase together). r close to −1: strong negative correlation (one increases as the other decreases). r close to 0: weak or no linear correlation. Values above 0.7 or below −0.7 are generally considered strong.
2 What Causation Means
Causation means that changing one variable directly produces a change in another. Smoking causes lung cancer — the mechanism is understood, the relationship is direct, and removing the cause reduces the effect. This is causation.
To establish causation you need more than a correlation in data. You need a plausible mechanism, evidence that the cause precedes the effect in time, and ideally experimental evidence where you manipulate the cause and observe the effect while controlling everything else.
3 Why They Are Different
The ice cream and drowning example: both increase in summer. The real cause of both is warm weather — people eat more ice cream when it's hot, and more people swim (and some drown) when it's hot. Ice cream and drowning are correlated, but neither causes the other. The warm weather variable is hiding in the background.
Tyler Vigen's Spurious Correlations website documents hundreds of these: divorce rate in Maine correlates with per capita margarine consumption. US spending on science correlates with suicides by hanging. These demonstrate that any two trending variables will correlate if you look hard enough.
4 Confounding Variables
A confounding variable is a third variable that influences both of the variables you're studying, creating a spurious correlation between them. Warm weather confounds the ice cream/drowning correlation. Age confounds the shoe size/reading ability correlation.
Coffee and heart disease were long thought to be positively correlated — coffee drinkers had higher rates of heart disease. Later research found that heavy coffee drinkers also smoked more. Smoking was the confounder. When researchers controlled for smoking, the coffee-heart disease correlation disappeared.
Ask: is there a third variable that could cause both of the things I'm observing? Demographic variables (age, income, education) are common confounders because they affect so many outcomes simultaneously.
5 How Causation Is Actually Established
The gold standard is a randomized controlled trial (RCT). Randomly assign people to treatment and control groups, administer the treatment to one group only, and measure outcomes. Random assignment eliminates confounders — both groups are similar in every way except the treatment.
When RCTs are unethical or impractical (you can't randomly assign people to smoke for 30 years), researchers use natural experiments, instrumental variables, regression discontinuity, and difference-in-differences — all statistical techniques designed to approximate the conditions of a controlled experiment using observational data.
Bradford Hill criteria provide a framework for inferring causation from observational data: strength of association, consistency across studies, specificity, temporality (cause precedes effect), biological gradient (dose-response), plausibility of mechanism, coherence with existing knowledge, and experimental evidence where available. No single criterion is sufficient; the totality of evidence builds the case.
Practice Problems
📚 Further Reading & Resources
Go deeper with these trusted free resources.