Confidence Intervals Explained
A 95% confidence interval does not mean there is a 95% probability the true value lies within it. This is one of the most persistently misunderstood concepts in statistics. Let's build the correct intuition from the ground up.
In this lesson
1 What a Confidence Interval Is
A confidence interval is a range of values constructed from sample data that is designed to capture the true population parameter a specified percentage of the time. The 95% refers to the procedure, not to any single interval.
Here's the concrete setup: you want to know the average height of all adults in a country. You can't measure everyone, so you take a sample of 1,000 people and measure their heights. You calculate the sample mean and build a confidence interval around it.
A 95% confidence interval means: if you repeated this sampling procedure many times and built a confidence interval each time, 95% of those intervals would contain the true population mean. The 95% describes the long-run reliability of the procedure.
2 How Confidence Intervals Are Constructed
The basic formula for a confidence interval for a mean: CI = x̄ ± z* × (σ/√n), where x̄ is the sample mean, z* is the critical value (1.96 for 95% confidence), σ is the standard deviation, and n is the sample size.
11.9673.04, 76.96)The margin of error (±1.96 in this case) is half the width of the interval. It's what news reports mean when they say "accurate to within ±3 percentage points." Smaller margins of error require larger samples.
3 The Correct Interpretation
After you've built a specific 95% CI of (73.04, 76.96), the true mean either is or isn't in that interval. There's no probability involved for that specific interval — it's a fixed range and the true value is a fixed number. The 95% describes the process that generated the interval.
The correct statement: "I used a procedure that produces intervals containing the true value 95% of the time. This is one of those intervals." Not: "There is a 95% probability the true value is in this specific range."
The practical takeaway: treat a 95% CI as a plausible range for the true parameter. Values inside the interval are consistent with your data. Values outside would be surprising given what you observed.
4 Common Misinterpretations
Once the interval is calculated, there's no randomness left. The true value is fixed. The correct framing is about the procedure's reliability over many repetitions, not the probability for this specific interval.
The confidence interval is about where the population mean might be, not where individual data points fall. For individual data points, you'd use a prediction interval, which is much wider.
A 99% confidence interval is wider than a 95% one — that's how it achieves higher confidence. The tradeoff is precision vs reliability. For medical decisions you might want 99% confidence; for a quick business estimate, 90% might be fine.
5 What Affects Interval Width
Three things determine how wide a confidence interval is. Sample size is the most controllable: doubling the sample size reduces the margin of error by a factor of √2 (about 29%). To halve the margin of error you need to quadruple the sample size.
Confidence level: higher confidence (99% vs 95%) requires a wider interval. You're casting a bigger net to be more sure of catching the true value.
Population variability: more variable populations produce wider intervals. You can't control this directly — it's a property of what you're measuring.
Practice Problems
📚 Further Reading & Resources
Go deeper with these trusted free resources.