StatisticsBeginner

Normal Distribution Explained

The normal distribution — also called the bell curve — is the most important probability distribution in statistics. It describes how many real-world measurements naturally spread out: heights, test scores, measurement errors, and countless other phenomena cluster around a middle value and taper off symmetrically.

1 What the Normal Distribution Is

A normal distribution is a symmetric, bell-shaped curve that describes how values are distributed around a mean. The highest point of the bell is the mean (average), and values become progressively less common the further you move from the center in either direction.

It is defined entirely by two numbers: the mean (μ), which determines where the center sits, and the standard deviation (σ), which determines how wide or narrow the bell is. A small standard deviation produces a tall, narrow bell. A large one produces a wide, flat bell.

What Makes It Normal

A distribution is perfectly normal when it is: symmetric around the mean, has mean = median = mode, and follows the specific mathematical formula involving e (Euler's number) and π. Real data is rarely perfectly normal but is often approximately normal, which is enough for most statistical methods.

2 The 68-95-99.7 Rule

For any normal distribution, regardless of its mean and standard deviation, the same proportions of data fall within 1, 2, and 3 standard deviations of the mean. This is the 68-95-99.7 rule (also called the empirical rule):

Applying the 68-95-99.7 Rule
IQ scores: mean = 100, standard deviation = 15
168% of people score between 100−15 = 85 and 100+15 = 115 (within 1 SD)
295% score between 100−30 = 70 and 100+30 = 130 (within 2 SD)
399.7% score between 100−45 = 55 and 100+45 = 145 (within 3 SD)
4Only 0.3% of people score below 55 or above 145
Answer: The rule applies to any normally distributed data by substituting your mean and SD

This rule is extremely practical. It lets you quickly estimate probabilities without any calculation. If a manufacturing process has a target of 100mm with standard deviation 2mm, you immediately know 95% of parts will be within 4mm of target without doing any math.

3 Why So Many Things Are Normally Distributed

The Central Limit Theorem (CLT) explains why the normal distribution appears everywhere. The theorem states: when you add together many independent random variables, the result tends toward a normal distribution — regardless of what distribution the individual variables have.

Human height is the product of hundreds of genetic and environmental factors. Each factor adds a small random amount to the total. The sum of many small independent random effects produces a normal distribution. This is why height, weight, blood pressure, and exam scores all approximate bell curves.

The CLT is also why sample means are normally distributed even when the underlying data isn't. If you take many samples and calculate the mean of each, those means will form a normal distribution. This property underpins almost all of classical statistics.

4 Z-Scores and the Standard Normal Distribution

The standard normal distribution has mean 0 and standard deviation 1. Any normal distribution can be converted to the standard normal by calculating z-scores: z = (x − μ) / σ.

Z-Score Conversion
A student scores 130 on an IQ test (mean=100, SD=15). What's the z-score?
1z = (x − μ) / σ = (130 − 100) / 15 = 30/15 = 2.0
2A z-score of 2.0 means the student scored 2 standard deviations above the mean
3From the 68-95-99.7 rule: 95% of people score within 2 SDs, so 5% score outside
4Since the distribution is symmetric, 2.5% score above 130
5This student is in the top 2.5% of IQ scores
Answer: z = 2.0, top 2.5%

Z-scores let you compare values from different distributions on a common scale. A z-score of 1.5 means "1.5 standard deviations above average" whether you're measuring height, test scores, or stock returns.

5 Real-World Applications

Finance: stock returns are approximately (though not perfectly) normally distributed in the short term. The assumption of normality underpins options pricing models like Black-Scholes. When markets crash, returns deviate dramatically from normal — these "fat tails" are why pure normal distribution models failed badly in 2008.

Quality control: manufacturers use normal distribution assumptions to set tolerance limits and predict defect rates. Six Sigma quality (6 standard deviations from target) means 3.4 defects per million — achievable because of how rapidly normal distribution probabilities shrink in the tails.

Medicine: clinical reference ranges, drug dosing, and diagnostic thresholds are built around normal distribution assumptions for biological measurements. Knowing that blood glucose levels are approximately normally distributed in healthy people lets doctors set evidence-based thresholds for pre-diabetes and diabetes diagnoses.

Try the Z-Score Calculator

Convert any value to a z-score and find its position in the normal distribution.

Open Calculator →

Practice Problems

A dataset is normally distributed with mean 50 and SD 10. What percentage of values fall between 30 and 70?
30 and 70 are each 2 standard deviations from the mean (50±20). By the 68-95-99.7 rule, 95% of values fall within 2 SDs of the mean.
A student scores 1 SD below the mean on an exam. What percentile are they approximately in?
One SD below the mean puts you at the 16th percentile approximately. Here's why: 68% of students score within 1 SD of the mean, leaving 32% outside. Half of those (16%) are below 1 SD. So the student is at roughly the 16th percentile.
Why can't a perfectly normal distribution describe something like income?
Normal distributions are symmetric and extend to negative infinity. Income can't be negative (it has a hard floor at zero) and is heavily right-skewed — a few very high earners pull the mean well above the median. Income follows a log-normal distribution instead.