StatisticsBeginner

What is Standard Deviation?

Standard deviation is a number that tells you how spread out values are in a dataset. A small standard deviation means values cluster tightly around the average. A large one means they are scattered widely. It is the single most important measure of variability in statistics.

1 What Standard Deviation Actually Measures

Imagine two classes both scored an average of 75 on a test. In Class A, every student scored between 70 and 80. In Class B, half scored 50 and half scored 100. Same average, completely different story. Standard deviation captures that difference — it measures how far values typically stray from the mean.

A standard deviation of 5 means values are typically about 5 units away from the average. A standard deviation of 30 means they stray about 30 units on average. The unit is always the same as the original data — if you're measuring test scores, standard deviation is in points.

The Key Insight

Standard deviation answers: if I pick one value from this dataset at random, how far from the average should I expect it to be? That's it. Everything else follows from this intuition.

2 The Formula, Step by Step

There are two versions: population standard deviation (σ) when you have every value in the group, and sample standard deviation (s) when you have a subset. In practice, use sample (divide by n−1) unless you literally have the entire population.

Calculating Standard Deviation
Dataset: 4, 7, 13, 2, 1, 9
1Find the mean: (4+7+13+2+1+9) / 6 = 36/6 = 6
2Subtract the mean from each value and square it: (4−6)²=4, (7−6)²=1, (13−6)²=49, (2−6)²=16, (1−6)²=25, (9−6)²=9
3Sum the squared differences: 4+1+49+16+25+9 = 104
4Divide by n−1 (sample): 104/5 = 20.8 — this is the variance
5Take the square root: √20.8 ≈ 4.56
Answer: Standard deviation ≈ 4.56 — values typically stray about 4.56 from the mean of 6

The squaring step serves two purposes: it makes all deviations positive (so negative and positive deviations don't cancel), and it weights outliers more heavily — a value 10 away contributes 100, not 10.

3 How to Interpret Standard Deviation

For data that follows a normal distribution (bell curve), the 68-95-99.7 rule applies: about 68% of values fall within one standard deviation of the mean, 95% within two, and 99.7% within three.

The 68-95-99.7 Rule

If a dataset has mean 100 and standard deviation 15 (like IQ scores): 68% of people score between 85 and 115. 95% score between 70 and 130. 99.7% score between 55 and 145. This rule only applies when data is approximately normally distributed.

Standard deviation also lets you compare spread across different datasets. A stock with standard deviation of 2% daily returns is much less volatile than one with 8% daily returns — even if both have the same average return. In finance this is literally how risk is measured.

4 Real-World Applications

Quality control in manufacturing: a factory producing bolts of target diameter 10mm with standard deviation 0.01mm is far more consistent than one with standard deviation 0.5mm. Both hit the target on average, but only the first produces reliably uniform products.

In medicine, lab reference ranges are typically set at mean ± 2 standard deviations for healthy populations. If your blood test result falls outside that range, you are in the outer 5% — which flags it for clinical attention.

In investing, portfolio standard deviation measures risk. A portfolio with 15% average annual return and 5% standard deviation is very different from one with 15% return and 30% standard deviation — the second could easily return −45% in a bad year while the first would likely stay between 5% and 25%.

5 Common Mistakes

Confusing standard deviation with variance

Variance is the average squared deviation. Standard deviation is its square root. They measure the same thing but in different units. Variance is used in statistical formulas; standard deviation is reported to humans because it's in the original units.

Using population formula on a sample

Dividing by n instead of n−1 on sample data systematically underestimates the true population standard deviation. Always use n−1 (sample formula) unless you literally have every member of the population.

Thinking low standard deviation is always better

Low standard deviation means consistency, not quality. A factory consistently producing defective parts has low standard deviation and terrible quality. Context determines whether spread is good or bad.

Try the Standard Deviation Calculator

Calculate standard deviation instantly and see the formula in action with your own numbers.

Open Calculator →

Practice Problems

A dataset has values 10, 20, 30, 40, 50. What is the mean and approximately what would you expect the standard deviation to be — small, medium, or large relative to the mean?
Mean = (10+20+30+40+50)/5 = 30. The values are spread across a range of 40 with the mean in the middle — standard deviation is roughly 14.1. That's medium-large relative to the mean, reflecting the wide spread.
Two funds both return 8% annually. Fund A has SD of 2%, Fund B has SD of 15%. Which is riskier and what does that mean practically?
Fund B is much riskier. In a bad year, Fund B could return 8−30 = −22% or worse (2 SDs down). Fund A's bad year is around 8−4 = 4%. Same average return, dramatically different risk profile.
What percentage of data falls within 2 standard deviations of the mean in a normal distribution?
95% — the 68-95-99.7 rule. This means only 5% of values are more than 2 standard deviations from the mean.