Mean, Median, and Standard Deviation: When Average Lies to You
Aleph Sterling
May 9, 2026 Β· 8 min read
Bill Gates walks into a bar. The average net worth of everyone in the bar immediately becomes over $10 billion. Everyone in the bar is now, on average, a billionaire β while still being broke. This is why "average" is the most misused word in statistics.
Statistics are powerful precisely because they can summarize complex data into a single number. But that simplification comes with a cost: that single number can obscure the most important information in the dataset. Understanding when to use mean, when to use median, and what standard deviation tells you makes you a dramatically more literate consumer of information.
The Mean (Average): Useful Until It Isn't
The mean is calculated by summing all values and dividing by the count. It's mathematically elegant and works perfectly for symmetric, bell-curve-shaped distributions. It fails spectacularly for skewed data.
The mean is misleading when:
- Outliers exist: One extreme value (like Bill Gates' net worth) pulls the mean far from what's "typical"
- The data is skewed: Income, home prices, and wealth are all right-skewed β most values cluster low, but a long tail of high values inflates the mean
- The distribution is bimodal: If half your users are aged 20 and half are aged 60, the "average age of 40" describes no one
The Median: The Middle Value That Often Tells the Truth
The median is the middle value when data is sorted in order. Half the values fall above it, half below. It's resistant to outliers because extreme values don't change the middle of the distribution.
Real Example β U.S. Household Income (2023):
- Mean income: ~$115,000 (pulled up by very high earners)
- Median income: ~$76,000 (what a typical household actually earns)
The difference of $39,000 isn't a calculation error. It's the effect of outliers β a small number of very high earners skewing the mean upward. The median better represents most households.
Use the median when reporting:
- Income, wealth, or net worth
- Home prices
- Company salaries (especially when executives are included)
- Any data with a long tail in one direction
Standard Deviation: How Spread Out Is the Data?
The mean tells you where the center of data is. Standard deviation tells you how far values typically spread from that center. A small standard deviation means most values cluster near the mean. A large one means they're widely dispersed.
Example: Two investment funds with identical 8% average returns
- Fund A: Returns of 7%, 8%, 9%, 8%, 8%. Standard deviation: 0.7%
- Fund B: Returns of -20%, 15%, 32%, -8%, 25%. Standard deviation: 22%
Same average return. Completely different risk profiles. Fund B could devastate your retirement with a bad early sequence, even though the average "looks the same."
In a normal distribution, roughly:
- 68% of values fall within 1 standard deviation of the mean
- 95% of values fall within 2 standard deviations
- 99.7% of values fall within 3 standard deviations
Practical Applications: Spotting Misleading Statistics
Headlines use mean when median tells a different story
"Average American has $87,000 in savings" β this is a mean, skewed heavily by wealthy Americans. The median American has around $8,000 in savings. These describe very different financial realities.
Medical studies and test scores
If a drug reduces symptoms by "an average of 40%," standard deviation matters: if half the patients improved by 80% and half got no benefit (0%), the average is 40% but the drug works very differently for different patients. Standard deviation of 40% vs. 2% tells completely different stories about the drug's reliability.
Restaurant review scores
A restaurant with an average 3.5/5 from 4 reviews (5, 5, 5, 1 β someone had one very bad night) tells a different story than 3.5/5 from 100 reviews clustered around 3β4. Sample size and standard deviation together give the full picture.
A Quick Decision Framework
- Use mean when: Data is roughly symmetric, no extreme outliers, you need to calculate totals from an average
- Use median when: Data is skewed, outliers exist, you want to represent "typical"
- Report standard deviation when: Describing risk, variability, or consistency β anytime the spread matters as much as the center
- Report both mean and median when: The gap between them is large (it signals skew and potential misleading interpretation)
The Bottom Line
The most dangerous statistic is the one presented without context. A mean without knowing the distribution, a percentage without knowing the base, a statistically significant result without knowing the effect size β each can mislead as effectively as a lie.
The antidote isn't distrust of statistics β it's understanding which statistic answers which question. Mean or median? Is the data symmetric? How large is the spread? Asking these questions puts you in control of the numbers rather than at their mercy.
Calculate These Statistics Yourself
- Standard Deviation Calculator β Find mean, median, and standard deviation for any dataset
- Mean Calculator β Calculate average with step-by-step breakdown
- Median Calculator β Find the middle value with full working
- Z-Score Calculator β Measure how far a value is from the mean in standard deviations