Question 1

Can the mean and median ever be equal for a skewed dataset?

Accepted Answer

Yes — it is possible for the mean and median to coincide even in a non-symmetric distribution, though it requires a specific balance of values above and below the median that exactly offsets each other in the mean calculation. This is a coincidence rather than a reliable property. The general rule (right skew → mean > median) describes the typical case, not a mathematical law. Always compute both rather than assuming one from the other.

Question 2

For a normal distribution, are mean and median always equal?

Accepted Answer

For a perfectly normal distribution, yes — mean = median = mode. A normal distribution is perfectly symmetric about its centre. In practice, real data is never perfectly normal, so mean and median may differ slightly even for roughly normal datasets. A small difference (< 5%) is usually just sampling variation and not evidence of meaningful skewness.

Question 3

Which measure does the government report for average wages?

Accepted Answer

Most statistical agencies report both, but when only one is given, median is standard for income data. The UK's Office for National Statistics, the US Census Bureau, and Statistics Canada all use median household income as the headline measure — specifically because it is more representative of the typical household than the mean, which is inflated by top earners. When a government or news report cites "average income," check whether they mean mean or median — the word "average" is ambiguous.

Question 4

Is the median resistant to all types of data problems?

Accepted Answer

The median is resistant to outliers and skewness — but not to all data problems. It is sensitive to the exact values in the middle of the distribution. If your data has systematic measurement errors throughout (not just at the extremes), or if data is missing non-randomly, the median can also be misleading. No single statistic is robust to all types of data quality issues. Always examine the full distribution — not just the summary statistics.

Question 5

What about the mode — when should it be used instead of both?

Accepted Answer

Use mode when: (1) data is categorical (colours, job titles, sizes) — mean and median are undefined for nominal data; (2) you need the single most likely value in a discrete distribution (e.g., most common number of rooms in a house); (3) the distribution is bimodal — reporting one mean or median would miss the two distinct peaks. For continuous data, the mode is rarely meaningful because values rarely repeat exactly.

Property	Mean (x̄)	Median
Definition	Sum of all values ÷ count	Middle value in sorted data
Formula	x̄ = Σx / n	x[(n+1)/2] for odd n; average of two middle values for even n
Sensitive to outliers?	Yes — pulled by extreme values	No — only position matters
Data type required	Numerical (interval or ratio scale)	Numerical or ordinal
For symmetric distributions	Mean = Median	Mean = Median
For right-skewed data	Mean > Median	Better represents typical value
For left-skewed data	Mean < Median	Better represents typical value
Mathematical tractability	High — used in SD, regression, ANOVA	Lower — harder to work with algebraically
Uniqueness	Always unique	Unique (interpolated for even n)
Effect of adding a value	Changes mean	May or may not change median

Mean vs Median — Which Should You Use?

Side-by-Side Comparison

When the Mean Is the Right Choice

When the Median Is the Right Choice

Using Skewness to Make the Decision

How a Single Outlier Changes Each Measure

Summary

Related Calculators

Frequently Asked Questions