Conversion & funnels

Regression to the mean in tests

Regression to the mean is the statistical tendency for an extreme measurement to be closer to the average on the next observation. In experimentation it explains why a page picked because it converted unusually well often 'declines' afterward, and why early test readings overstate effects. Recognising it prevents crediting a change for a return to normal. This page explains the mechanism.

Partially verified

Why extremes don't repeat

Any measurement mixes a stable signal with random noise. An unusually high reading is more likely to have caught a positive noise burst, so the next reading — with fresh, independent noise — tends to land closer to the true average. The extreme regresses toward the mean without anything causal happening.

How it fools experimenters

Selecting a target because it is extreme is the trap. Pick the worst-converting page, change it, and it 'improves' partly because it was due to regress anyway. Early in a test, a variant can look spectacular for the same reason — small samples produce extreme estimates that shrink as data accumulates.

Extreme readings catch noise that won't recur
Selecting on an extreme guarantees apparent change
Small samples exaggerate effect estimates

Guarding against it

A control group is the defence: if both the treated and untreated groups drift back toward average, the drift is regression, not your change. This is exactly why a randomised A/B comparison beats before-and-after on a single hand-picked page.

How it appears in analytics and logs

A standout metric that 'normalises' later may simply be regressing, not responding to anything you did. Early extreme test readings usually shrink with more data.

Diagnostic use case

Be sceptical when you optimise the worst- or best-performing page or variant: some of the subsequent change is regression to the mean, not your intervention.

What WebmasterID can help detect

WebmasterID's consistent first-party time series let you watch a metric over multiple periods, so you can tell a real shift from a reading that is merely regressing toward its average.

Common mistakes

Crediting a fix for a metric that was going to regress anyway.
Trusting a variant's spectacular early-sample reading.
Using before/after on a hand-picked extreme instead of a control.

Privacy and accuracy notes

Regression to the mean is a property of aggregate measurements. Detecting it needs no personal data, only repeated readings of the same metric.

↑ All conversion topics in Conversion & funnels

Sources and verification notes

Wikipedia — Regression toward the meanMechanism and selection-on-extremes trap.

Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.