Conversion & funnels

Statistical power

Power is the probability that a test correctly rejects the null when a true effect of a stated size exists: power = 1 − β. It rises with sample size, with the size of the effect you want to catch, and with a looser significance threshold; it falls with higher metric variance. Underpowered tests waste traffic by failing to detect real wins, so power is planned before launch.

Verified against primary sources

What power depends on

Power for a conversion test is driven by four inputs: the baseline rate and its variance, the minimum effect you want to detect, the significance level α, and the sample size per variant. Increase traffic or the target effect and power rises; tighten α or face a noisier metric and power falls. A convention is to design for 0.8 power, meaning an 80% chance of detecting the stated effect if it is real.

power = 1 − β (β is the type II error rate)
Rises with sample size and effect size
Falls with higher variance or a stricter α

Why underpowered tests mislead

An underpowered test that comes back 'not significant' tells you almost nothing — it may have lacked the data to see a genuine effect. Worse, among the underpowered tests that do reach significance, the estimated effect tends to be inflated (the winner's curse). Plan power up front so a null result is interpretable and a significant one is trustworthy.

Power analysis is the same calculation as sample sizing, viewed from the other direction.

How it appears in analytics and logs

Low power means a flat result is uninformative — the test may simply have been too small to see the effect you cared about.

Diagnostic use case

Set a target power (a common convention is 0.8) and the smallest effect worth catching, then size the test so you can actually detect it.

What WebmasterID can help detect

WebmasterID's first-party conversion volumes tell you the traffic available, which bounds the power a test can reach in a given window.

Common mistakes

Reading 'not significant' from an underpowered test as 'no effect'.
Designing for a tiny effect without the traffic to detect it.
Ignoring metric variance when estimating power.

Privacy and accuracy notes

Power is a function of aggregate counts and variance, not individuals. No personal data is required to compute it.

↑ All conversion topics in Conversion & funnels

Sources and verification notes

NIST/SEMATECH e-Handbook — sample sizes and power of a test

Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.