Power Analysis and Sample Size

Summary

Power analysis determines the minimum sample size needed to detect a meaningful effect. Under-powered studies risk missing real effects (Type II error); over-powered studies waste resources. Power depends on significance level ( $α$ ), desired power ( $1 - β$ ), and expected effect size.

Core Concepts

Term	Definition	Typical Value
Type I error ( $α$ )	False positive — rejecting a true null	0.05 or 0.01
Type II error ( $β$ )	False negative — failing to reject a false null	0.20
Power ( $1 - β$ )	Probability of detecting a real effect	0.80 or 0.90
Effect size	Magnitude of the difference you want to detect	From prior studies or pilot data

Beyond Type I and Type II

In under-powered studies the more practically dangerous errors are Type S (sign) and Type M (magnitude) errors — getting the direction of an effect wrong, or dramatically over-estimating its size. These are not controlled by conventional power analysis and are exacerbated when sample sizes are small.

Key Normal Deviates

$α$ (two-tailed)	$Z_{α /2}$	Power	$Z_{1 - β}$
0.05	1.96	80%	0.84
0.01	2.58	90%	1.28

Sample Size Formulas

Comparing Two Means (t-test)

N_{per group} = \frac{2 ( Z _{α /2} + Z _{1 - β} ) ^{2} σ ^{2}}{d ^{2}}

where $σ$ is the pooled SD and $d$ is the minimum detectable difference.

Comparing Two Proportions

N = \frac{( Z _{α /2} + Z _{1 - β} ) ^{2} \cdot p ˉ ( 1 - p ˉ ) \cdot ( 1 + r )}{r \cdot d ^{2}}

where $\overset{p}{ˉ}$ is the average proportion, $d$ is the difference, and $r = n_{1} / n_{2}$ .

Survey / Single Proportion

N = \frac{Z _{α /2}^{2} \cdot P ( 1 - P )}{E ^{2}}

where $P$ is expected prevalence and $E$ is margin of error.

Correlation

N = (\frac{Z _{α /2} + Z _{1 - β}}{0.5 ln \frac{1 + r}{1 - r}})^{2}

Practical Adjustments

Attrition: adjust $N_{1} = N / (1 - q)$ where $q$ is expected dropout rate
One-tailed tests: ~20% fewer subjects
Non-randomized designs: add ~20% more subjects
Crossover designs: ~25% of parallel group requirement
Categorical outcomes require larger samples than continuous for equivalent power

Tip

Always base effect size estimates on prior literature or pilot data. Overly optimistic effect sizes lead to under-powered studies — one of the key contributors to the replication crisis.

Connection to Bayesian Approaches

In Bayesian analysis, the concept of “power” is less central — instead, one can use posterior predictive simulation to assess whether the planned sample provides adequate precision for quantities of interest. See Fitting and Validating Computation for simulation-based approaches.

Second Brain

Explorer

Power Analysis and Sample Size

Power Analysis and Sample Size

Core Concepts

Key Normal Deviates

Sample Size Formulas

Comparing Two Means (t-test)

Comparing Two Proportions

Survey / Single Proportion

Correlation

Practical Adjustments

Connection to Bayesian Approaches

See Also

Graph View

Table of Contents

Backlinks