Potential Outcomes Framework

Summary

The potential outcomes framework (Rubin causal model) defines causal effects as comparisons of counterfactual outcomes under different treatment conditions for the same unit. The key identifying assumptions — SUTVA, ignorability (unconfoundedness + overlap) — determine when causal effects can be estimated from observational data.

Overview

The potential outcomes framework is the mainstream statistical framework for causal inference, underpinning the Bayesian review by Li et al. (2022). Following the dictum “no causation without manipulation”, a cause is a pre-specified treatment or intervention that is at least hypothetically manipulable.

Setup

Consider units indexed by . Each unit has:

  • — binary treatment indicator ( active, control)
  • — vector of pre-treatment covariates observed before treatment
  • potential outcomes under treatment and control respectively
  • — the observed outcome (only one potential outcome is observed)

The fundamental problem of causal inference: for each unit, only is observed; is missing (counterfactual).

Let , .

Key Assumption: SUTVA

Assumption: Stable Unit Treatment Value Assumption (SUTVA)

There is: (i) no different version of a treatment, and (ii) no interference — unit ‘s potential outcomes are not affected by other units’ treatment assignments.

Under SUTVA, unit has exactly two potential outcomes: and .

SUTVA rules out spillover effects and treatment heterogeneity due to dose or version. Violations occur in settings like infectious disease (interference) or drug dosage (multiple versions).

Key Assumption: Ignorability

Assumption 2.1 — Ignorability (Li et al. §2)

The assignment mechanism is ignorable if both:

(a) Unconfoundedness: , or equivalently for all .

(b) Overlap: for all , where is the propensity score.

  • Unconfoundedness (also called selection on observables or no unmeasured confounding): treatment assignment is as-good-as-random conditional on observed covariates. This is fundamentally untestable from observed data.
  • Overlap (also called positivity): every unit has a non-zero probability of receiving either treatment. Ensures the conditional distribution of potential outcomes is identifiable from observed data.

Together, these ensure that:

for all — i.e., the potential outcome mean equals the observed conditional mean.

The Role of Covariate Overlap and Balance

Overlap and balance refers to similarity in the distribution of covariates between the two treatment groups. This is a key concept for design:

  • In randomized experiments: all covariates are balanced in expectation; simple difference-in-means is unbiased for and .
  • In observational studies: groups are often imbalanced (e.g., sicker patients get more treatment). Direct comparison gives biased causal estimates.
  • Poor overlap: outcome model estimates rely on extrapolation; sensitive to model misspecification.

Design vs. Analysis

A main effort in causal inference with observational data is the design stage: ensuring overlap and balance to mimic a randomized experiment as closely as possible. The design stage does not involve the outcome — this contrasts with the analysis stage, which uses the outcome to estimate effects.

Randomized Experiments vs. Observational Studies

FeatureRandomized ExperimentObservational Study
Assignment known?Yes, controlledNo, must be modeled
IgnorabilityHolds by designHolds approximately (best case)
OverlapUsually goodMay be poor
ConfoundingEliminated by randomizationPresent; must adjust

A/B testing and randomized controlled trials are the gold standard for causal inference precisely because they eliminate confounding via randomization.

Connections

See Also