Potential Outcomes Framework

Summary

The potential outcomes framework (Rubin causal model) defines causal effects as comparisons of counterfactual outcomes under different treatment conditions for the same unit. The key identifying assumptions — SUTVA, ignorability (unconfoundedness + overlap) — determine when causal effects can be estimated from observational data.

Overview

The potential outcomes framework is the mainstream statistical framework for causal inference, underpinning the Bayesian review by Li et al. (2022). Following the dictum “no causation without manipulation”, a cause is a pre-specified treatment or intervention that is at least hypothetically manipulable.

Setup

Consider $N$ units indexed by $i \in {1, \dots, N}$ . Each unit $i$ has:

$Z_{i} \in {0, 1}$ — binary treatment indicator ( $Z_{i} = 1$ active, $Z_{i} = 0$ control)
$X_{i}$ — vector of $p$ pre-treatment covariates observed before treatment
$Y_{i} (1), Y_{i} (0)$ — potential outcomes under treatment and control respectively
$Y_{i}^{obs} = Y_{i} (Z_{i})$ — the observed outcome (only one potential outcome is observed)

The fundamental problem of causal inference: for each unit, only $Y_{i} (Z_{i})$ is observed; $Y_{i} (1 - Z_{i})$ is missing (counterfactual).

Let $Z = (Z_{1}, \dots, Z_{N})^{T}$ , $X = (X_{1}, \dots, X_{N})^{T}$ .

Key Assumption: SUTVA

Assumption: Stable Unit Treatment Value Assumption (SUTVA)

There is: (i) no different version of a treatment, and (ii) no interference — unit $i$ ‘s potential outcomes are not affected by other units’ treatment assignments.

Under SUTVA, unit $i$ has exactly two potential outcomes: $Y_{i} (1)$ and $Y_{i} (0)$ .

SUTVA rules out spillover effects and treatment heterogeneity due to dose or version. Violations occur in settings like infectious disease (interference) or drug dosage (multiple versions).

Key Assumption: Ignorability

Assumption 2.1 — Ignorability (Li et al. §2)

The assignment mechanism is ignorable if both:

(a) Unconfoundedness: $Z_{i} ⊥ ⊥ {Y_{i} (0), Y_{i} (1)} ∣ X_{i}$ , or equivalently $Z_{i} ⊥ ⊥ {Y_{i} (0), Y_{i} (1)} ∣ {X_{i} = x_{i}}$ for all $x$ .

(b) Overlap: $0 < e (x) < 1$ for all $x$ , where $e (x) = Pr (Z_{i} = 1 ∣ X_{i} = x)$ is the propensity score.

Unconfoundedness (also called selection on observables or no unmeasured confounding): treatment assignment is as-good-as-random conditional on observed covariates. This is fundamentally untestable from observed data.
Overlap (also called positivity): every unit has a non-zero probability of receiving either treatment. Ensures the conditional distribution of potential outcomes is identifiable from observed data.

Together, these ensure that:

μ_{z} (x) \equiv E [Y_{i} (z) ∣ X_{i} = x] = E [Y_{i} ∣ Z_{i} = z, X_{i} = x]

for all $z, x$ — i.e., the potential outcome mean equals the observed conditional mean.

The Role of Covariate Overlap and Balance

Overlap and balance refers to similarity in the distribution of covariates between the two treatment groups. This is a key concept for design:

In randomized experiments: all covariates are balanced in expectation; simple difference-in-means $\overset{τ}{^}$ is unbiased for $τ^{S}$ and $τ^{P}$ .
In observational studies: groups are often imbalanced (e.g., sicker patients get more treatment). Direct comparison gives biased causal estimates.
Poor overlap: outcome model estimates rely on extrapolation; sensitive to model misspecification.

Design vs. Analysis

A main effort in causal inference with observational data is the design stage: ensuring overlap and balance to mimic a randomized experiment as closely as possible. The design stage does not involve the outcome — this contrasts with the analysis stage, which uses the outcome to estimate effects.

Randomized Experiments vs. Observational Studies

Feature	Randomized Experiment	Observational Study
Assignment known?	Yes, controlled	No, must be modeled
Ignorability	Holds by design	Holds approximately (best case)
Overlap	Usually good	May be poor
Confounding	Eliminated by randomization	Present; must adjust

A/B testing and randomized controlled trials are the gold standard for causal inference precisely because they eliminate confounding via randomization.

Connections

Causal Estimands — formal treatment effect quantities defined under this framework
Frequentist Causal Estimation — IPW, outcome modeling, doubly-robust estimators exploit ignorability
General Structure of Bayesian CI — Bayesian inference treats missing potential outcomes as parameters to be imputed
Sensitivity Analysis in Observational Studies — methods to assess robustness when unconfoundedness may fail

Second Brain

Explorer

Potential Outcomes Framework

Potential Outcomes Framework

Overview

Setup

Key Assumption: SUTVA

Key Assumption: Ignorability

The Role of Covariate Overlap and Balance

Randomized Experiments vs. Observational Studies

Connections

See Also

Graph View

Table of Contents

Backlinks