Instrumental Variables and Principal Stratification

Summary

Instrumental variables (IV) are used when unconfoundedness is untenable — they provide an exogenous source of variation in treatment assignment. The IV estimand is the Complier Average Causal Effect (CACE), identified under monotonicity and exclusion restriction. Bayesian IV inference treats the unobserved compliance stratum as a latent variable to be imputed via data augmentation.

Overview

IV methods address settings where unconfoundedness fails — unmeasured confounders exist and sensitivity analysis is insufficient. An instrument provides exogenous variation in treatment to identify causal effects for a subpopulation.

IV methods are one of the most important techniques for causal inference in economics and social sciences. The Bayesian framework naturally extends to this setting (§7 of Li et al. 2022), treating the IV problem as a mixture model with latent compliance strata.

IV Setup

Let:

  • — randomly assigned instrument (e.g., randomization to receive an encouragement)
  • — actual treatment received ( treated, control)
  • — observed outcome
  • — potential treatments under instrument values

When , non-compliance occurs. Four compliance types are defined by the potential treatment pair :

Definition: Compliance Types (Principal Strata)

TypeDescription
Compliers (co)10Follow the instrument
Always-takers (at)11Always take treatment
Never-takers (nt)00Never take treatment
Defiers (df)01Do the opposite

The compliance stratum is a pre-treatment characteristic of unit — comparisons of and within each stratum are valid causal effects (principal causal effects).

IV Assumptions

Assumption: IV Validity (Angrist, Imbens & Rubin 1996)

A valid instrument satisfies: (i) Randomization: is randomly assigned (ii) Exclusion restriction: affects the outcome only through its effect on (the instrument has no direct effect on ) (iii) Monotonicity: for all (no defiers)

Under monotonicity, only compliers, always-takers, and never-takers exist ().

The Complier Average Causal Effect (CACE)

Definition: Complier Average Causal Effect (CACE)

The average treatment effect for compliers only:

Under monotonicity, this equals:

which is exactly the probability limit of the two-stage least squares (2SLS) estimator.

Also called the Local Average Treatment Effect (LATE) in the Frequentist literature (Imbens & Angrist 1994).

Interpretation: CACE is the intention-to-treat effect divided by the compliance rate. It measures the effect of the assignment on the outcome for the subpopulation that actually complies with the assignment.

Bayesian IV Inference

Bayesian IV inference (first outlined by Imbens & Rubin 1997) treats the unobserved compliance stratum as a latent variable to be imputed.

For each unit , six quantities are now associated: , where:

  • — observed outcome
  • — observed treatment
  • — missing outcome
  • — missing treatment

The full-data joint distribution is:

Under unconfoundedness, is ignorable.

Two models to specify:

  1. Compliance model: — e.g., multinomial logistic regression for
  2. Outcome model: — e.g., generalized linear model for given

Example 7.1 — Bayesian IV with One-Sided Non-Compliance

Example 7.1 — IV with One-Sided Non-Compliance

Setup: Randomized experiment with binary outcome , where control units () have no access to treatment — no always-takers.

Result: Only two strata: . Assume (randomization), and simple conjugate priors.

Posterior sampling (Gibbs):

  • Sample from
  • Sample from — similarly for
  • Impute for units using probability proportional to vs.

The CACE posterior: .

Principal Stratification (Generalization)

The IV setup is a special case of principal stratification (Frangakis & Rubin 2002): a unified framework for causal inference with post-treatment confounded variables.

A post-treatment variable lies in the causal pathway between treatment and outcome. Its potential values define principal strata — subsets of units with the same pair of potential intermediate values. Within each stratum, comparisons of and are valid causal effects.

Key insight: in the IV setting is a pre-treatment characteristic, making comparisons within compliance strata valid causal comparisons. This is the fundamental reason the CACE has a clear causal interpretation.

Applications beyond IV:

  • Censoring by death (survival endpoints)
  • Surrogate endpoints
  • Regression discontinuity designs
  • Time-varying treatments
  • Many clever natural experiments

Connections to Frequentist IV

The Bayesian IV approach is closely related to the 2SLS estimator: the CACE equals the probability limit of 2SLS. However:

  • 2SLS is derived from the structural equation model framework
  • The potential outcomes framework makes 2SLS correspond to a causal effect only for a few special cases (Angrist et al. 1996)
  • The Bayesian mixture model approach is more general and handles complex CATE estimands naturally

Connections

See Also