Instrumental Variables

Summary

IV methods solve the omitted variables problem by using a variable (the instrument) that is correlated with the treatment but uncorrelated with other determinants of the outcome. The key implementation is Two-Stage Least Squares (2SLS).

The IV Setup

When the causal model suffers from , an instrument satisfies:

  1. Relevance (first stage): — the instrument affects treatment
  2. Exclusion restriction: — the instrument only affects outcomes through treatment

The IV estimand:

Two-Stage Least Squares

Stage 1: Regress on and covariates → get fitted values

Stage 2: Regress on and → coefficient on is the 2SLS estimate of

Use Canned Software

Don’t literally run 2SLS in two steps — the standard errors will be wrong. Use built-in IV commands (e.g., ivregress in Stata).

The Wald Estimator

With a binary instrument:

The reduced-form difference in means, rescaled by the first-stage difference.

Key Examples

Returns to schooling (Angrist & Krueger, 1991)

  • Instrument: Quarter of birth (affects schooling through compulsory attendance laws)
  • First stage: Q1 births → ~0.15 fewer years of schooling
  • Result: 2SLS estimates of ~0.08-0.11 (slightly above OLS ~0.07)

Vietnam-era military service (Angrist, 1990)

  • Instrument: Draft lottery number (randomly assigned)
  • First stage: Draft-eligible men 16pp more likely to serve
  • Result: Military service reduced 1981 earnings by ~$2,700 (15% of mean)

Effect of family size on labor supply (Angrist & Evans, 1998)

  • Instruments: Twins at second birth; same-sex sibling composition
  • Different instruments give different estimates → suggests heterogeneous effects

Local Average Treatment Effects (LATE)

With heterogeneous effects, IV estimates the causal effect on compliers — those whose treatment status is changed by the instrument. This is a local rather than population-average effect.

See Also