The Selection Problem

Summary

Selection bias arises because individuals who receive treatment differ systematically from those who don’t, even in the absence of treatment. This is the fundamental challenge that all causal inference methods aim to overcome.

Potential Outcomes Framework

For individual with treatment :

  • : outcome without treatment
  • : outcome with treatment
  • Causal effect: (never directly observed for any individual)

The observed outcome:

The Decomposition

Selection Bias Can Be Large

In the hospital example, selection bias is negative (sick people seek hospitals) and large enough to completely mask a positive treatment effect — making hospitals appear harmful.

Solutions

MethodHow it addresses selection bias
Random assignmentMakes independent of potential outcomes
MatchingControls for observables that drive selection
IVUses exogenous variation in treatment
Fixed effectsControls for time-invariant unobservables
RDExploits arbitrary assignment rules

See Also