Bayesian Causal Inference: A Critical Review
Summary
Li, Ding, and Mealli (2022) provide a comprehensive critical review of the Bayesian perspective on causal inference within the potential outcomes framework. The paper identifies issues unique to Bayesian causal inference — including identifiability, the role of the propensity score, prior choice, and the design stage — and extends the discussion to instrumental variables and time-varying treatments.
Overview
Citation: Li F, Ding P, Mealli F. 2023. Bayesian causal inference: a critical review. Phil. Trans. R. Soc. A 381: 20220153. https://doi.org/10.1098/rsta.2022.0153
Authors:
- Fan Li (Duke University)
- Peng Ding (UC Berkeley)
- Fabrizia Mealli (University of Florence and EUI)
Published: Part of the theme issue ‘Bayesian inference: challenges, perspectives, and prospects’
Keywords: causal inference, design, ignorability, potential outcomes, propensity score
Research Question and Contribution
The paper addresses: What is the Bayesian approach to causal inference, and what are its unique strengths and challenges?
Three Frequentist inferential approaches exist within the potential outcomes framework: Fisher randomization testing, Neymanian repeated-sampling evaluation, and Bayesian inference. This review focuses on the Bayesian approach, which has been underrepresented in the literature relative to Frequentist methods.
Key contributions:
- Systematic review of Bayesian causal inference structure (factorization, prior independence, estimands)
- Identifies regularization-induced confounding as a critical high-dimensional challenge
- Reviews three strategies for incorporating the propensity score into Bayesian analysis
- Covers sensitivity analysis to unmeasured confounding (E-value, copula methods)
- Extends to complex mechanisms: IV/principal stratification and time-varying treatments
- Articulates when Bayesian > Frequentist and cautions against “being Bayesian for its own sake”
Paper Structure
| Section | Topic | Notes |
|---|---|---|
| §2 | Estimands, identification, frequentist methods | Causal Estimands, Potential Outcomes Framework, Frequentist Causal Estimation |
| §3 | General Bayesian CI structure | General Structure of Bayesian CI |
| §4 | Model specification (outcome models, high-dim) | Bayesian Outcome Models |
| §5 | Propensity score role | Propensity Score in Bayesian CI |
| §6 | Sensitivity analysis | Sensitivity Analysis in Observational Studies |
| §7 | Complex mechanisms | Instrumental Variables and Principal Stratification, Time-Varying Treatments and G-computation |
| §8 | Discussion/conclusions | (below) |
Key Takeaways
Central message
The Bayesian approach offers a unified inferential framework for any causal estimand via imputation of missing potential outcomes. However, the design stage (ensuring covariate overlap and balance) remains critical regardless of inferential mode — Bayesian analysis cannot substitute for good design.
Strengths of Bayesian causal inference:
- Unified framework for any estimand — including complex ones like ITEs, principal strata effects
- Automatic uncertainty quantification for any functional of the posterior
- Natural incorporation of prior knowledge
- Rich model library for complex data (spatial, temporal, functional, SUTVA violations)
Weaknesses / open questions:
- Identifiability blurs in Bayesian paradigm — all parameters have posteriors even when non-identified
- Prior independence assumption (Assumption 3.2) can act as strongly informative prior in high dimensions (prior dogmatism)
- Propensity score role is contentious — drops from likelihood under ignorability, yet essential for overlap/balance
- High-dimensional settings: open question on optimal design stage procedure
- Computationally demanding relative to Frequentist alternatives
Meta-level conclusion: “Being Bayesian should be dictated by its practical utility in a specific context rather than an unconditional commitment to the Bayesian doctrine.”
Connections
- Potential Outcomes Framework — foundational setup reviewed in §2
- General Structure of Bayesian CI — core inference architecture (§3)
- Bayesian Propensity Scores and IPW — existing vault note on Bayesian IPW (Heiss blog), extended here
- Nonparametric Causal Inference — existing vault note; BART and GP models discussed in §4
- Copula Estimation — copula-based sensitivity analysis in §6
See Also
- Causal Estimands — formal definitions of ITE, SATE, CATE, PATE, MATE
- Propensity Score in Bayesian CI — three strategies for incorporating propensity scores
- Sensitivity Analysis in Observational Studies — E-value and copula parametrizations