Bayesian Linear Regression
Summary
Chapter 14 of BDA3 presents the Bayesian approach to linear regression. Priors on coefficients provide natural regularization, and the full posterior gives uncertainty intervals for predictions — not just point estimates.
The Model
With a noninformative prior , the posterior for is a multivariate distribution centered at the OLS estimate — the Bayesian and frequentist answers coincide.
Regularization Through Priors
Informative priors on provide regularization:
- Ridge-like: — shrinks coefficients toward zero
- Lasso-like: — encourages sparsity
- Horseshoe prior: heavy-tailed, allows large signals while shrinking noise — state of the art for sparse problems
Key Topics
- Causal inference: regression for estimating treatment effects (incumbency and voting example) — connects to Regression and the CEF
- Dimension reduction: when is large relative to , priors are essential
- Unequal variances: heteroscedastic models with
- Prior information: incorporating external knowledge about coefficient magnitudes
See Also
- Regression and the CEF — the frequentist perspective from Angrist & Pischke
- Hierarchical Linear Models — varying coefficients across groups
- Hierarchical Models — partial pooling as the multilevel generalization of this model
- Generalized Linear Models — extending beyond normality
- Omitted Variables Bias — Bayesian regularization (shrinkage) partially mitigates OVB in high- settings
- Conditional Independence Assumption — the assumption needed for causal interpretation of regression coefficients
- Bayesian Workflow - Overview — iterative model building context for regression
- MCMC Basics — computation for posterior inference when analytic forms are unavailable
- Statistical Rethinking - Overview — McElreath’s pedagogical introduction to the same regression models from a code-first perspective
- Linear Models in Statistical Rethinking — McElreath’s code-first treatment of the full linear model framework including priors and prediction
- Moderation Analysis — interaction terms as an extension of Bayesian linear regression
- Missing Data Models — Bayesian regression handles missing data naturally through the generative model
- Horseshoe and Regularized Horseshoe Priors — detailed treatment of global-local shrinkage priors for high-dimensional sparse regression