Bayesian Linear Regression

Summary

Chapter 14 of BDA3 presents the Bayesian approach to linear regression. Priors on coefficients provide natural regularization, and the full posterior gives uncertainty intervals for predictions — not just point estimates.

The Model

y ∣ X, β, σ^{2} \sim N (Xβ, σ^{2} I)

With a noninformative prior $p (β, σ^{2}) \propto σ^{- 2}$ , the posterior for $β$ is a multivariate $t$ distribution centered at the OLS estimate $\hat{β}$ — the Bayesian and frequentist answers coincide.

Regularization Through Priors

Informative priors on $β$ provide regularization:

Ridge-like: $β_{j} \sim N (0, τ^{2})$ — shrinks coefficients toward zero
Lasso-like: $β_{j} \sim Laplace (0, λ)$ — encourages sparsity
Horseshoe prior: heavy-tailed, allows large signals while shrinking noise — state of the art for sparse problems

Key Topics

Causal inference: regression for estimating treatment effects (incumbency and voting example) — connects to Regression and the CEF
Dimension reduction: when $p$ is large relative to $n$ , priors are essential
Unequal variances: heteroscedastic models with $var (y_{i}) = σ_{i}^{2}$
Prior information: incorporating external knowledge about coefficient magnitudes

Second Brain

Explorer

Bayesian Linear Regression

Bayesian Linear Regression

The Model

Regularization Through Priors

Key Topics

See Also

Graph View

Table of Contents

Backlinks