Approximation Methods

Summary

Chapter 13 of BDA3 covers deterministic approximations to the posterior — faster alternatives to MCMC that trade exactness for speed. Useful for large datasets or rapid iteration.

Laplace Approximation

Approximate the posterior with a Gaussian centered at the mode:

$$
p(θ ∣ y) \approx N\left(\hat{θ},\ \left[-\nabla^{2} \log p(θ ∣ y)\,\big|_{θ = \hat{θ}}\right]^{-1}\right)
$$

Fast: only requires optimization + Hessian computation
Asymptotically exact: under regularity conditions the posterior approaches this Gaussian as $n \to \infty$ (see Asymptotics and Frequentist Connections)
Fails for multimodal, skewed, or bounded posteriors
Foundation for INLA (Integrated Nested Laplace Approximation)
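
The recipe above (find the mode, then invert the negative Hessian of the log posterior) can be sketched in one dimension. This is a minimal illustration rather than a general implementation: the Beta(9, 5) target, the helper names `log_post`, `d1`, `d2`, and `laplace_1d`, and the finite-difference step sizes are all assumptions chosen for the example.

```python
import math


def log_post(theta, a=9.0, b=5.0):
    """Unnormalized log density of a Beta(a, b) posterior (illustrative target)."""
    return (a - 1.0) * math.log(theta) + (b - 1.0) * math.log(1.0 - theta)


def d1(f, x, h=1e-5):
    """Central finite-difference estimate of the first derivative."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


def d2(f, x, h=1e-4):
    """Central finite-difference estimate of the second derivative."""
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)


def laplace_1d(f, x0, iters=50, tol=1e-8):
    """Newton iterations to the mode, then variance = -1 / f''(mode)."""
    x = x0
    for _ in range(iters):
        step = d1(f, x) / d2(f, x)
        x -= step
        if abs(step) < tol:
            break
    return x, -1.0 / d2(f, x)


mode, var = laplace_1d(log_post, 0.5)
print(f"Laplace approximation: N({mode:.4f}, {var:.4f})")
```

For this target the mode is 2/3 and the Laplace variance is 1/54 (about 0.0185) analytically, while the exact Beta(9, 5) variance is about 0.0153. The gap is a small instance of the caveat about skewed and bounded posteriors.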

Variational Inference (VI)

Approximate $p (θ ∣ y)$ with a simpler distribution $q (θ)$ by minimizing KL divergence:

$$
q^{*} = \arg\min_{q \in \mathcal{Q}} \mathrm{KL}\left(q ∥ p(θ ∣ y)\right)
$$
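
A standard identity shows why this objective can be optimized without knowing the normalizing constant. It is written here with $p(y, θ)$ for the joint density, a notation assumed for this sketch:

$$
\log p(y) = \underbrace{E_{q}\left[\log p(y, θ) - \log q(θ)\right]}_{\mathrm{ELBO}(q)} + \mathrm{KL}\left(q ∥ p(θ ∣ y)\right)
$$

Because $\log p(y)$ does not depend on $q$, minimizing the KL term is equivalent to maximizing the ELBO, which involves only the unnormalized posterior.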

Mean-field VI: factorizes $q (θ) = \prod_{j} q_{j} (θ_{j})$ — fast but ignores posterior correlations
ADVI (Automatic Differentiation VI): transforms to unconstrained space and uses gradient-based optimization
Much faster than MCMC, useful for exploratory analysis and large datasets
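Tends to underestimate posterior variance, because minimizing $KL (q ∥ p)$ heavily penalizes $q$ for placing mass where the posterior has little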
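
The effect of the mean-field factorization can be seen on a correlated bivariate Gaussian target, where the coordinate-ascent updates have a known closed form: each factor is Gaussian with variance equal to the reciprocal of the matching diagonal entry of the precision matrix. The sketch below assumes that setting; the function name `cavi_gaussian_2d` and the example numbers are hypothetical.

```python
def cavi_gaussian_2d(mu, cov, sweeps=200):
    """Mean-field coordinate ascent for a 2-D Gaussian target N(mu, cov).

    Returns the factor means and factor variances of q(t1) q(t2).
    """
    (s11, s12), (s21, s22) = cov
    det = s11 * s22 - s12 * s21
    # Entries of the precision matrix (inverse covariance).
    l11, l12, l22 = s22 / det, -s12 / det, s11 / det

    m1, m2 = 0.0, 0.0  # arbitrary starting means
    for _ in range(sweeps):
        # Each update conditions on the current mean of the other factor.
        m1 = mu[0] - (l12 / l11) * (m2 - mu[1])
        m2 = mu[1] - (l12 / l22) * (m1 - mu[0])

    # Factor variances are fixed by the precision diagonal.
    return (m1, m2), (1.0 / l11, 1.0 / l22)


rho = 0.9
means, variances = cavi_gaussian_2d(mu=(1.0, -1.0), cov=((1.0, rho), (rho, 1.0)))
print("mean-field means:", means)
print("mean-field variances:", variances, "vs. true marginal variance 1.0")
```

With correlation 0.9 the means are recovered, but each factor variance is $1 - 0.9^{2} = 0.19$ against a true marginal variance of 1, which shows both the ignored correlation and the variance underestimation noted above.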

Expectation Propagation (EP)

Iteratively refines a global approximation one likelihood factor at a time, matching its moments to those of the distribution obtained by including that data point’s exact contribution
More accurate than mean-field VI for some problems
Can be viewed as locally minimizing the KL divergence in the opposite direction from VI: $KL (p ∥ q)$ rather than $KL (q ∥ p)$
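
The moment-matching step at the core of EP can be sketched for a single factor. This is not a full EP loop: it assumes a Gaussian "cavity" distribution (the approximation with one factor removed) and a probit likelihood factor, multiplies them, and returns the mean and variance of the product by grid quadrature. The names `match_moments`, `normal_pdf`, and `probit`, and the grid settings, are hypothetical choices for the illustration.

```python
import math


def normal_pdf(x, mean, var):
    """Density of N(mean, var) at x."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def probit(x):
    """Standard normal CDF, used here as an example likelihood factor."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def match_moments(cavity_mean, cavity_var, site, lo=-10.0, hi=10.0, n=4001):
    """Mean and variance of the product cavity(x) * site(x), via a grid sum."""
    h = (hi - lo) / (n - 1)
    z0 = z1 = z2 = 0.0
    for i in range(n):
        x = lo + i * h
        w = normal_pdf(x, cavity_mean, cavity_var) * site(x) * h
        z0 += w          # normalizing constant
        z1 += w * x      # first raw moment (unnormalized)
        z2 += w * x * x  # second raw moment (unnormalized)
    mean = z1 / z0
    return mean, z2 / z0 - mean ** 2


new_mean, new_var = match_moments(0.0, 1.0, probit)
print(f"moment-matched Gaussian: N({new_mean:.4f}, {new_var:.4f})")
```

The Gaussian with these moments is the member of the Gaussian family closest to the product in the $KL (p ∥ q)$ sense. For a standard normal cavity and a probit factor the moments are also available in closed form (mean $1/\sqrt{π}$, variance $1 - 1/π$), which gives a check on the quadrature.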

When to Use What

Method	Speed	Accuracy	Best for
MCMC/HMC	Slow	Exact (asymptotically)	Final inference
Laplace	Fast	Good if unimodal	Quick checks, INLA
VI	Fast	Approximate	Large data, exploration
EP	Medium	Good	Sparse/GP models
