Approximation Methods
Summary
Chapter 13 of BDA3 covers deterministic approximations to the posterior — faster alternatives to MCMC that trade exactness for speed. Useful for large datasets or rapid iteration.
Laplace Approximation
Approximate the posterior with a Gaussian centered at the mode:
- Fast: only requires optimization + Hessian computation
- Exact in the limit as (see Asymptotics and Frequentist Connections)
- Fails for multimodal, skewed, or bounded posteriors
- Foundation for INLA (Integrated Nested Laplace Approximation)
Variational Inference (VI)
Approximate with a simpler distribution by minimizing KL divergence:
- Mean-field VI: factorizes — fast but ignores posterior correlations
- ADVI (Automatic Differentiation VI): transforms to unconstrained space and uses gradient-based optimization
- Much faster than MCMC, useful for exploratory analysis and large datasets
- Tends to underestimate posterior variance
Expectation Propagation (EP)
- Iteratively refines a global approximation by matching moments to each data point’s contribution
- More accurate than mean-field VI for some problems
- Can be viewed as minimizing a reversed KL divergence
When to Use What
| Method | Speed | Accuracy | Best for |
|---|---|---|---|
| MCMC/HMC | Slow | Exact (asymptotically) | Final inference |
| Laplace | Fast | Good if unimodal | Quick checks, INLA |
| VI | Fast | Approximate | Large data, exploration |
| EP | Medium | Good | Sparse/GP models |
See Also
- Efficient MCMC — the exact alternative
- Fitting and Validating Computation — validating that approximations are adequate