Efficient MCMC

Summary

Chapter 12 of BDA3 covers advanced MCMC methods, especially Hamiltonian Monte Carlo (HMC), which uses gradient information to make large, efficient moves through parameter space. Stan implements HMC for general models.

Improving Gibbs and Metropolis

Reparameterization: transform to reduce posterior correlations (e.g., centering/non-centering in hierarchical models)
Auxiliary variables / data augmentation: add latent variables to simplify conditionals (e.g., the $t$ distribution written as a scale mixture of normals with scaled inverse-$\chi^2$ variances)
Parameter expansion: adding redundant parameters can break dependence and improve mixing
Adaptive Metropolis: tune the jumping distribution during warmup, then fix it for inference, since continued adaptation can prevent convergence to the target distribution
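
The auxiliary-variable idea for the $t$ distribution can be checked by simulation. The sketch below is illustrative only and assumes NumPy; the values of `nu`, `mu`, `sigma`, and `n` are arbitrary choices for the example. It draws variances $V_i \sim \text{Inv-}\chi^2(\nu, \sigma^2)$, then $y_i \mid V_i \sim N(\mu, V_i)$, and compares the result with direct $t_\nu(\mu, \sigma^2)$ draws.

```python
import numpy as np

rng = np.random.default_rng(1)
nu, mu, sigma = 10.0, 0.0, 1.0   # illustrative values, not from the source
n = 200_000

# Latent variances: V ~ scaled Inv-chi^2(nu, sigma^2), i.e. nu * sigma^2 / X with X ~ chi^2_nu
V = nu * sigma**2 / rng.chisquare(nu, size=n)

# Conditional on V, each observation is normal, which is what makes the Gibbs conditionals simple
y_mix = rng.normal(mu, np.sqrt(V))

# Direct draws from t_nu(mu, sigma^2) for comparison
y_t = mu + sigma * rng.standard_t(nu, size=n)

# Theoretical variance of t_nu(mu, sigma^2), valid for nu > 2
var_theory = sigma**2 * nu / (nu - 2)

print("variance (mixture):", y_mix.var())
print("variance (direct t):", y_t.var())
print("variance (theory):", var_theory)
```

Marginally the two sets of draws should agree up to Monte Carlo error, while conditionally on `V` the model is an ordinary normal model.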

Hamiltonian Monte Carlo (HMC)

Treats sampling as simulating Hamiltonian dynamics with position $\theta$ and momentum $\phi$:

$$H(\theta, \phi) = -\log p(\theta \mid y) + \frac{1}{2} \phi^{T} M^{-1} \phi$$
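
For reference, the dynamics implied by this Hamiltonian, and the leapfrog discretization commonly used to simulate them, can be written out as follows. The step size $\epsilon$ and the proposal notation $(\theta^*, \phi^*)$ are introduced here for illustration.

$$\frac{d\theta}{dt} = \frac{\partial H}{\partial \phi} = M^{-1}\phi, \qquad \frac{d\phi}{dt} = -\frac{\partial H}{\partial \theta} = \nabla_\theta \log p(\theta \mid y)$$

One leapfrog step of size $\epsilon$:

$$\phi \leftarrow \phi + \tfrac{1}{2}\epsilon\, \nabla_\theta \log p(\theta \mid y), \qquad \theta \leftarrow \theta + \epsilon\, M^{-1}\phi, \qquad \phi \leftarrow \phi + \tfrac{1}{2}\epsilon\, \nabla_\theta \log p(\theta \mid y)$$

After the chosen number of steps, the proposal $(\theta^*, \phi^*)$ is accepted with probability

$$\min\left(1,\ \exp\left[H(\theta, \phi) - H(\theta^*, \phi^*)\right]\right),$$

which corrects for the discretization error in the simulated trajectory.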

Key properties:

Uses the gradient $\nabla_\theta \log p(\theta \mid y)$ to guide proposals, which makes it far more efficient than a random walk
Proposals travel far in parameter space while maintaining high acceptance rates
Scales much better to high dimensions than random-walk Metropolis
NUTS (No-U-Turn Sampler): automatically tunes the trajectory length
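
A minimal sketch of basic HMC may help make these properties concrete. It assumes NumPy, an identity mass matrix ($M = I$), and a fixed step size and number of leapfrog steps; it is not NUTS, which chooses the trajectory length automatically. The function name `hmc_sample`, the tuning values, and the correlated bivariate normal target are all hypothetical choices for this example.

```python
import numpy as np

def hmc_sample(log_p, grad_log_p, theta0, n_iter=4000, eps=0.2, n_leapfrog=12, seed=0):
    """Basic HMC with M = I, fixed step size eps and fixed number of leapfrog steps."""
    rng = np.random.default_rng(seed)
    theta = np.asarray(theta0, dtype=float)
    draws = np.empty((n_iter, theta.size))
    n_accept = 0
    for i in range(n_iter):
        phi = rng.standard_normal(theta.size)          # fresh momentum, phi ~ N(0, I)
        theta_new, phi_new = theta.copy(), phi.copy()

        # Leapfrog: half step in momentum, alternating full steps, final half step
        phi_new += 0.5 * eps * grad_log_p(theta_new)
        for step in range(n_leapfrog):
            theta_new += eps * phi_new
            if step < n_leapfrog - 1:
                phi_new += eps * grad_log_p(theta_new)
        phi_new += 0.5 * eps * grad_log_p(theta_new)

        # H = -log p(theta | y) + 0.5 * phi' phi; accept with probability min(1, exp(H_old - H_new))
        h_old = -log_p(theta) + 0.5 * phi @ phi
        h_new = -log_p(theta_new) + 0.5 * phi_new @ phi_new
        if np.log(rng.uniform()) < h_old - h_new:
            theta = theta_new
            n_accept += 1
        draws[i] = theta
    return draws, n_accept / n_iter

# Illustrative target: zero-mean bivariate normal with correlation 0.9
SIGMA = np.array([[1.0, 0.9], [0.9, 1.0]])
PREC = np.linalg.inv(SIGMA)

def log_p(theta):
    return -0.5 * theta @ PREC @ theta

def grad_log_p(theta):
    return -PREC @ theta

draws, accept_rate = hmc_sample(log_p, grad_log_p, theta0=np.zeros(2))
print("acceptance rate:", accept_rate)
print("sample correlation:", np.corrcoef(draws.T)[0, 1])
```

In this sketch each proposal follows the gradient for `n_leapfrog` steps before the accept/reject decision, which is how HMC can move far across a correlated posterior while still accepting most proposals. The step size and path length would normally be tuned during warmup rather than fixed by hand.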

Stan

Stan is a widely used platform for Bayesian inference:

Implements NUTS (adaptive HMC)
Requires a differentiable log-posterior, so parameters must be continuous; gradients are computed by automatic differentiation
Models are specified in Stan's own probabilistic programming language
Interfaces: RStan, PyStan, CmdStan
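
To illustrate what "automatic differentiation" means for a log-posterior, here is a toy forward-mode sketch using dual numbers in plain Python. It is an illustration only: Stan itself uses reverse-mode automatic differentiation implemented in C++, and the `Dual` class, the `derivative` helper, and the normal-mean log-posterior (flat prior, known `sigma`) are hypothetical names and choices made for this example.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Dual:
    """A value together with its derivative with respect to one input."""
    val: float
    der: float = 0.0

    def __add__(self, other):
        other = _lift(other)
        return Dual(self.val + other.val, self.der + other.der)
    __radd__ = __add__

    def __sub__(self, other):
        other = _lift(other)
        return Dual(self.val - other.val, self.der - other.der)

    def __rsub__(self, other):
        return _lift(other) - self

    def __mul__(self, other):
        other = _lift(other)
        # Product rule: (uv)' = u'v + uv'
        return Dual(self.val * other.val, self.der * other.val + self.val * other.der)
    __rmul__ = __mul__

def _lift(x):
    """Treat plain numbers as constants (derivative zero)."""
    return x if isinstance(x, Dual) else Dual(float(x), 0.0)

def log_posterior(mu, y, sigma=1.0):
    """Unnormalized log-posterior of a normal mean under a flat prior."""
    total = 0.0
    for yi in y:
        resid = yi - mu
        total = total + (-0.5 / sigma**2) * (resid * resid)
    return total

def derivative(f, x):
    """Evaluate df/dx at x by seeding the input's derivative with 1."""
    return f(Dual(x, 1.0)).der

y = [1.0, 2.0, 3.0]
grad_at_zero = derivative(lambda mu: log_posterior(mu, y), 0.0)
print(grad_at_zero)  # analytic gradient is sum(y_i - mu) / sigma^2 = 6.0 at mu = 0
```

The model code in `log_posterior` is written once as an ordinary function, and the gradient falls out of the arithmetic itself. This is the property that lets a gradient-based sampler work on general differentiable models without hand-derived gradients.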
