MCMC Basics

Summary

Chapter 11 of BDA3 introduces Markov chain Monte Carlo — the workhorse of Bayesian computation. MCMC constructs a Markov chain whose stationary distribution is the posterior, enabling sampling from complex, high-dimensional posteriors.

Gibbs Sampler

Iteratively sample each parameter from its full conditional distribution:

$$θ_{j}^{(t+1)} \sim p(θ_{j} \mid θ_{-j}^{(t)}, y)$$

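Requires full conditional distributions that can be sampled directly (often available in closed form for conjugate models)
Each step updates one parameter or block given the most recent values of all the others, cycling through all blocks
Can mix slowly when parameters are highly correlated in the posterior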
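
The cycle above can be sketched for a toy target. This is a minimal illustration, not code from BDA3: it assumes a bivariate normal posterior with mean `y`, unit variances and known correlation `rho`, for which each full conditional is itself normal. The function name `gibbs_bivariate_normal` and all parameter values are hypothetical.

```python
import math
import random


def gibbs_bivariate_normal(y, rho, n_iter, theta0, seed=0):
    """Gibbs sampler for an assumed bivariate normal posterior.

    Target: mean y, unit variances, correlation rho. Each full conditional
    is N(y_j + rho * (theta_other - y_other), 1 - rho^2).
    """
    rng = random.Random(seed)
    cond_sd = math.sqrt(1.0 - rho ** 2)
    theta1, theta2 = theta0
    draws = []
    for _ in range(n_iter):
        # Update theta1 given the current theta2, then theta2 given the new theta1.
        theta1 = rng.gauss(y[0] + rho * (theta2 - y[1]), cond_sd)
        theta2 = rng.gauss(y[1] + rho * (theta1 - y[0]), cond_sd)
        draws.append((theta1, theta2))
    return draws


draws = gibbs_bivariate_normal(y=(0.0, 0.0), rho=0.8, n_iter=5000, theta0=(2.5, -2.5))
kept = draws[1000:]  # discard the first 1000 iterations as warmup

n = len(kept)
mean1 = sum(a for a, _ in kept) / n
mean2 = sum(b for _, b in kept) / n
var1 = sum((a - mean1) ** 2 for a, _ in kept) / (n - 1)
var2 = sum((b - mean2) ** 2 for _, b in kept) / (n - 1)
cov = sum((a - mean1) * (b - mean2) for a, b in kept) / (n - 1)
corr = cov / math.sqrt(var1 * var2)
print(f"means: {mean1:.3f}, {mean2:.3f}  correlation: {corr:.3f}")
```

Raising `rho` toward 1 shrinks the conditional standard deviation, so each update moves only a short distance; that is the slow mixing under strong correlation noted above.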

Metropolis-Hastings Algorithm

More general: propose a move $θ^{*} \sim J (θ^{*} ∣ θ^{(t)})$ and accept with probability:

$$\min\left(1, \frac{p(θ^{*} \mid y)\, J(θ^{(t)} \mid θ^{*})}{p(θ^{(t)} \mid y)\, J(θ^{*} \mid θ^{(t)})}\right)$$

Random walk Metropolis: $J (θ^{*} ∣ θ) = N (θ, c^{2} Σ)$ — simple but can be slow in high dimensions
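Optimal acceptance rate for random walk Metropolis: about 0.44 in one dimension, falling to about 0.23 in high dimensions (results derived for approximately normal targets)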
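
A random walk Metropolis step might look like the sketch below. It assumes a one-dimensional standard normal target, known only up to a constant, and a symmetric normal proposal, so the $J$ terms cancel in the acceptance ratio. The names `log_target` and `random_walk_metropolis` and the proposal scale of 2.4 are illustrative choices, not taken from the source.

```python
import math
import random


def log_target(theta):
    """Unnormalized log density of the assumed target, a standard normal."""
    return -0.5 * theta ** 2


def random_walk_metropolis(log_p, theta0, scale, n_iter, seed=0):
    """Random walk Metropolis with a symmetric normal proposal."""
    rng = random.Random(seed)
    theta = theta0
    log_p_theta = log_p(theta)
    draws = []
    n_accept = 0
    for _ in range(n_iter):
        proposal = rng.gauss(theta, scale)
        log_p_prop = log_p(proposal)
        # Symmetric proposal: the ratio reduces to p(proposal | y) / p(theta | y).
        log_ratio = log_p_prop - log_p_theta
        if log_ratio >= 0 or rng.random() < math.exp(log_ratio):
            theta, log_p_theta = proposal, log_p_prop
            n_accept += 1
        # A rejected proposal repeats the current value in the chain.
        draws.append(theta)
    return draws, n_accept / n_iter


draws, accept_rate = random_walk_metropolis(
    log_target, theta0=5.0, scale=2.4, n_iter=20000
)
kept = draws[2000:]  # discard warmup
post_mean = sum(kept) / len(kept)
print(f"acceptance rate: {accept_rate:.2f}  posterior mean: {post_mean:.3f}")
```

Working on the log scale avoids underflow when densities are tiny. For an asymmetric proposal, the log proposal densities would have to be added back into `log_ratio`.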

Convergence Diagnostics

$\hat{R}$ statistic: compares between-chain and within-chain variance across $m$ parallel chains; $\hat{R} < 1.1$ is taken to indicate approximate convergence
Effective sample size $n_{eff}$: the number of independent draws that would carry the same information as the autocorrelated chain
Run multiple chains from dispersed starting points
Discard warmup/burn-in iterations
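
Both diagnostics can be computed from post-warmup draws of a scalar quantity. The sketch below assumes the split-chain form of $\hat{R}$ and a variogram-based estimate of $n_{eff}$, as I understand the definitions in BDA3; treat the details as an approximation, not a reference implementation. The AR(1) chains are synthetic stand-ins for real sampler output, and all function names are hypothetical.

```python
import math
import random
import statistics


def split_chains(chains):
    """Split every chain into first and second halves (middle draw dropped if odd)."""
    halves = []
    for chain in chains:
        half = len(chain) // 2
        halves.append(chain[:half])
        halves.append(chain[len(chain) - half:])
    return halves


def rhat_and_var_plus(split):
    """Return (R-hat, pooled variance estimate) for equal-length split chains."""
    n = len(split[0])
    chain_means = [statistics.fmean(c) for c in split]
    between = n * statistics.variance(chain_means)                       # B
    within = statistics.fmean([statistics.variance(c) for c in split])   # W
    var_plus = (n - 1) / n * within + between / n
    return math.sqrt(var_plus / within), var_plus


def effective_sample_size(split, var_plus):
    """Variogram-based n_eff, truncating the autocorrelation sum once a
    pair of consecutive estimated autocorrelations sums to a negative value."""
    m, n = len(split), len(split[0])

    def rho(t):
        v_t = sum((c[i] - c[i - t]) ** 2 for c in split for i in range(t, n))
        return 1.0 - v_t / (m * (n - t)) / (2.0 * var_plus)

    rho_sum, t = rho(1), 1
    while t + 2 < n:
        pair = rho(t + 1) + rho(t + 2)
        if pair < 0:
            break
        rho_sum += pair
        t += 2
    return m * n / (1.0 + 2.0 * rho_sum)


def ar1_chain(n, phi, offset, rng):
    """Synthetic autocorrelated chain (AR(1)), shifted by a constant offset."""
    x, out = rng.gauss(0.0, 1.0), []
    for _ in range(n):
        x = phi * x + rng.gauss(0.0, 1.0)
        out.append(x + offset)
    return out


rng = random.Random(1)
mixed = [ar1_chain(1000, 0.7, 0.0, rng) for _ in range(4)]      # chains agree
stuck = [ar1_chain(1000, 0.7, 3.0 * j, rng) for j in range(4)]  # chains disagree

split_mixed = split_chains(mixed)
rhat_mixed, var_plus_mixed = rhat_and_var_plus(split_mixed)
n_eff = effective_sample_size(split_mixed, var_plus_mixed)
rhat_stuck, _ = rhat_and_var_plus(split_chains(stuck))
print(f"mixed: R-hat {rhat_mixed:.3f}, n_eff {n_eff:.0f} of 4000 draws")
print(f"stuck: R-hat {rhat_stuck:.3f}")
```

Splitting each chain in half lets the statistic flag a single chain that is still drifting, in addition to chains that disagree with one another.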
