Fitting and Validating Computation

Summary

Sections 3—4 of Gelman et al. (2020) cover how to fit Bayesian models using MCMC (particularly HMC), validate that the computation is correct, and use simulated data to diagnose problems before touching real data. Key tools include convergence diagnostics, fake-data simulation, and simulation-based calibration (SBC).

Fitting a Model (Section 3)

Initial Values, Adaptation, and Warmup

MCMC algorithms (especially HMC as implemented in Stan) operate in stages:

Warmup — moves chains from arbitrary initial values toward the typical set of the posterior; also tunes sampler parameters (e.g., step size, mass matrix)
Sampling — draws from the posterior after chains have mixed

Warmup serves dual purposes: (a) reducing bias from initial values, and (b) providing information for tuning parameters. See MCMC Basics and Efficient MCMC.

How Long to Run

Run until $\hat{R} < 1.01$ for all parameters and effective sample sizes are sufficient for the quantities of interest. However, during early model exploration, running fewer iterations (e.g., 200 instead of 2000) can quickly reveal major issues like coding errors — analogous to avoiding premature optimization in software.

Approximate Algorithms and Approximate Models

There is a tradeoff between speed and accuracy. Early in the workflow, approximate methods (variational inference, Laplace approximation, INLA, penalized ML) can help explore model space quickly. Near the end, accurate MCMC is needed for fine-scale posterior features.

Fit Fast, Fail Fast

A key goal is for bad models to fail quickly rather than consuming resources on near-perfect inference for a model we will ultimately discard. If fitting to different data subsets yields wildly different estimates, the model is likely inappropriate.

Validating Computation with Constructed Data (Section 4)

Fake-Data Simulation

Simulate data from the model with known parameter values $θ^{*}$ , then fit the model and check:

Data informativeness — can the data constrain the parameters beyond the prior?
Parameter recovery — are the true $θ^{*}$ values recovered within posterior intervals?
Behavior across parameter space — how does the posterior change in different regions?

Why Fake Data First?

Real data confound modeling issues with computational issues. Simulated data isolate the computation: if the model cannot recover known truth from its own fake data, it will certainly fail on real data.

Simulation-Based Calibration (SBC)

SBC (Cook et al., 2006; Talts et al., 2020) is a more comprehensive approach than single-point fake-data checks:

Draw $θ \sim p (θ)$ from the prior
Simulate $y \sim p (y ∣ θ)$
Fit the model to obtain $p (θ ∣ y)$
Check that the posterior is calibrated relative to the prior-generated truth

Repeating this many times, if the algorithm is correct, the rank statistics of the true parameter within the posterior samples should be uniform. SBC can detect both computational errors and model specification issues.

Experimentation with Constructed Data

Fitting models to data simulated under different assumptions (e.g., $t$ -distributed errors when the model assumes normal) lets us study robustness and understand bias under misspecification.

Second Brain

Explorer

Fitting and Validating Computation

Fitting a Model (Section 3)

Initial Values, Adaptation, and Warmup

How Long to Run

Approximate Algorithms and Approximate Models

Fit Fast, Fail Fast

Validating Computation with Constructed Data (Section 4)

Fake-Data Simulation

Simulation-Based Calibration (SBC)

Experimentation with Constructed Data

See Also

Graph View

Table of Contents

Backlinks