Model Checking

Summary

Chapter 6 of BDA3 presents posterior predictive checking as the primary tool for assessing Bayesian model fit. The core idea: simulate replicated data from the fitted model and compare them with the observed data.

Posterior Predictive Checking

Generate replicated datasets $y^{rep}$ from the posterior predictive distribution:

$$p(y^{rep} \mid y) = \int p(y^{rep} \mid \theta) \, p(\theta \mid y) \, d\theta$$

If the model fits well, $y^{rep}$ should “look like” the observed data $y$.
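A minimal sketch of this simulation, assuming a normal model with known standard deviation and a flat prior on the mean, so that the posterior is available in closed form. The data values, `sigma`, and all variable names are illustrative assumptions, not taken from BDA3.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical observed data (illustrative values only).
y = np.array([4.1, 5.3, 3.8, 6.0, 5.1, 4.7, 5.6, 4.4])
n = y.size
sigma = 1.0  # assumed known for this sketch

n_draws = 1000

# Under a flat prior on mu, the posterior is Normal(mean(y), sigma^2 / n).
mu_draws = rng.normal(y.mean(), sigma / np.sqrt(n), size=n_draws)

# One replicated dataset per posterior draw: y_rep[s] ~ Normal(mu_draws[s], sigma).
y_rep = rng.normal(mu_draws[:, None], sigma, size=(n_draws, n))
```

Each row of `y_rep` is one replicated dataset of the same size as `y`. Drawing a fresh $θ$ for every replication is what carries out the integral over $p(θ \mid y)$ by Monte Carlo.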

Test Quantities and Bayesian p-values

Define a test quantity $T(y, \theta)$: any scalar summary of the data and parameters. The posterior predictive p-value is:

$$p_B = \Pr(T(y^{rep}, \theta) \geq T(y, \theta) \mid y)$$

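Values of $p_B$ near 0 or 1 indicate that the model fails to reproduce the aspect of the data captured by $T$. Unlike a classical p-value, which is computed at a fixed parameter value, $p_B$ averages over the posterior distribution of $\theta$ and so accounts for parameter uncertainty.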
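The p-value can be estimated by Monte Carlo as the fraction of posterior draws in which the replicated test quantity is at least as large as the observed one. The sketch below assumes the same toy setup as before (normal model, known `sigma`, flat prior on the mean) and uses a chi-square-style discrepancy as the test quantity. The data, the choice of discrepancy, and the function name `discrepancy` are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical observed data and known sigma (assumptions for this sketch).
y = np.array([4.1, 5.3, 3.8, 6.0, 5.1, 4.7, 5.6, 4.4])
n, sigma = y.size, 1.0
n_draws = 4000

# Posterior draws of mu under a flat prior, and one y_rep per draw.
mu = rng.normal(y.mean(), sigma / np.sqrt(n), size=n_draws)
y_rep = rng.normal(mu[:, None], sigma, size=(n_draws, n))

def discrepancy(data, mu):
    """T(y, theta): sum of squared standardized deviations from mu, per draw."""
    return np.sum(((data - mu[:, None]) / sigma) ** 2, axis=1)

T_obs = discrepancy(y, mu)      # observed data, evaluated at each posterior draw
T_rep = discrepancy(y_rep, mu)  # replicated data, paired with the same draw

# Monte Carlo estimate of Pr(T(y_rep, theta) >= T(y, theta) | y).
p_B = np.mean(T_rep >= T_obs)
```

Because $T$ depends on $θ$, both `T_obs` and `T_rep` are evaluated at the same posterior draw before being compared. For a test quantity that depends on the data alone, `T_obs` would reduce to a single number.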

Graphical Checks

Compare histograms/density plots of $y$ vs. $y^{rep}$
Overlay multiple $y^{rep}$ datasets on the observed data
Residual plots: Bayesian residuals use a single posterior draw of $\theta$, not a point estimate
Binned residual plots: useful for discrete data where raw residuals are hard to interpret
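As one way to make the binned residual idea concrete, the sketch below sorts observations by fitted value, splits them into equal-sized bins, and averages the residuals within each bin. It assumes binary outcomes and a logistic model. The helper `binned_residuals`, the simulated data, and the values `a_draw` and `b_draw` (standing in for a single posterior draw) are hypothetical.

```python
import numpy as np

def binned_residuals(fitted, residuals, n_bins=10):
    """Average fitted values and residuals within bins of sorted fitted values."""
    order = np.argsort(fitted)
    fitted_bins = np.array_split(fitted[order], n_bins)
    resid_bins = np.array_split(residuals[order], n_bins)
    bin_fitted = np.array([b.mean() for b in fitted_bins])
    bin_resid = np.array([b.mean() for b in resid_bins])
    return bin_fitted, bin_resid

rng = np.random.default_rng(1)

# Simulated binary data (illustrative only).
x = rng.uniform(-2, 2, size=500)
a_draw, b_draw = 0.3, 1.2  # stand-in for one posterior draw of the coefficients
p = 1.0 / (1.0 + np.exp(-(a_draw + b_draw * x)))
y = rng.binomial(1, p)

# Bayesian residuals at this single draw, then binned.
bin_fitted, bin_resid = binned_residuals(p, y - p, n_bins=10)
```

Plotting `bin_resid` against `bin_fitted` gives the binned residual plot. Raw residuals for binary data take only two values per fitted probability, which is why the averaging step is needed before any pattern becomes visible.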

Key Principles

Model checking is about understanding where the model fails, not binary accept/reject
Checking is iterative: identify misfit → expand the model → check again (see Bayesian Workflow - Overview)
Sensitivity analysis: assess how conclusions change under alternative models or priors
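A simple form of prior sensitivity analysis is to refit the model under several priors and compare the resulting posteriors. The sketch below assumes a normal likelihood with known `sigma` and a conjugate normal prior on the mean, so the posterior has a closed form. The data, the prior settings, and the function name `normal_posterior` are illustrative assumptions.

```python
import numpy as np

def normal_posterior(y, sigma, prior_mean, prior_sd):
    """Posterior mean and sd of mu for a Normal(mu, sigma^2) likelihood
    with a Normal(prior_mean, prior_sd^2) prior on mu."""
    n = y.size
    precision = 1.0 / prior_sd**2 + n / sigma**2
    mean = (prior_mean / prior_sd**2 + n * y.mean() / sigma**2) / precision
    return mean, np.sqrt(1.0 / precision)

# Hypothetical observed data and known sigma.
y = np.array([4.1, 5.3, 3.8, 6.0, 5.1, 4.7, 5.6, 4.4])
sigma = 1.0

# Alternative priors: same center, increasingly diffuse.
prior_sds = [0.5, 2.0, 10.0]
results = [normal_posterior(y, sigma, prior_mean=0.0, prior_sd=s) for s in prior_sds]

for s, (m, sd) in zip(prior_sds, results):
    print(f"prior sd {s:5.1f}: posterior mean {m:.3f}, posterior sd {sd:.3f}")
```

If the posterior summaries barely move across reasonable priors, the conclusions are robust to that choice. If they shift substantially, as they do here under the tightest prior, the prior is doing real work and deserves scrutiny.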
