Summary

Sections 7—8 of Gelman et al. (2020) cover the iterative heart of Bayesian workflow: modifying models in response to data, failures, and new understanding, then comparing and combining multiple models. The goal is not to find the single “best” model but to understand how inferences change across a topology of related models.

Modifying a Model (Section 7)

Model expansion is triggered by:

Constructing a Model for the Data

The data model is often more important than its distributional form. Key considerations include how the data are linked to underlying parameters of interest, handling nonsampling error, and allowing for systematic measurement bias. Any generative model is necessarily an approximation.

Incorporating Additional Data

Adding data can be as simple as adding predictors, or as complex as integrating multiple data sources (e.g., direct measurements combined with population summary statistics). When more parameters are added, not all can have large effects — this motivates hierarchical priors and regularization.

Working with Prior Distributions

Priors exist on a ladder of informativeness:

Prior TypeExample
Improper flatUniform on
Super-vague but proper
Weakly informative
Generic weakly informativeDefault regularizing priors
Specific informative based on domain knowledge

Information Budget

As models grow more complex, the “information budget” must be divided among more parameters. Priors often need to become tighter as models expand, to avoid destabilizing estimates. Prior predictive checking (Choosing and Building Models) helps calibrate this.

Priors should be understood as constraints — choosing a prior is choosing how much subject-matter information to include. The joint prior over all parameters matters more than individual marginals, especially in high dimensions.

A Topology of Models

Models in a given framework (e.g., ARMA, linear regression with variable selection) form a partial ordering or topology. For example, AR(1) is simpler than AR(2), which is simpler than ARMA(2,1). Navigating this topology — understanding connections between parameters across models — is central to workflow.

Understanding and Comparing Multiple Models (Section 8)

Why Fit Multiple Models?

  • Hard to fit a big model directly; build up incrementally
  • Coding errors and conceptual mistakes require iterative correction
  • New data motivate model expansion
  • Comparing inferences across models reveals what drives conclusions

Multiverse Analysis

Instead of selecting one model, perform a multiverse analysis: fit all plausible model variants and see how conclusions change. If conclusions are robust across models, the choice of “best” model matters less. See Model Comparison.

Cross Validation and Stacking

When comparing models via LOO-CV, do not simply pick the model with the best score if there is non-negligible uncertainty in the comparison. Instead, use stacking to combine predictive distributions:

where weights are chosen to minimize cross-validation error. Stacking can be viewed as pointwise model selection and fills the gap between independent-error ML validation and grouped/structured data. It outperforms traditional Bayesian model averaging, which is sensitive to prior specification on parameters that do not affect predictions.

Projection Predictive Variable Selection

For large model spaces, projection predictive selection (Piironen and Vehtari, 2017) finds smaller submodels with comparable predictive performance to the full expanded model, avoiding the overfitting that comes from searching through many models independently.

See Also