Model Comparison

Summary

Chapter 7 of BDA3 covers methods for evaluating, comparing, and expanding models based on predictive accuracy. The key measure is expected log predictive density (ELPD), estimated via cross-validation or information criteria.

Measures of Predictive Accuracy

The gold standard is the expected log pointwise predictive density (ELPD):

$$\mathrm{elpd} = \sum_{i=1}^{n} \int p_{t}(\tilde{y}_{i}) \, \log p(\tilde{y}_{i} \mid y) \, d\tilde{y}_{i}$$

where $p_{t}$ is the true data-generating distribution. This must be estimated since $p_{t}$ is unknown.
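
A natural starting point for that estimate, written here as a sketch in the same notation as the WAIC formula in the next section, is the within-sample log pointwise predictive density computed from $S$ posterior draws $\theta^{s}$:

$$\mathrm{lppd} = \sum_{i=1}^{n} \log \left( \frac{1}{S} \sum_{s=1}^{S} p(y_{i} \mid \theta^{s}) \right)$$

Because the same data are used both to fit and to evaluate the model, lppd tends to overestimate elpd. The information criteria and cross-validation methods in the following sections can be read as different ways of correcting for that optimism.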

Information Criteria

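AIC: $-2 \log p(y \mid \hat{\theta}) + 2k$, which penalizes the maximized log-likelihood by the number of parameters $k$
DIC: replaces $k$ with the effective number of parameters $p_{D}$
WAIC (Widely Applicable IC): fully Bayesian, computed pointwise from the posterior draws: $\mathrm{elpd}_{\mathrm{WAIC}} = \sum_{i=1}^{n} \left( \log \frac{1}{S} \sum_{s=1}^{S} p(y_{i} \mid \theta^{s}) - V_{s=1}^{S} \left[ \log p(y_{i} \mid \theta^{s}) \right] \right)$, where $V_{s=1}^{S}$ is the sample variance over the $S$ posterior draws
Note that AIC as written is on the deviance scale (lower is better), while $\mathrm{elpd}_{\mathrm{WAIC}}$ is on the log-density scale (higher is better); multiplying by $-2$ converts between the two.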
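
The WAIC formula above can be evaluated directly from a matrix of pointwise log-likelihoods. The following is a minimal sketch using only the Python standard library; the function names (`waic`, `log_mean_exp`, `sample_variance`) and the tiny `log_lik` matrix are illustrative assumptions, not part of BDA3 or of any package.

```python
import math

def log_mean_exp(values):
    """Numerically stable log of the mean of exp(values)."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values) / len(values))

def sample_variance(values):
    """Sample variance with the S - 1 denominator."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / (len(values) - 1)

def waic(log_lik):
    """Compute (elpd_waic, p_waic) from log_lik[s][i] = log p(y_i | theta^s)."""
    n = len(log_lik[0])
    lppd = 0.0    # sum_i log( (1/S) sum_s p(y_i | theta^s) )
    p_waic = 0.0  # sum_i Var_s[ log p(y_i | theta^s) ]
    for i in range(n):
        column = [row[i] for row in log_lik]
        lppd += log_mean_exp(column)
        p_waic += sample_variance(column)
    return lppd - p_waic, p_waic

# Hypothetical input: 3 posterior draws (rows) by 2 observations (columns).
log_lik = [
    [-1.0, -2.0],
    [-1.2, -1.8],
    [-0.8, -2.2],
]
elpd_waic, p_waic = waic(log_lik)
print(f"elpd_waic = {elpd_waic:.3f}, p_waic = {p_waic:.3f}")
```

In practice the `log_lik` matrix would come from evaluating the model's likelihood at each posterior draw; the averaging is done in log space to avoid underflow when individual densities are very small.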

Cross-Validation

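Leave-one-out CV (LOO-CV): the gold standard, but expensive because exact computation refits the model $n$ times, once per held-out observation
Pareto-smoothed importance sampling (PSIS-LOO): an efficient approximation that reuses draws from the full posterior with smoothed importance weights, so no refitting is needed; implemented in the loo R package
PSIS-LOO is preferred over WAIC in practice because it comes with a better diagnostic (the Pareto $\hat{k}$ diagnostic) for when the approximation is unreliable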
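
To make the importance-sampling idea concrete, the sketch below compares exact LOO-CV with a raw importance-sampling estimate on a toy model. It assumes $y_{i} \sim N(\theta, 1)$ with a flat prior on $\theta$, so both the full posterior and each leave-one-out predictive are available in closed form. Note that this is plain, unsmoothed importance sampling, not PSIS: the Pareto smoothing step and the $\hat{k}$ diagnostic are omitted, and all function names are hypothetical.

```python
import math
import random

def log_mean_exp(values):
    """Numerically stable log of the mean of exp(values)."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values) / len(values))

def normal_logpdf(x, mean, var):
    """Log density of N(mean, var) at x."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)

def exact_loo(y):
    """Exact LOO for y_i ~ N(theta, 1) with a flat prior on theta."""
    n = len(y)
    total = 0.0
    for i in range(n):
        rest_mean = (sum(y) - y[i]) / (n - 1)
        # theta | y_{-i} ~ N(rest_mean, 1/(n-1)); the predictive adds unit noise variance.
        total += normal_logpdf(y[i], rest_mean, 1.0 + 1.0 / (n - 1))
    return total

def is_loo(y, draws):
    """Raw importance-sampling LOO using draws from the full posterior."""
    total = 0.0
    for yi in y:
        neg_log_lik = [-normal_logpdf(yi, theta, 1.0) for theta in draws]
        # p(y_i | y_{-i}) is approximated by 1 / mean_s( 1 / p(y_i | theta^s) ).
        total += -log_mean_exp(neg_log_lik)
    return total

rng = random.Random(0)
y = [rng.gauss(0.0, 1.0) for _ in range(20)]
y_bar = sum(y) / len(y)
# Full posterior under the flat prior: theta | y ~ N(y_bar, 1/n).
draws = [rng.gauss(y_bar, math.sqrt(1.0 / len(y))) for _ in range(4000)]

elpd_exact = exact_loo(y)
elpd_is = is_loo(y, draws)
print(f"exact LOO: {elpd_exact:.3f}, IS-LOO: {elpd_is:.3f}")
```

In this well-behaved example the raw weights have finite variance, so the two estimates should agree closely. For models where a single observation is highly influential the raw weights can have very heavy tails, which is the situation Pareto smoothing and the $\hat{k}$ diagnostic are designed to handle.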

Bayes Factors

$$BF_{12} = \frac{p(y \mid M_{1})}{p(y \mid M_{2})} = \frac{\int p(y \mid \theta_{1}, M_{1}) \, p(\theta_{1} \mid M_{1}) \, d\theta_{1}}{\int p(y \mid \theta_{2}, M_{2}) \, p(\theta_{2} \mid M_{2}) \, d\theta_{2}}$$

Warning

Bayes factors are sensitive to the prior, especially for vague priors. BDA3 generally recommends predictive approaches (LOO, WAIC) over Bayes factors for model comparison.
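
The prior sensitivity can be seen in a small worked case. The sketch below assumes $n$ observations $y_{i} \sim N(\theta, 1)$ and compares a point null $M_{1}: \theta = 0$ against $M_{2}: \theta \sim N(0, \tau^{2})$. Since $\bar{y}$ is sufficient, the Bayes factor reduces to a ratio of the marginal densities of $\bar{y}$ under the two models, $N(0, 1/n)$ and $N(0, \tau^{2} + 1/n)$. The function name `bayes_factor_12` and the values $\bar{y} = 0.4$, $n = 25$ are illustrative assumptions.

```python
import math

def normal_logpdf(x, mean, var):
    """Log density of N(mean, var) at x."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)

def bayes_factor_12(y_bar, n, tau):
    """BF_12 for M1: theta = 0 versus M2: theta ~ N(0, tau^2), unit noise variance."""
    log_m1 = normal_logpdf(y_bar, 0.0, 1.0 / n)             # marginal of y_bar under M1
    log_m2 = normal_logpdf(y_bar, 0.0, tau ** 2 + 1.0 / n)  # marginal of y_bar under M2
    return math.exp(log_m1 - log_m2)

# Same data summary, increasingly vague priors on theta under M2.
for tau in (1.0, 10.0, 100.0, 1000.0):
    bf = bayes_factor_12(y_bar=0.4, n=25, tau=tau)
    print(f"tau = {tau:7.1f}  BF_12 = {bf:.3f}")
```

With the data held fixed, widening the prior under $M_{2}$ pushes the Bayes factor steadily toward $M_{1}$, even though the posterior for $\theta$ under $M_{2}$ barely changes once $\tau$ is large. Predictive measures such as LOO depend on the posterior rather than on the marginal likelihood, which is one reason they are less affected by this choice.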
