History Matching for ABMs

Summary

History Matching (HM) is an iterative wave-based procedure that eliminates implausible parameter regions for an ABM. In each wave, parameter samples are scored by an implausibility metric combining model error with all quantified uncertainties. Implausible regions are discarded; the retained non-implausible space is sampled more densely in the next wave. HM stops when the non-implausible space stops shrinking.

Overview

HM originated in climate and physical modeling (Craig et al. 1997) and has been adapted for ABMs. Unlike Bayesian calibration, HM makes no probabilistic statements about parameters: it only labels a region as implausible (“could not plausibly produce the observed data”) or non-implausible (“could”). This binary output is then used as an informed prior for Approximate Bayesian Computation (ABC).

Implausibility Score

Definition: Implausibility Score

For a parameter set $x$ and observation $z^{r}$, the implausibility is:
$I^{r}(x) = \frac{d(z^{r}, f^{r}(x))}{\sqrt{V_{s}^{r} + V_{o}^{r} + V_{m}^{r}}}$
where:

$d(z^{r}, f^{r}(x))$ = error (distance) between the simulation output $f^{r}(x)$ and the observed value $z^{r}$

$V_{s}^{r}$ = ensemble variance (stochastic variability across runs with same parameters)

$V_{o}^{r}$ = observation uncertainty

$V_{m}^{r}$ = model discrepancy variance

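A parameter set $x$ is implausible if $I^{r}(x) \geq c$. By Pukelsheim’s $3\sigma$ rule, at most $4/81 \approx 0.049$ of the probability mass of any continuous unimodal distribution lies three or more standard deviations from its mean, so $c = 3$ ensures the correct parameter set has $I^{r}(x) < 3$ with probability $\geq 0.95$.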
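A minimal Python sketch of the score and the cutoff follows. It assumes the score is taken on the standardised scale, that is, the absolute error divided by the square root of the summed variances, which is the scale on which the $3\sigma$ cutoff applies. The function names and the numbers are illustrative and do not come from the source.

```python
import math


def implausibility(error, v_s, v_o, v_m):
    """Absolute model error divided by the total standard deviation."""
    total_variance = v_s + v_o + v_m
    if total_variance <= 0:
        raise ValueError("total variance must be positive")
    return abs(error) / math.sqrt(total_variance)


def is_implausible(error, v_s, v_o, v_m, c=3.0):
    """Apply the cutoff: a score at or above c rules the parameter set out."""
    return implausibility(error, v_s, v_o, v_m) >= c


# Hypothetical numbers: error 12 with variances 4 + 9 + 3 = 16 (std. dev. 4).
score = implausibility(12.0, v_s=4.0, v_o=9.0, v_m=3.0)
print(score, is_implausible(12.0, 4.0, 9.0, 3.0))
```

With these numbers the score sits exactly on the cutoff, so the parameter set is discarded; a slightly smaller error would be retained.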

Wave Structure

Each HM wave:

Sample $N$ parameter sets from the current non-implausible space using Latin Hypercube Sampling (LHS)
Run the model $K$ times for each sample (ensemble) to estimate $V_{s}^{r}$
Calculate implausibility $I^{r} (x)$ for each sample
Discard implausible samples ( $I^{r} (x) \geq c$ ); retain non-implausible samples
The retained non-implausible region becomes the sampling space for the next wave

Stopping criteria: stop when every sampled parameter set is implausible, or when the non-implausible region no longer shrinks between waves.
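The wave steps and both stopping criteria can be sketched in Python as below. This is an illustration under several assumptions, none of which come from the source: `toy_model` is a hypothetical stochastic simulator standing in for the ABM, the Latin Hypercube sampler is hand-rolled with the standard library, the non-implausible region is approximated by the axis-aligned bounding box of the retained samples, $V_{o}^{r}$ and $V_{m}^{r}$ are passed in as fixed numbers, and the error for a sample is taken as the mean absolute error over its ensemble.

```python
import math
import random
import statistics


def latin_hypercube(n, bounds, rng):
    """One point per stratum in every dimension, strata paired at random."""
    columns = []
    for low, high in bounds:
        width = (high - low) / n
        column = [low + (i + rng.random()) * width for i in range(n)]
        rng.shuffle(column)
        columns.append(column)
    return list(zip(*columns))


def toy_model(x, rng):
    """Hypothetical stochastic simulator standing in for the ABM."""
    return x[0] + x[1] + rng.gauss(0.0, 0.1)


def run_wave(bounds, z, v_o, v_m, rng, n=50, k=10, c=3.0):
    """Return the samples from one wave that are not ruled out."""
    samples = latin_hypercube(n, bounds, rng)
    # K repeated runs per sample give an ensemble of errors.
    errors = [[abs(z - toy_model(x, rng)) for _ in range(k)] for x in samples]
    # Ensemble variance: per-sample variance averaged over the N samples.
    v_s = statistics.mean(statistics.variance(e) for e in errors)
    sd = math.sqrt(v_s + v_o + v_m)
    return [x for x, e in zip(samples, errors) if statistics.mean(e) / sd < c]


def bounding_box(points):
    """Axis-aligned box around the retained samples (a simplification)."""
    return [(min(col), max(col)) for col in zip(*points)]


def area(bounds):
    return math.prod(high - low for low, high in bounds)


def history_match(bounds, z, v_o, v_m, max_waves=10, tol=0.05, seed=0):
    rng = random.Random(seed)
    for _ in range(max_waves):
        kept = run_wave(bounds, z, v_o, v_m, rng)
        if not kept:
            return None  # every sampled parameter set was implausible
        new_bounds = bounding_box(kept)
        shrunk = area(new_bounds) < (1.0 - tol) * area(bounds)
        bounds = new_bounds
        if not shrunk:
            break  # the region has stopped shrinking
    return bounds


result = history_match([(0.0, 10.0), (0.0, 10.0)], z=5.0, v_o=0.25, v_m=0.25)
print(result)
```

A bounding box overstates the retained region when that region is not rectangular (here it is a diagonal band), so a real application would likely need a tighter description of the non-implausible space.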

Model Discrepancy

Definition: Model Discrepancy Variance ( $V_{m}^{r}$ )

$V_{m}^{r} = \frac{1}{N - 1} \sum_{n = 1}^{N} (d (z^{r}, f^{r} (x_{n})) - E^{r} (X))^{2}$
where $E^{r} (X)$ is the average model error across all $N$ parameter sets tested. This estimates how much variation in model output arises from imperfect model specification — the gap between the best model and reality.
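Read this way, $V_{m}^{r}$ is the sample variance of the per-parameter-set errors. A short Python sketch with hypothetical error values (the function name is illustrative):

```python
def model_discrepancy_variance(errors):
    """Sample variance (N - 1 denominator) of the errors d(z, f(x_n))."""
    n = len(errors)
    if n < 2:
        raise ValueError("need at least two parameter sets")
    mean_error = sum(errors) / n  # E(X): average error over the N sets
    return sum((e - mean_error) ** 2 for e in errors) / (n - 1)


# Hypothetical errors for N = 4 parameter sets.
errors = [2.0, 4.0, 4.0, 6.0]
v_m = model_discrepancy_variance(errors)
print(v_m)
```

Here the mean error is 4, the squared deviations sum to 8, and dividing by $N - 1 = 3$ gives roughly 2.67.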

Key implication: Model discrepancy cannot be reduced by better calibration — it reflects fundamental model imperfection and must be explicitly acknowledged.

Ensemble Variance

Definition: Ensemble Variance ( $V_{s}^{r}$ )

$V_{s}^{r} = \frac{1}{N} \sum_{n = 1}^{N} [\frac{1}{K - 1} \sum_{k = 1}^{K} (d (z^{r}, f_{k}^{r} (x_{n})) - E_{K}^{r} (x_{n}))^{2}]$
where $K$ is the ensemble size and $E_{K}^{r} (x_{n}) = \frac{1}{K} \sum_{k = 1}^{K} d (z^{r}, f_{k}^{r} (x_{n}))$ .
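In code, this is the within-ensemble sample variance computed for each parameter set and then averaged over the $N$ sets. A small Python sketch, using a hypothetical nested list of errors with one inner list of $K$ runs per parameter set:

```python
import statistics


def ensemble_variance(errors_by_sample):
    """Mean over the N parameter sets of each ensemble's sample variance."""
    return statistics.mean(
        statistics.variance(runs) for runs in errors_by_sample
    )


# Hypothetical errors: N = 3 parameter sets, K = 2 runs each.
errors_by_sample = [[1.0, 3.0], [2.0, 2.0], [0.0, 4.0]]
v_s = ensemble_variance(errors_by_sample)
print(v_s)
```

The three ensembles have sample variances 2, 0 and 8, so the averaged ensemble variance is $10/3$.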

Choose $K$ by running models across a range of ensemble sizes and selecting the smallest $K$ at which variance stabilises. In the SugarScape example, $K = 200$ ; in the birds model, $K = 30$ .
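One way to make "stabilises" concrete is to accept the first ensemble size whose variance estimate is close to the estimate at the next larger size. The sketch below assumes that rule; the relative tolerance and the variance values are hypothetical and are not taken from the SugarScape or birds experiments.

```python
def smallest_stable_k(variance_by_k, rel_tol=0.1):
    """First K whose variance is within rel_tol of the next candidate's."""
    ks = sorted(variance_by_k)
    for k, next_k in zip(ks, ks[1:]):
        current, following = variance_by_k[k], variance_by_k[next_k]
        if abs(following - current) <= rel_tol * abs(current):
            return k
    return None  # no candidate stabilised; try larger ensembles


# Hypothetical ensemble-variance estimates at each candidate K.
variance_by_k = {10: 5.0, 20: 3.0, 50: 2.1, 100: 2.0, 200: 2.0}
chosen = smallest_stable_k(variance_by_k)
print(chosen)
```

With these numbers the estimate still moves substantially between $K = 10$, $20$ and $50$, but changes by less than 10% from $K = 50$ to $K = 100$, so $K = 50$ is selected.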

Multiple Outputs

When the model produces multiple observed outputs ( $R > 1$ , e.g., small/medium/large farm counts in RISC), a separate implausibility measure is computed for each output and the maximum is used:

$I (x) = \max_{r} I^{r} (x)$
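Taking the maximum means a parameter set is ruled out as soon as any single output is implausible. A Python sketch with hypothetical errors and variances for three outputs, again assuming the score is the absolute error over the total standard deviation:

```python
import math


def max_implausibility(errors, variances):
    """Return the largest per-output score and the output that produced it.

    errors[name] is the model error for one output; variances[name] is a
    tuple (v_s, v_o, v_m) for the same output.
    """
    scores = {
        name: abs(errors[name]) / math.sqrt(sum(variances[name]))
        for name in errors
    }
    worst = max(scores, key=scores.get)
    return scores[worst], worst


# Hypothetical values for small, medium and large farm counts.
errors = {"small": 2.0, "medium": 9.0, "large": 1.0}
variances = {name: (1.0, 2.0, 1.0) for name in errors}
score, worst_output = max_implausibility(errors, variances)
print(score, worst_output)
```

Each output here has total variance 4, so the scores are 1.0, 4.5 and 0.5; the medium-farm output drives the overall score of 4.5, which exceeds a cutoff of 3.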

Key Differences from Other Methods

Aspect	HM	ABC	GA / Simulated Annealing
Output	Non-implausible region	Posterior distribution	Point estimate
Probabilistic statements	No	Yes	No
Handles uncertainty explicitly	Yes	Implicitly via $ε$	No
Computational cost (runs)	80–320 (birds)	11,000+ (birds, no HM)	256–290 (birds)

SugarScape Example

In the SugarScape toy model (2 parameters: metabolism $\in [1, 4]$ , vision $\in [1, 16]$ ):

Wave 1: Full grid tested; substantial implausible region identified (dark grey in figure)
Wave 10: Non-implausible region narrowed to upper-right corner (high metabolism, high vision)
HM correctly identifies that the true parameters {metabolism=4, vision=6} lie in this region

Connections

The three uncertainty components $V_{m}^{r}$ , $V_{s}^{r}$ , $V_{o}^{r}$ are defined in Uncertainty Quantification for ABM Calibration
HM output feeds directly into Approximate Bayesian Computation for ABMs as the uniform prior
Extends ABM Calibration Overview beyond GA point estimation
Latin Hypercube Sampling used for space-filling design; related to Experimental Design for ABMs
