Synthetic Likelihood Construction

Summary

The core estimator. Given summary statistics $s$ of the data, assume $s \sim N (μ_{θ}, Σ_{θ})$ . For any $θ$ , simulate $N_{r}$ replicate data sets, convert each to a statistics vector, and estimate $\hat{μ}_{θ}$ and $\hat{Σ}_{θ}$ . The log synthetic likelihood $l_{s} (θ)$ is then the MVN log-density of the observed $s$ under those estimates. $l_{s}$ is much smoother in $θ$ than the true density, invariant to reparameterization, robust to uninformative statistics, and behaves like a genuine likelihood as $N_{r} \to \infty$ — so it is explored by Metropolis–Hastings MCMC, with MLE recovered by quadratic regression and model comparison via AIC/GLRT.

Overview

This note states the synthetic-likelihood algorithm and its statistical properties — the machinery that turns the phase-insensitive statistics into well-founded inference. It is the computational heart of Wood (2010).

Main Content

Multivariate-normal approximation (Wood 2010, Eq. 2)

The chosen summary statistics are taken to be approximately multivariate normal:
$s \sim N (μ_{θ}, Σ_{θ}) .$
The mean $μ_{θ}$ and covariance $Σ_{θ}$ are generally intractable functions of the model parameters $θ$ , but for any $θ$ they can be estimated by simulation. Using regression coefficients as statistics promotes the normality that supports this approximation.

Evaluating the synthetic likelihood (Wood 2010, Fig. 2 & Eq.)

For a given parameter vector $θ$ :

Use the model to simulate $N_{r}$ replicate data sets $y_{1}^{*}, y_{2}^{*}, \dots$ and convert each to a statistics vector $s_{1}^{*}, s_{2}^{*}, \dots$ — exactly as $y$ was converted to $s$ .

Estimate the mean: $\hat{μ}_{θ} = \sum_{i} s_{i}^{*} / N_{r}$ .

Form $S = (s_{1}^{*} - \hat{μ}_{θ}, s_{2}^{*} - \hat{μ}_{θ}, \dots)$ and estimate the covariance: $\hat{Σ}_{θ} = S S^{⊤} / (N_{r} - 1)$ (a robust covariance estimator can be advantageous here).

Drop irrelevant constants; the log synthetic likelihood is

$l_{s} (θ) = - \frac{1}{2} (s - \hat{μ}_{θ})^{⊤} \hat{Σ}_{θ}^{- 1} (s - \hat{μ}_{θ}) - \frac{1}{2} lo g ∣ \hat{Σ}_{θ} ∣.$

Properties

Measures fit, but smoothly. Like any likelihood, $l_{s} (θ)$ measures the consistency of $θ$ with the data — but it is a much smoother function of $θ$ than the true density $f_{θ}$ , making it optimizable and samplable.
Generality. Handles hidden state variables, complicated observation processes, missing data, and multiple data series.
Invariance & robustness. $l_{s}$ is invariant to reparameterization and robust to the inclusion of uninformative statistics, so very careful statistic selection is unnecessary; statistics may be freely transformed to improve the normality approximation (Eq. 2).
Asymptotic in $N_{r}$ . $l_{s}$ behaves like a conventional likelihood in the $N_{r} \to \infty$ limit, giving access to likelihood-based inference machinery.

Exploring $l_{s}$ by MCMC

Metropolis–Hastings exploration (Wood 2010, Methods summary)

$l_{s}$ usually displays residual small-scale roughness, so smooth-function optimizers fail; instead use Metropolis–Hastings MCMC. From a parameter guess $θ^{[0]}$ , iterate for $k = 1, 2, \dots$ :

Propose $θ^{*} = θ^{[k - 1]} + δ^{[k]}$ , with $δ^{[k]}$ from a convenient symmetric distribution.

Set $θ^{[k]} = θ^{*}$ with probability $min [1, exp {l_{s} (θ^{*}) - l_{s} (θ^{[k - 1]})}]$ ; otherwise $θ^{[k]} = θ^{[k - 1]}$ .

The chain both locates and quantifies the range of parameter values consistent with the data. (A flat prior makes the acceptance ratio depend only on $l_{s}$ ; informative priors enter multiplicatively as usual.)

Point estimation, model comparison, and checking

MLE via quadratic regression. Near the maximum-likelihood estimate $\hat{θ}$ , the $N_{r} \to \infty$ limit of $l_{s}$ is estimated by quadratic regression of the sampled $l_{s} (θ^{[k]})$ values on the $θ^{[k]}$ from the converged chain — recovering $\hat{θ}$ and the standard likelihood theory for inference.
Model comparison. Alternative models compared by AIC or generalized likelihood-ratio testing.
Model-checking diagnostic. If the model fits,

(s - \hat{μ}_{θ})^{⊤} \hat{Σ}_{θ}^{- 1} (s - \hat{μ}_{θ}) \sim χ_{d i m (s)}^{2} .

Connections

Built on the phase-insensitive statistics that make $s$ approximately normal.
The simulate-statistics-then-score loop parallels MSM / Indirect Inference / the SME (match simulated vs. observed summaries) but yields an explicit parametric likelihood rather than a quadratic moment criterion.
Closely related to ABC: both are likelihood-free and summary-statistic-based, but synthetic likelihood replaces ABC’s acceptance threshold with an MVN density, enabling standard MCMC and likelihood theory.

Second Brain

Explorer

Synthetic Likelihood Construction

Synthetic Likelihood Construction

Overview

Main Content

Properties

Exploring $l_{s}$ by MCMC

Point estimation, model comparison, and checking

Connections

See Also

Graph View

Table of Contents

Backlinks

Second Brain

Explorer

Synthetic Likelihood Construction

Synthetic Likelihood Construction

Overview

Main Content

Properties

Exploring ls​ by MCMC

Point estimation, model comparison, and checking

Connections

See Also

Graph View

Table of Contents

Backlinks

Exploring $l_{s}$ by MCMC