Chaos and Phase-Insensitive Statistics
Summary
Why conventional likelihood-based inference collapses for chaotic dynamic models, and how to fix it. Because trajectories are hypersensitive to the noise realization and parameters, the joint density of data and process noise is a wildly irregular, multimodal function of both and — useless for measuring fit and intractable to integrate or sample. The resolution is philosophical: the local phase of the data is a purely noise-driven, non-repeatable feature that should not enter any model–data comparison. So one judges fit using phase-insensitive summary statistics that capture the dynamically-important structure (autocovariances, autoregression coefficients) while discarding local phase.
Overview
This note develops the obstacle that motivates the synthetic likelihood and the design principle behind its summary statistics. The prototypical example is the scaled Ricker map, which shows the collapse of standard methods under chaotic dynamics.
Main Content
The scaled Ricker map (Wood 2010, Eq. 1)
The prototypic ecological model with complex dynamics describes a population by
where are independent process-noise terms and is the intrinsic growth-rate parameter controlling the dynamics. Observations are Poisson deviates with mean (a common sampling situation). The inference target is . For the dynamics are chaotic.
The likelihood collapse
To do likelihood-based inference one must integrate the joint density over all process-noise vectors . But (Wood 2010, Fig. 1b–c):
- As a function of the noise : holding and fixed and varying a single noise deviate , is extraordinarily jagged and multimodal — so the integral over is analytically and numerically intractable.
- As a function of : varying with fixed makes equally irregular.
Bayesian inference fares no better: sampling from a density proportional to such an irregular has no workable method. Since this joint density underlies all conventional fit measures, the chaotic regime defeats them.
Why local phase must be discarded (Wood 2010, "Defining fit")
Naive inference tries to make the model reproduce the exact course of the observed data — something the real system itself would not do if repeated. The dynamic processes driving the system are a repeatable feature about which inference is legitimate, but the local phase of the data is an entirely noise-driven feature that should contribute nothing to a measure of match or mismatch. To be scientifically meaningful, statistical fit must be judged on quantities that reflect what is dynamically important, discarding local phase. (The idea of phase-insensitive comparison is not itself new; what is new is assessing the consistency of model-simulation and data statistics in a way that gives access to likelihood-based inference.)
Designing the summary statistics
The first analysis step reduces the raw data to a vector of summary statistics designed to capture the dynamic structure of the model — specifying what matters about the dynamics, but not how much it matters. Suitable examples:
- coefficients of the autocovariance function;
- coefficients of polynomial autoregressive models of the dynamics;
- coefficients from polynomial regression of the observed order statistics on fixed reference quantiles (to summarize the marginal distribution).
Using regression coefficients as statistics promotes approximate normality of , which supports the key multivariate-normal approximation used in the Synthetic Likelihood Construction. There is complete freedom to transform statistics to improve this approximation, and the method is robust to including uninformative statistics, so careful selection is not essential.
Connections
- The obstacle motivates the Synthetic Likelihood Construction, which replaces the intractable with a smooth MVN likelihood of .
- Phase-insensitive summary statistics are the same device used in ABC and in Indirect Inference / Method of Simulated Moments (where an auxiliary model’s coefficients or moments serve as the statistics).
- Chaotic sensitivity to initial conditions is the hallmark of the dynamical systems studied; the scaled Ricker map recurs in ecology much as the Brock–Mirman map does in economics.
See Also
- Synthetic Likelihood - Overview — where this obstacle fits in the method
- Synthetic Likelihood Construction — the synthetic likelihood built on these statistics
- Nicholson’s Blowfly Application — the specific statistics used in practice