Implicit Likelihood Estimator

Summary

The implicit-likelihood estimator handles models whose likelihood can be sampled but not evaluated (e.g. random-effects / nuisance-variable models, where is intractable). It learns two approximations — a marginal and a likelihood — and plugs both into the EIG. Unlike the other three, it is not a bound on the EIG, but Lemma 2 bounds its error, so minimizing that bound trains it.

Overview

The posterior, marginal, and VNMC estimators all assume the likelihood can be evaluated pointwise. Many important models cannot: introduce nuisance latents (random effects, latent confounders) and the likelihood becomes an intractable integral, even though you can still simulate by sampling then . The variational posterior works here unchanged (it needs only samples), but does not — it contains explicitly. The fix is to approximate the likelihood too.

Main Content

Definition: Implicit-likelihood estimator (Foster 2019, Eq. 12)

Using a marginal approximation and a likelihood approximation :

This is not a bound on the EIG (unlike the other three estimators).

Lemma 2 — EIG estimation error bound (Foster 2019)

For any valid and , the EIG estimation error is bounded:

where does not depend on or . The RHS is iff and for almost all .

Training implication

Lemma 2 says: learn and by maximizing via stochastic gradient ascent (two ordinary maximum-likelihood density-estimation problems), then substitute into Eq. 12. In general and are learned separately with no weight sharing; Foster 2019 §A.4 discusses the coupled case .

Where it sits among the four

From Foster 2019, Table 1: is the only estimator marked implicit-likelihood ✓ besides , and it relies on approximating a distribution over (so prefer it, like , when ). It gave the lowest empirical MSE on the mixed-effects benchmark (the implicit-likelihood problem) in Table 2.

Examples

Mixed-effects / item-response model (Foster 2019 §6.1, §6.3)

A psychology item-response model has common fixed effects (of interest) and per-participant random effects (nuisance ), making implicit. gives the best EIG accuracy here and is used to drive the online adaptive face-perception experiment on Mechanical Turk, producing lower-entropy posteriors than random design. See Sequential and Adaptive BED.

Connections

  • Generalizes [[Variational Marginal Estimator|]] (recover it when the likelihood is explicit, ).
  • Anticipates likelihood-free ACE (Foster 2020, Theorem 2), which instead keeps the contrastive structure and replaces the likelihood with an unnormalized approximation while preserving a valid lower bound.
  • Related to simulation-based / likelihood-free inference (LFIRE, ABC) for implicit models.

See Also