Implicit Likelihood Estimator
Summary
The implicit-likelihood estimator handles models whose likelihood can be sampled but not evaluated (e.g. random-effects / nuisance-variable models, where is intractable). It learns two approximations — a marginal and a likelihood — and plugs both into the EIG. Unlike the other three, it is not a bound on the EIG, but Lemma 2 bounds its error, so minimizing that bound trains it.
Overview
The posterior, marginal, and VNMC estimators all assume the likelihood can be evaluated pointwise. Many important models cannot: introduce nuisance latents (random effects, latent confounders) and the likelihood becomes an intractable integral, even though you can still simulate by sampling then . The variational posterior works here unchanged (it needs only samples), but does not — it contains explicitly. The fix is to approximate the likelihood too.
Main Content
Definition: Implicit-likelihood estimator (Foster 2019, Eq. 12)
Using a marginal approximation and a likelihood approximation :
This is not a bound on the EIG (unlike the other three estimators).
Lemma 2 — EIG estimation error bound (Foster 2019)
For any valid and , the EIG estimation error is bounded:
where does not depend on or . The RHS is iff and for almost all .
Training implication
Lemma 2 says: learn and by maximizing via stochastic gradient ascent (two ordinary maximum-likelihood density-estimation problems), then substitute into Eq. 12. In general and are learned separately with no weight sharing; Foster 2019 §A.4 discusses the coupled case .
Where it sits among the four
From Foster 2019, Table 1: is the only estimator marked implicit-likelihood ✓ besides , and it relies on approximating a distribution over (so prefer it, like , when ). It gave the lowest empirical MSE on the mixed-effects benchmark (the implicit-likelihood problem) in Table 2.
Examples
Mixed-effects / item-response model (Foster 2019 §6.1, §6.3)
A psychology item-response model has common fixed effects (of interest) and per-participant random effects (nuisance ), making implicit. gives the best EIG accuracy here and is used to drive the online adaptive face-perception experiment on Mechanical Turk, producing lower-entropy posteriors than random design. See Sequential and Adaptive BED.
Connections
- Generalizes [[Variational Marginal Estimator|]] (recover it when the likelihood is explicit, ).
- Anticipates likelihood-free ACE (Foster 2020, Theorem 2), which instead keeps the contrastive structure and replaces the likelihood with an unnormalized approximation while preserving a valid lower bound.
- Related to simulation-based / likelihood-free inference (LFIRE, ABC) for implicit models.
See Also
- Variational Marginal Estimator — the explicit-likelihood version
- Variational Posterior Estimator (Barber-Agakov) — the other implicit-capable estimator
- Likelihood-Free ACE and Gradient Estimation — Foster 2020’s bound-preserving implicit approach