Variational Marginal Estimator
Summary
The variational marginal estimator learns an approximation to the intractable marginal likelihood and substitutes it into the likelihood form of the EIG. It yields an upper bound , tight iff equals the true marginal. It is the natural dual of the posterior estimator and is preferred when is high-dimensional (so a posterior approximation is hard) but is low-dimensional.
Overview
When is high-dimensional, learning a good amortized posterior is hard. The marginal estimator instead targets the (often lower-dimensional) outcome density: learn and plug it into the likelihood form of the EIG, .
Main Content
Definition: Variational marginal estimator (Foster 2019, Eq. 9)
with . Train by stochastic gradient descent to minimize .
Upper-bound property and tightness
is an upper bound on the EIG, with equality iff . The bound was studied in a mutual-information context (Poole et al. 2019) but not previously used for BOED. The gap is an expected KL from the true marginal to its approximation.
Posterior vs marginal: choosing by dimension
| Targets | Bound | Prefer when | |
|---|---|---|---|
| [[Variational Posterior Estimator (Barber-Agakov)|]] | distribution over | lower | |
| distribution over | upper |
The two are complementary: run both to sandwich the EIG () and bound a design’s true value (Foster 2019 §6.1).
Sequential advantage
In sequential BOED, needs only samples from the running posterior , not its density — unlike and . This makes it well suited to adaptive experiments where the posterior is only available through samples; it is the estimator used in the sequential CES experiment (High-Dimensional Design Applications).
Examples
Marginal estimator on revealed preference (Foster 2019 §6)
On the revealed-preference economics benchmark (low-dimensional response , higher-dimensional latent utility), the marginal estimator is the natural choice and is used to drive Bayesian-optimization-based design selection. In the sequential CES experiment it reduces posterior entropy and concentrates on the true parameters faster than NMC- or random-design baselines.
Connections
- Dual of Variational Posterior Estimator (Barber-Agakov): posterior↔lower, marginal↔upper.
- Extended to implicit likelihoods by [[Implicit Likelihood Estimator|]], which adds a second approximation for the likelihood.
- In Foster 2020 the marginal idea reappears as the VNMC upper bound used to verify designs found by ACE.
See Also
- Variational Posterior Estimator (Barber-Agakov) — the complementary lower bound
- Variational NMC Estimator — a consistent upper bound (tight as )
- Implicit Likelihood Estimator — marginal + likelihood approximations for implicit models