High-Dimensional Design Applications
Summary
The five experiments of Foster 2020 demonstrate that one-stage gradient BOED scales where two-stage methods fail. Death process (2-D): gradient methods beat Bayesian optimization even in low dimension. Regression (400-D): ~2× the EIG of BO/random-search baselines. Advertising (ablation over dimension): gap grows with . Biomolecular docking (100-D, real pharmacology): ACE designs beat human experts. CES (6-D iterated): ACE/PCE reduce posterior entropy faster than the Foster 2019 marginal+BO baseline.
Overview
Designs are judged primarily by their EIG (computed analytically or by a large NMC estimator when possible). Where the EIG is intractable, the ACE lower bound and VNMC upper bound are paired to trap the true EIG: if design ‘s lower bound exceeds design ‘s upper bound, is provably superior. When the optimal design is known, designs are also scored by design error .
Main Content
Death process — gradients beat BO in low dimension (§4.2, Figs. 1–2)
Epidemiology model: a population of transitions healthy→infected at unknown rate ; measure infected counts at two times and (). Aim: infer . With 66 discrete outcomes, use Rao–Blackwellized gradients. Final EIG: ACE 0.9830 ± 0.0001, PCE 0.9822, BA 0.9822, BO+NMC 0.9732. All gradient methods beat BO in both quality and wall-clock — even on a 2-D problem.
Regression — 400-dimensional design (§4.3, Table 1)
Bayesian linear regression, , design is the matrix (400 dimensions), latents with Normal likelihood, priors , , constraint . Gradient methods strongly outperform gradient-free baselines — roughly double the final EIG:
Method EIG l.b. EIG u.b. ACE 16.1 20.7 PCE 16.6 21.5 BA 16.4 21.1 BO + VNMC 7.3 9.6 Random search + VNMC 7.1 9.4
Advertising — ablation over dimension (§4.4, Fig. 3)
Allocate budget across regions, with ; observe sales ; infer market opportunities per region, with neighbouring regions correlated (information pools across regions). The EIG is analytic, and BO is given an EIG oracle (point evaluations of ) to isolate the value of gradient optimization. Gradient methods (ACE/PCE/BA) still win, and BO degrades for ; PCE is strong at low but degrades as grows (the prior becomes an inefficient contrastive proposal), while ACE/BA learn adaptive proposals.
Biomolecular docking — beating experts (§4.5, Table 2)
Pharmacology hit-rate model (Lyu et al. 2019, Nature): probability that compound with docking score is a “hit” follows a sigmoid , . Design = 100 docking scores at which to test compounds (100-D). Result: all gradient methods beat the expert design of Lyu et al.; ACE best (EIG ∈ [1.0835, 1.0852] vs expert [1.0191, 1.0227]). Designs are qualitatively different from expert choices (Fig. 5).
CES — iterated behavioural-economics design (§4.6, Fig. 4)
Constant-elasticity-of-substitution model (Arrow et al. 1961): a participant compares baskets , responding on a slider based on utility difference governed by latents ; 6-D designs over 20 sequential steps with the same participant. Replaces with the running posterior (Sequential and Adaptive BED). ACE and PCE decrease posterior entropy and parameter RMSEs faster and further than the Foster 2019 marginal+BO baseline; BA does poorly here. ACE/PCE perform similarly because, after enough data, the posterior changes little step-to-step so becomes an effective proposal for .
Connections
- Demonstrates the bound-trapping use of ACE (lower) + VNMC (upper).
- Validates the unified gradient thesis: the advantage over two-stage BO grows with design dimension.
- Cited by Rainforth 2023 as evidence that gradient-based design scales to real high-dimensional problems.
- The CES experiment is the shared sequential benchmark with Foster 2019’s marginal estimator.
See Also
- Adaptive Contrastive Estimation (ACE) / Prior Contrastive Estimation (PCE) — the bounds optimized
- Likelihood-Free ACE and Gradient Estimation — the gradient machinery (RB, reparam) used here
- Sequential and Adaptive BED — the iterated-design setting of the CES experiment