Sequential and Adaptive BED
Summary
Bayesian adaptive design (BAD) extends BED to a sequence of experiment steps, choosing each design after seeing the history . Each step maximizes the incremental EIG — an ordinary EIG with the prior replaced by the current posterior . Traditional BAD iterates design → run → infer (Figure 1 of Rainforth 2023), which is powerful but (a) requires expensive posterior inference at every step and (b) is myopically greedy. These two flaws motivate policy-based adaptive design (From Designs to Policies (Deep Adaptive Design)).
Overview
Many real experiments are sequential: a psychology trial asks a participant a series of questions; an active-learning loop queries points one at a time. Here we can use information from earlier responses to choose later designs, substantially improving efficiency — fewer iterations to reach the same certainty. The Bayesian framework handles this self-consistently: the posterior after step simply becomes the prior for step .
Main Content
Definition: Sequential model and Bayesian update (Foster 2019, Eq. 5)
For experiments with outcomes conditionally independent given ,
Having run experiments , design by replacing the prior with the posterior conditional on the history.
Definition: Incremental EIG (Rainforth 2023, Eq. 4)
With history (and ), the incremental EIG of the next step is
This is just an ordinary EIG with an updated prior and likelihood; traditional BAD greedily picks at each step.
Additivity of incremental EIGs
The incremental EIGs are additive in expectation, so the total EIG over all steps equals the sum of the per-step incremental EIGs (Rainforth 2023, Eq. 17):
This requires updating beliefs about all model parameters (including nuisance ) between steps, not just the target — otherwise previously gathered data is ignored and additivity breaks.
The two flaws of traditional BAD
- Inference cost. Even with an extremely fast EIG estimator, BAD must update the model via Bayesian inference at every step (Figure 1: the design → observe → infer loop). This is only practical when computation time during the experiment is cheap — often it is not.
- Greedy myopia. Choosing to maximize only the next step’s incremental EIG ignores how that design influences information gathered at future steps. The truly optimal policy maximizes the total EIG, not the per-step EIG.
Both flaws are removed by learning a design policy upfront, deployed near-instantly during the experiment — see From Designs to Policies (Deep Adaptive Design).
Inference compatibility (estimator caveat)
In sequential settings, replacing with has estimator-specific consequences (Foster 2019 §3, Eq. 14): the marginal and implicit-likelihood estimators need only samples from the posterior, while the posterior and VNMC estimators also need its density. Foster 2019 shows the additive constant can be dropped, so any inference scheme — exact or approximate — is compatible. ACE in iterated design (Foster 2020 §3.5) similarly replaces with the running posterior.
Examples
Adaptive psychology experiment (Foster 2019 §6.3)
Human participants on Mechanical Turk respond to features of stylized faces under a mixed-effects model with per-participant nuisance variables. Using the implicit-likelihood estimator to choose each of 36 possible stimuli online yields a lower-entropy (more certain) posterior than random design — a fully online design → respond → re-infer → design loop. The constant-elasticity-of-substitution (CES) economics experiment (20 sequential steps) is the other recurring adaptive benchmark across both Foster papers; see High-Dimensional Design Applications.
Connections
- Special case of Bayesian active learning (the designs are which datapoints to label); the BALD score is the per-step EIG of model parameters.
- Reframed as a Bayes-adaptive Markov decision process (RL), with the incremental EIG as reward — links BAD to Bayesian reinforcement learning (Rainforth 2023 §5.2).
- Solved non-myopically by deep adaptive design, which amortizes the entire policy and optimizes the total EIG.
See Also
- Expected Information Gain — the per-step objective
- From Designs to Policies (Deep Adaptive Design) — amortized policies that fix BAD’s two flaws
- High-Dimensional Design Applications — the CES iterated-design experiment
- Dynamic Treatment Regimes Framework — a parallel sequential-decision framework in causal inference