Sequential and Adaptive BED

Summary

Bayesian adaptive design (BAD) extends BED to a sequence of $T$ experiment steps, choosing each design $ξ_{t}$ after seeing the history $h_{t - 1} = {(ξ_{k}, y_{k})}_{k < t}$ . Each step maximizes the incremental EIG — an ordinary EIG with the prior replaced by the current posterior $p (θ ∣ h_{t - 1})$ . Traditional BAD iterates design → run → infer (Figure 1 of Rainforth 2023), which is powerful but (a) requires expensive posterior inference at every step and (b) is myopically greedy. These two flaws motivate policy-based adaptive design (From Designs to Policies (Deep Adaptive Design)).

Overview

Many real experiments are sequential: a psychology trial asks a participant a series of questions; an active-learning loop queries points one at a time. Here we can use information from earlier responses to choose later designs, substantially improving efficiency — fewer iterations to reach the same certainty. The Bayesian framework handles this self-consistently: the posterior after step $t - 1$ simply becomes the prior for step $t$ .

Main Content

Definition: Sequential model and Bayesian update (Foster 2019, Eq. 5)

For $T$ experiments with outcomes conditionally independent given $(θ, ξ)$ ,
$p (y_{1 : T}, θ ∣ ξ_{1 : T}) = p (θ) t = 1 \prod T p (y_{t} ∣ θ, ξ_{t}) .$
Having run experiments $1, \dots, t - 1$ , design $ξ_{t}$ by replacing the prior with the posterior $p (θ ∣ ξ_{1 : t - 1}, y_{1 : t - 1})$ conditional on the history.

Definition: Incremental EIG (Rainforth 2023, Eq. 4)

With history $h_{t - 1} = {(ξ_{k}, y_{k})}_{k = 1}^{t - 1}$ (and $h_{0} = \emptyset$ ), the incremental EIG of the next step is
$EIG_{θ} (ξ_{t} ∣ h_{t - 1}) := E_{p (θ ∣ h_{t - 1}) p (y_{t} ∣ θ, ξ_{t}, h_{t - 1})} [lo g \frac{p ( y _{t} ∣ θ , ξ _{t} , h _{t - 1} )}{p ( y _{t} ∣ ξ _{t} , h _{t - 1} )}] .$
This is just an ordinary EIG with an updated prior and likelihood; traditional BAD greedily picks $ξ_{t} = ar g max_{ξ_{t}} EIG_{θ} (ξ_{t} ∣ h_{t - 1})$ at each step.

Additivity of incremental EIGs

The incremental EIGs are additive in expectation, so the total EIG over all $T$ steps equals the sum of the per-step incremental EIGs (Rainforth 2023, Eq. 17):
$TEIG_{θ} (π_{ϕ}) = EIG_{θ} (π_{ϕ} (\emptyset)) + t = 2 \sum T E_{p (y_{1 : t - 1} ∣ π_{ϕ})} [EIG_{θ} (π_{ϕ} (h_{t - 1}) ∣ h_{t - 1})] .$
This requires updating beliefs about all model parameters (including nuisance $ψ$ ) between steps, not just the target $θ$ — otherwise previously gathered data is ignored and additivity breaks.

The two flaws of traditional BAD

Inference cost. Even with an extremely fast EIG estimator, BAD must update the model via Bayesian inference at every step (Figure 1: the design → observe → infer loop). This is only practical when computation time during the experiment is cheap — often it is not.
Greedy myopia. Choosing $ξ_{t}$ to maximize only the next step’s incremental EIG ignores how that design influences information gathered at future steps. The truly optimal policy maximizes the total EIG, not the per-step EIG.

Both flaws are removed by learning a design policy $π_{ϕ} (h_{t - 1}) \mapsto ξ_{t}$ upfront, deployed near-instantly during the experiment — see From Designs to Policies (Deep Adaptive Design).

Inference compatibility (estimator caveat)

In sequential settings, replacing $p (θ)$ with $p (θ ∣ h_{t - 1})$ has estimator-specific consequences (Foster 2019 §3, Eq. 14): the marginal and implicit-likelihood estimators need only samples from the posterior, while the posterior and VNMC estimators also need its density. Foster 2019 shows the additive constant $lo g p (y_{1 : t - 1} ∣ ξ_{1 : t - 1})$ can be dropped, so any inference scheme — exact or approximate — is compatible. ACE in iterated design (Foster 2020 §3.5) similarly replaces $p (θ)$ with the running posterior.

Examples

Adaptive psychology experiment (Foster 2019 §6.3)

Human participants on Mechanical Turk respond to features of stylized faces under a mixed-effects model with per-participant nuisance variables. Using the implicit-likelihood estimator $\overset{μ}{^}_{m + ℓ}$ to choose each of 36 possible stimuli online yields a lower-entropy (more certain) posterior than random design — a fully online design → respond → re-infer → design loop. The constant-elasticity-of-substitution (CES) economics experiment (20 sequential steps) is the other recurring adaptive benchmark across both Foster papers; see High-Dimensional Design Applications.

Connections

Special case of Bayesian active learning (the designs are which datapoints to label); the BALD score is the per-step EIG of model parameters.
Reframed as a Bayes-adaptive Markov decision process (RL), with the incremental EIG as reward — links BAD to Bayesian reinforcement learning (Rainforth 2023 §5.2).
Solved non-myopically by deep adaptive design, which amortizes the entire policy and optimizes the total EIG.

Second Brain

Explorer

Sequential and Adaptive BED

Sequential and Adaptive BED

Overview

Main Content

The two flaws of traditional BAD

Inference compatibility (estimator caveat)

Examples

Connections

See Also

Graph View

Table of Contents

Backlinks