Time-Varying Treatments and G-computation

Summary

Time-varying treatments involve sequential assignment over $T$ time points, where intermediate confounders are affected by past treatment. Sequential ignorability extends the ignorability assumption to this setting. The g-formula (Robins 1986) identifies the causal effect of a treatment sequence by iterating outcome regression across time. The Bayesian approach applies g-computation: fit a Bayesian model for each component in the g-formula and combine posterior draws.

Overview

In many real-world scenarios, subjects receive treatments sequentially at multiple time points. The challenge: time-varying confounders $L_{t}$ are affected by both previous treatments and future treatment assignment and outcomes. These settings are called time-varying, sequential, or longitudinal treatments.

Standard ignorability cannot handle this because conditioning on a time-varying confounder $L_{t}$ simultaneously:

Removes confounding for the $t$ -th treatment
Opens a collider bias path for earlier treatments

The solution is sequential ignorability combined with the g-formula.

Setup

Consider $T$ time points. For unit $i$ ( $i = 1, \dots, N$ ; $t = 1, \dots, T$ ):

$L_{0}$ — baseline time-invariant covariates
$Z_{t}$ — binary treatment at time $t$
$L_{t}$ — time-varying confounders between $Z_{t - 1}$ and $Z_{t}$ (affected by previous treatments)
$Y_{i}$ — final outcome at time $T$

Treatment sequence: $\overset{ˉ}{Z}_{t} = (Z_{1}, \dots, Z_{t})$ and $\overset{z}{ˉ}_{t} = (z_{1}, \dots, z_{t})$ .

Causal estimand: the marginal effect comparing two pre-specified treatment sequences $\overset{z}{ˉ}, \overset{z}{ˉ}^{'} \in {0, 1}^{T}$ :

τ_{\overset{z}{ˉ}, \overset{z}{ˉ}^{'}} \equiv E [Y_{i} (\overset{z}{ˉ}_{T})] - E [Y_{i} (\overset{z}{ˉ}_{T}^{'})]

Sequential Ignorability

Assumption 7.2 — Sequential Ignorability

For all $t = 1, \dots, T$ :
$Z_{t} ⊥ ⊥ Y (\overset{z}{ˉ}) ∣ \overset{ˉ}{Z}_{t - 1}, \overset{ˉ}{L}_{t - 1}$
for all $\overset{z}{ˉ}_{t}$ . That is, given all past treatment and covariate history, the current treatment assignment is independent of future potential outcomes.

Sequential ignorability is the time-varying analogue of the standard ignorability assumption. It requires that at each time point, treatment is as-good-as-random conditional on the entire observed history up to that point.

The G-Formula

Theorem: G-Formula (Robins 1986)

Under sequential ignorability, the marginal mean potential outcome for treatment sequence $\overset{z}{ˉ}_{T}$ is identified from observed data as:
$E [Y_{i} (\overset{z}{ˉ}_{T})] = L_{0}, L_{1}, \dots, L_{T - 1} \sum E [Y ∣ \overset{ˉ}{Z}_{T} = \overset{z}{ˉ}_{T}, \overset{ˉ}{L}_{T - 1}] \cdot t = 1 \prod T Pr (L_{t} ∣ \overset{ˉ}{Z}_{t} = \overset{z}{ˉ}_{t}, \overset{ˉ}{L}_{t - 1}) \cdot Pr (L_{0})$

The g-formula is an extension of the outcome regression identification formula to sequential treatments. It requires:

A model for the final outcome $E [Y ∣ \overset{ˉ}{Z}_{T}, \overset{ˉ}{L}_{T - 1}]$
Models for the time-varying confounders $Pr (L_{t} ∣ \overset{ˉ}{Z}_{t}, \overset{ˉ}{L}_{t - 1})$ at each time point

Bayesian G-Computation

The Bayesian approach applies g-computation: fit Bayesian models for each component in the g-formula, combine posterior draws.

Algorithm:

Fit a Bayesian outcome model $Pr (Y ∣ \overset{ˉ}{Z}_{T}, \overset{ˉ}{L}_{T - 1}; θ_{Y})$
Fit Bayesian confounder models $Pr (L_{t} ∣ \overset{ˉ}{Z}_{t}, \overset{ˉ}{L}_{t - 1}; θ_{L_{t}})$ for each $t$
Draw posterior samples $(θ_{Y}, θ_{L_{1}}, \dots, θ_{L_{T}})$ jointly
Compute the g-formula integral by Monte Carlo: simulate $\overset{ˉ}{L}$ from the confounder models under intervention $\overset{z}{ˉ}$ , plug into the outcome model, average

Example 7.3 — Bayesian G-computation with Two Periods

Setup: $T = 2$ time periods. $L_{0}$ : binary baseline covariate. $Z_{1}$ : binary treatment at $t = 1$ . $L_{1}$ : binary time-varying covariate. $Z_{2}$ : binary treatment at $t = 2$ . $Y$ : binary outcome.

Estimand: $E [Y (z_{1}, z_{2})]$ for any $(z_{1}, z_{2}) \in {0, 1}^{2}$ , via:
$E [Y (z_{1}, z_{2})] = l_{0}, l_{1} \sum Pr (Y = 1 ∣ Z_{2} = z_{2}, Z_{1} = z_{1}, L_{1} = l_{1}, L_{0} = l_{0})$ $\cdot Pr (L_{1} = l_{1} ∣ Z_{1} = z_{1}, L_{0} = l_{0}) \cdot Pr (L_{0} = l_{0})$
Posterior computation (with Beta conjugate priors):

Sample $Pr (Y = 1 ∣ Z_{2}, Z_{1}, L_{1}, L_{0})$ from $Beta (1/2 + \sum 1 (\dots), \dots)$ (8 Bernoulli cells)

Sample $Pr (L_{1} = l_{1} ∣ Z_{1} = z_{1}, L_{0} = l_{0})$ similarly (4 Bernoulli cells)

Sample $Pr (L_{0} = 1)$ from $Beta (1/2 + \sum L_{0 i}, 1/2 + \sum (1 - L_{0 i}))$

Then combine to get posterior of $E [Y (z_{1}, z_{2})]$ and contrasts.

Challenges and Extensions

Scalability: the g-formula becomes intractable as $T$ and the dimension of $L$ increase — the sum over all history paths requires exponentially many models.

Marginal Structural Models (MSM): A popular alternative (Robins et al. 2000) that models the marginal potential outcome distribution rather than the full conditional. The Bayesian version (Saarela et al. 2016) uses the Bayesian bootstrap.

Dynamic treatment regimes: A closely related topic — sequences of decision rules that individualize treatment over time based on evolving history. Optimal dynamic treatment regimes require combining causal inference + decision theory + reinforcement learning.

G-null Paradox

Robins & Wasserman (2015) showed that unsaturated MSMs might rule out the null hypothesis of zero causal effect a priori — a phenomenon called the g-null paradox. This is an important limitation of MSMs in practice.

Comparison: Bayesian vs. Frequentist

Approach	Method	Key advantage	Limitation
Frequentist	IPW-based MSM	Computationally simpler	Extreme weights; g-null paradox
Frequentist	G-computation	Direct; flexible	Requires many models
Bayesian	G-computation	Uncertainty propagation across time	Computationally demanding
Bayesian	Marginal structural model (Bayesian bootstrap)	Avoids specifying all confounder models	Relies on IPW; extreme weight issues remain

Connections

General Structure of Bayesian CI — data augmentation and posterior imputation generalized to time-varying settings
Potential Outcomes Framework — SUTVA and the multiple potential outcomes $Y_{i} (\overset{z}{ˉ}_{T})$
Bayesian Outcome Models — each time-step outcome model in the g-formula
Instrumental Variables and Principal Stratification — principal stratification applied to censoring/time-varying settings

Second Brain

Explorer

Time-Varying Treatments and G-computation

Time-Varying Treatments and G-computation

Overview

Setup

Sequential Ignorability

The G-Formula

Bayesian G-Computation

Challenges and Extensions

Comparison: Bayesian vs. Frequentist

Connections

See Also

Graph View

Table of Contents

Backlinks