Counterfactual Impact Estimation

Summary

Given posterior predictive samples of the counterfactual time series, causal impact is estimated as the difference between observed outcomes and predicted counterfactuals. Three quantities are reported: (1) pointwise impact $ϕ_{t}$ , (2) cumulative impact, and (3) running average impact. All are distributions (not point estimates) with credible intervals.

Overview

The core idea: after fitting the BSTS model on pre-intervention data, the model is used to predict what would have happened in the absence of the intervention. The causal effect is the difference between observed outcomes and these counterfactual predictions.

Definitions

Definition: Pointwise Causal Impact

For each posterior draw $τ$ and each post-intervention time point $t = n + 1, \dots, m$ :
$ϕ_{t}^{(τ)} := y_{t} - \tilde{y}_{t}^{(τ)} (2.15)$
where $y_{t}$ is the observed outcome and $\tilde{y}_{t}^{(τ)}$ is the $τ$ -th draw from the posterior predictive counterfactual distribution.

The collection ${ϕ_{t}^{(τ)}}_{τ}$ yields the posterior predictive density of the causal effect at each time point.

Definition: Cumulative Causal Impact

The cumulative effect of the intervention from $t = n + 1$ through $t$ :
$t^{'} = n + 1 \sum t ϕ_{t^{'}}^{(τ)} \forall t = n + 1, \dots, m (2.16)$
When to use: Appropriate when $y_{t}$ is a flow variable — measured over an interval (e.g., number of searches per day, sales per week).

When NOT to use: Inappropriate for stock variables (e.g., total subscribers at a point in time) — use running average instead.

Definition: Running Average Causal Impact

The average causal effect per time period from $t = n + 1$ through $t$ :
$\frac{1}{t - n} t^{'} = n + 1 \sum t ϕ_{t^{'}}^{(τ)} \forall t = n + 1, \dots, m (2.17)$
Always interpretable regardless of whether $y_{t}$ is a flow or stock.

Caution: As the forecasting period grows, probability intervals widen (more uncertain predictions further in the future), so the running average’s uncertainty increases even as the estimate stabilizes in expectation.

Posterior Summary

For each quantity, the standard Bayesian summary is:

Point estimate: Posterior mean = average over $τ$ draws
Uncertainty: Central 95% posterior probability interval (not a confidence interval)
Significance test: Effect is “significant” if the 95% PI excludes zero

Temporal Structure of Uncertainty

A key feature: prediction intervals widen progressively as we forecast further post-intervention. This is because:

The local linear trend $δ_{t}$ drifts as a random walk → future trend increasingly uncertain
The longer the campaign period, the more the counterfactual can diverge from the true outcome

This is appropriate: we genuinely know less about what would have happened further in the future.

Implication for estimation accuracy: The absolute percentage estimation error increases with forecasting horizon (see Fig. 4a in paper). Structural breaks (sudden changes in the DGP) accelerate this degradation.

Connection to Difference-in-Differences

The pointwise impact $ϕ_{t}$ is the time-series analog of the DiD estimate $(\overset{y}{ˉ}_{p os t}^{t re a t} - \overset{y}{ˉ}_{p re}^{t re a t}) - (\overset{y}{ˉ}_{p os t}^{co n t ro l} - \overset{y}{ˉ}_{p re}^{co n t ro l})$ . CausalImpact generalizes this by:

Modeling the full counterfactual trajectory (not just pre/post means)
Incorporating temporal autocorrelation
Using a spike-and-slab prior to select which controls matter

Connections

Draws $\tilde{y}_{t}^{(τ)}$ come from MCMC Inference for CausalImpact
Applied in CausalImpact Empirical Application
Generalizes Differences-in-Differences to time series
Conceptually related to Counterfactual Inference (existing vault note on BART counterfactuals)

Second Brain

Explorer

Counterfactual Impact Estimation

Counterfactual Impact Estimation

Overview

Definitions

Posterior Summary

Temporal Structure of Uncertainty

Connection to Difference-in-Differences

Connections

See Also

Graph View

Table of Contents

Backlinks