Counterfactual Impact Estimation

Summary

Given posterior predictive samples of the counterfactual time series, causal impact is estimated as the difference between observed outcomes and predicted counterfactuals. Three quantities are reported: (1) pointwise impact , (2) cumulative impact, and (3) running average impact. All are distributions (not point estimates) with credible intervals.

Overview

The core idea: after fitting the BSTS model on pre-intervention data, the model is used to predict what would have happened in the absence of the intervention. The causal effect is the difference between observed outcomes and these counterfactual predictions.

Definitions

Definition: Pointwise Causal Impact

For each posterior draw and each post-intervention time point :

where is the observed outcome and is the -th draw from the posterior predictive counterfactual distribution.

The collection yields the posterior predictive density of the causal effect at each time point.

Definition: Cumulative Causal Impact

The cumulative effect of the intervention from through :

When to use: Appropriate when is a flow variable — measured over an interval (e.g., number of searches per day, sales per week).

When NOT to use: Inappropriate for stock variables (e.g., total subscribers at a point in time) — use running average instead.

Definition: Running Average Causal Impact

The average causal effect per time period from through :

Always interpretable regardless of whether is a flow or stock.

Caution: As the forecasting period grows, probability intervals widen (more uncertain predictions further in the future), so the running average’s uncertainty increases even as the estimate stabilizes in expectation.

Posterior Summary

For each quantity, the standard Bayesian summary is:

  • Point estimate: Posterior mean = average over draws
  • Uncertainty: Central 95% posterior probability interval (not a confidence interval)
  • Significance test: Effect is “significant” if the 95% PI excludes zero

Temporal Structure of Uncertainty

A key feature: prediction intervals widen progressively as we forecast further post-intervention. This is because:

  1. The local linear trend drifts as a random walk → future trend increasingly uncertain
  2. The longer the campaign period, the more the counterfactual can diverge from the true outcome

This is appropriate: we genuinely know less about what would have happened further in the future.

Implication for estimation accuracy: The absolute percentage estimation error increases with forecasting horizon (see Fig. 4a in paper). Structural breaks (sudden changes in the DGP) accelerate this degradation.

Connection to Difference-in-Differences

The pointwise impact is the time-series analog of the DiD estimate . CausalImpact generalizes this by:

  • Modeling the full counterfactual trajectory (not just pre/post means)
  • Incorporating temporal autocorrelation
  • Using a spike-and-slab prior to select which controls matter

Connections

See Also