Group-Time Average Treatment Effects
Summary
The group-time average treatment effect is the disaggregated building block of the Callaway–Sant’Anna framework: the average effect at calendar time for the cohort first treated in period . It imposes no restriction on treatment-effect heterogeneity across groups or over time, so it (unlike a single TWFE coefficient) always carries a well-defined causal interpretation and can be flexibly aggregated to answer many policy questions.
Overview
In the canonical two-period setup the target is . With multiple periods and staggered adoption there is no single “treated” timing, so the paper indexes effects by both the cohort (when treatment starts) and the calendar period (when we measure). Fixing and varying traces effect dynamics for that cohort; varying compares cohorts. Because is defined directly on potential outcomes — not as a regression coefficient — it sidesteps the TWFE negative-weighting problem entirely (see Difference-in-Differences with Multiple Time Periods - Overview).
Main Content
Setup and notation
- periods, . = treatment status of unit at .
- Group = the first period a unit is treated; if never treated. , and flags the never-treated. is the last-treated group.
- is the set of identifiable groups (drops when no never-treated group exists, since it has no valid comparison). is the support of pre-treatment covariates.
- Generalized propensity score : probability of first treatment at , conditional on and on being either in group or never-treated. (More generally conditions on being in or “not-yet-treated by ”.)
Potential outcomes (multi-stage adoption)
Combining Robins’ dynamic potential outcomes with Heckman et al.’s multi-stage adoption: = untreated potential outcome (never participates through ); = potential outcome if first treated in period . Observed and potential outcomes link via
We observe exactly one potential-outcome path per unit. Assumption 1 (Irreversibility): a.s., and a.s. (staggered adoption — units never “forget” treatment). Assumption 2 (Random sampling): the panel is iid (results extend to repeated cross-sections, Appendix B). The unit index is suppressed henceforth.
The group-time average treatment effect
The main building block is
the average treatment effect at period among units first treated in period . It places no restriction on (a) heterogeneity across groups, (b) the timing , or (c) how effects evolve over . The family can therefore be read directly for heterogeneity or aggregated to answer: (a) overall effect of participating by ; (b) across-group heterogeneity; (c) effects by length of exposure (event study); (d) how effects evolve over calendar time. In the 2x2 case collapses to the canonical ATT.
Why this avoids TWFE bias. A single in a TWFE regression must average all cohort-period effects with estimator-determined (possibly negative) weights. By contrast, each is its own estimand; aggregation weights are then chosen transparently by the researcher to match the question (see Aggregating Group-Time Effects), and the simple combinations the paper proposes use non-negative weights — ruling out the “positive-for-all-but-negative-estimate” pathology.
Examples
Minimum wage: reading directly
With running 2001-2007 and groups , the cohort first raising its minimum wage in 2004 (Illinois) has , , , — i.e. teen employment falls more the longer the higher wage is in place. Each number is interpretable on its own without any homogeneity assumption.
Connections
- Identified under conditions in Identifying Assumptions for Staggered DiD.
- Estimated via the formulas in Doubly-Robust Estimands for ATT(g,t).
- Combined into summaries in Aggregating Group-Time Effects.
- Inference on the whole family in Simultaneous Inference via Multiplier Bootstrap.
See Also
- Estimands in Longitudinal Research — potential-outcome estimands across periods
- Difference in differences — the canonical 2x2 ATT this generalizes
- Mostly Harmless Econometrics — DiD chapter