Generalized Synthetic Control Method
Summary
The Generalized Synthetic Control (GSC) method (Xu 2017) imputes treated counterfactuals using an interactive fixed effects (IFE) model estimated on the control group. A 3-step procedure — (1) estimate IFE on controls, (2) project treated pretreatment outcomes onto factor space, (3) impute counterfactuals — handles multiple treated units, variable treatment timing, and heterogeneous treatment effects in a single run. Built-in cross-validation selects the number of factors; parametric bootstrap provides standard errors and confidence intervals. DID is the special case with constant factor loadings; canonical SC is the special case with one treated unit.
Overview
Two standard approaches to causal inference with time-series cross-sectional (TSCS) data both have critical limitations:
- Difference-in-differences requires parallel pre-trends — that treated and control units would have evolved identically absent treatment. This is often implausible when unobserved time-varying confounders affect units differentially.
- Synthetic control matches on pre-treatment trajectories but applies only to a single treated aggregate unit and provides no formal uncertainty estimates.
The GSC method addresses both by explicitly modeling the unobserved time-varying confounders as an interactive fixed effects structure. Because the IFE model is estimated once on all control units jointly, it (1) pools information across controls efficiently, (2) handles multiple treated units in a single run, (3) recovers unit-specific factor loadings for each treated unit, and (4) supports parametric bootstrap inference.
Framework
Setting
Let and denote the sets of treated and control units. total units observed for periods. Unit is first treated at time and observed for post-treatment periods. For notational simplicity, assume all treated units start treatment at the same time, .
The IFE Model
Assumption 1: Functional Form (Xu 2017, p. 60)
The outcome of unit at time is generated by:
where:
- = heterogeneous treatment effect for unit at time (individual treatment indicator if and )
- = vector of observed covariates with common coefficients
- = vector of unobserved common factors (time-varying)
- = vector of unknown factor loadings (unit-specific intercepts)
- = zero-mean idiosyncratic shocks
Identification constraints: (factors normalized), diagonal.
Special cases of this model:
- DID: Set (scalar time trend) and for all units → (additive time fixed effect). Parallel trends is the restriction that factor loadings are identical across units.
- Canonical SC: Restrict to and use the SC weighting scheme instead of the IFE estimator.
- Two-way fixed effects: (additive unit + time fixed effects, the “twoway FE” restriction).
Decomposition of Confounders
The term captures unobserved time-varying confounders that affect different units with different magnitudes (). For example, if a law is passed in a state because public opinion becomes more liberal, and shifting ideology affects liberal vs. conservative states differently, then captures the national ideology trend and captures each state’s sensitivity to it. Two-way FE models would incorrectly assume for all .
ATT Estimand
Definition: Average Treatment Effect on the Treated (ATT)
The target estimand at time is:
Because is observed for treated units in post-treatment periods, the problem reduces to estimating — the counterfactual untreated potential outcome.
As in Abadie, Diamond, and Hainmueller (2010), the treatment effects are treated as fixed given the sample (not as random draws). The estimand is the ATT in the sample drawn, not the population ATT.
Identification Assumptions
Assumption 2: Strict Exogeneity (Xu 2017, p. 61)
The idiosyncratic error term of any unit at any time period is independent of all treatment assignments, observed covariates, unobserved factors and factor loadings.
This is weaker than the strict exogeneity required by two-way FE models. Two-way FE requires , which demands that the composite error (including the factor component) be uncorrelated with treatment — implausible when factor loadings are correlated with treatment assignment. GSC only requires the idiosyncratic part to be exogenous.
Assumptions 3–5 (Xu 2017, pp. 61–62):
- Assumption 3 (Weak serial dependence): Error terms allow mild autocorrelation; strong dependence (unit roots) ruled out
- Assumption 4 (Regularity conditions): Moment conditions for consistency of and the factor space
- Assumption 5 (Cross-sectional independence and homoscedasticity): Required for valid parametric bootstrap; heteroscedasticity across time allowed
The Three-Step GSC Estimator
Definition: GSC Estimation Procedure (Xu 2017, Section 3)
Step 1: Estimate the IFE model on control group data only:
subject to and diagonal.
Step 2: Estimate factor loadings for each treated unit by projecting its pretreatment outcomes onto the estimated factor space:
where superscript denotes pretreatment periods .
Step 3: Impute the treated counterfactual for , :
Treatment effect estimate:
ATT estimate:
Remark 2: Consistency of the GSC Estimator (Xu 2017, p. 64)
Under Assumptions 1–4, the bias of the GSC estimator shrinks to zero as the sample grows, i.e.:
( held fixed). Intuitively, both large and large are necessary: for consistent estimation of and the factor space, for consistent estimation of each treated unit’s factor loadings .
Key implication: When is small, the “incidental parameters” problem leads to imprecise and hence biased treatment effect estimates. This is a crucial difference from two-way FE models.
Model Selection: Cross-Validation for Number of Factors
The number of factors is unknown and must be selected. Xu (2017) proposes a leave-one-out cross-validation procedure that uses the DID data structure — pretreatment treated observations naturally serve as a validation set.
Algorithm 1: Cross-Validation for Number of Factors (Xu 2017, pp. 63–64)
For a given candidate :
Step 1: Estimate the IFE model on control group data, obtaining and .
Step 2: For each pretreatment period (leave-one-out loop):
- (a) Hold back period ; estimate factor loadings for each treated unit using all other pretreatment periods :
- (b) Predict the treated outcome at period :
- (c) Save prediction error: for
Step 3: Compute MSPE for this :
Steps 4–5: Repeat for different ‘s; select that minimizes MSPE.
The procedure is computationally inexpensive: for each , the IFE model is estimated only once (Step 1); all other steps are OLS projections.
Inference: Parametric Bootstrap
Unlike the permutation inference of canonical SC (see Synthetic Control Inference and Diagnostics), GSC produces frequentist uncertainty estimates via parametric bootstrap — suitable when is moderate and asymptotic theory applies.
Algorithm 2: Parametric Bootstrap for ATT Variance (Xu 2017, pp. 64–65)
Motivation: We estimate — the variance conditional on the observed factors and loadings. Only is random.
Step 1 (Collect prediction errors): For rounds, take one control unit as “fake treated,” apply GSC to obtain prediction errors . Collect .
Step 2 (Apply GSC to original data): Obtain , estimated coefficients , , , fitted values , and in-sample residuals .
Step 3 (Bootstrap loop, times): Construct bootstrapped sample by:
Apply GSC to ; obtain bootstrapped .
Step 4: Compute variance:
Use percentile method for confidence intervals (Efron and Tibshirani 1993).
Key design decision: Treated and control units draw residuals from different empirical distributions. Control residuals come from the in-sample IFE model residuals (the model fits controls well). Treated residuals come from out-of-sample prediction errors (the IFE model predicts treated units in pretreatment periods, so prediction errors are typically larger).
Monte Carlo Performance
Simulation DGP (Xu 2017, Eq. 3):
with factors; treated factor loadings shifted from control range by parameter (at : common support; at : no overlap).
Table 1: Finite Sample Properties (Xu 2017, Table 1)
Setup: 5 treated units, , , 5,000 simulations, bootstraps, estimand = (true value = 5).
Key results:
- GSC has limited bias even when and ; bias → 0 as both grow
- Coverage of 95% CI ≈ 0.947 across all configurations — valid bootstrap inference
- Monte Carlo (Online Appendix): GSC has less bias than DID/twoway-FE estimators in the presence of time-varying confounders; less bias than the IFE estimator when treatment effects are heterogeneous across units; more efficient than the original synthetic matching method
Empirical Application: Election Day Registration and Voter Turnout
Example: EDR Laws and Voter Turnout (Xu 2017, Section 5)
Setup: 47 US states, 1920–2012 presidential elections. Nine states adopted Election Day Registration (EDR) laws (treated); 38 states never adopted them (controls). Outcome: voter turnout as % of voting-age population.
DID result (Table 2, cols 1–2): Fixed effects estimate of EDR = 0.87–0.78 percentage points (SE ≈ 3%). But Figure 2(a) shows that DID’s parallel trends assumption fails: average predicted turnout diverges substantially from actual turnout in pretreatment periods.
GSC result (Table 2, cols 3–4): Cross-validation selects factors. Estimated ATT ≈ 5.13–4.90 percentage points (SE ≈ 2.27%). Figure 2(b) shows near-perfect pretreatment fit.
Factor interpretation (Figure 3):
- Factor 1: captures the sharp post-1965 turnout increase in southern states (Voting Rights Act removing Jim Crow laws)
- Factor 2: captures a general downward trend in turnout; southern states have the largest loadings on Factor 1 (Figure 3b)
- Treated unit loadings mostly lie within the convex hull of control loadings → reliable interpolation
Heterogeneous effects by adoption wave (Table 3):
- 1st wave (ME, MN, WI): ATT ≈ 7.27 pp (SE 3.33) — large, significant
- 2nd wave (ID, NH, WY): ATT ≈ 2.17 pp (SE 2.82) — modest, insignificant
- 3rd wave (MT, IA, CT): ATT ≈ −1.14 pp (SE 3.00) — null/negative
Interpretation: Early adopters enacted EDR to genuinely increase turnout; later adopters introduced it to opt out of the NVRA, and by then registration costs had already fallen nationally.
GSC vs. Related Methods
| Method | Treated units | Parallel trends | Uncertainty | Factor selection |
|---|---|---|---|---|
| Two-way FE (DID) | Multiple | Required | OLS SE | N/A (implicit ) |
| IFE estimator (Bai 2009) | Multiple | Not required | Asymptotic | Manual or IC |
| Canonical SC (ADH 2010) | 1 | Not required | Permutation | N/A (weights, not factors) |
| GSC (Xu 2017) | Multiple | Not required | Bootstrap | Cross-validation |
| Penalized SC (AL 2019) | Multiple | Not required | Permutation | Penalty () |
| Matrix completion | Multiple | Not required | Bootstrap | Low-rank constraint |
Diagnostics Recommended by Xu
- Plot raw data, fitted values, and imputed counterfactuals for each treated unit
- Plot factor loadings of both treated and control units (as in Figure 3b) — check that treated units lie within the convex hull of control units; extrapolation is a source of bias
- Compare with DID and IFE estimates as robustness check
- Use
gsynthR package (Xu 2016, Harvard Dataverse) for implementation
Implementation Note
The R package gsynth implements GSC with:
gsynth(Y ~ D + X, data, index = c("unit","time"), force = "two-way", CV = TRUE, r = c(0, 5), se = TRUE, inference = "parametric", nboots = 1000)
Connections
- Synthetic Control Bias Theory: The linear factor model (Abadie et al.) and IFE model (Bai 2009 / Xu) are the same structure; GSC is the estimation method that directly fits this model rather than using weighted matching
- Differences-in-Differences: DID is the special case (constant loadings) of Assumption 1; GSC relaxes this restriction
- Synthetic Control: SC is the special case ; GSC extends to with parametric inference
- Synthetic Control Extensions: GSC is the primary method recommended for multiple treated units; contrasts with the penalized SC of Abadie and L’Hour (2019)
- Bayesian Difference in Differences: Bayesian analog for aggregate time-series; Xu cites Pang (2014) Bayesian multilevel factor models as a complementary approach
See Also
- Xu 2016 - Overview — paper overview with citation and contribution summary
- Synthetic Control — the canonical single-unit estimator
- Synthetic Control Bias Theory — formal theory of the linear factor model
- Synthetic Control Inference and Diagnostics — permutation inference for canonical SC
- Synthetic Control Extensions — broader survey of SC extensions
- Differences-in-Differences — the special case when parallel trends holds
- Abadie 2021 - Overview — complementary guide on canonical SC feasibility and requirements