Synthetic Control Bias Theory

Summary

The formal justification for synthetic controls rests on the linear factor model — a generalization of difference-in-differences that allows unobserved confounders to have time-varying loadings. Under this model, Abadie, Diamond, and Hainmueller (2010) derive a bias bound: the bias of the synthetic control estimator is inversely proportional to the number of pre-treatment periods , provided the synthetic control closely tracks the treated unit’s pre-treatment trajectory. Variable selection (via the V matrix and cross-validation) is the mechanism that enforces this fit.

Overview

The basic synthetic control estimator in Synthetic Control is intuitive — find a weighted combination of donor units that matches the treated unit before the intervention, then attribute post-treatment divergence to the treatment. But why is this a valid identification strategy? The answer is the linear factor model.

The key insight: a synthetic control that reproduces the treated unit’s pre-treatment outcomes implicitly matches on the unobserved common factors — exactly those confounders that would bias a regression estimator. The more pre-treatment periods available, the more constraints the matching imposes, and the better the identification.

The Linear Factor Model

Definition: Linear Factor Model (Abadie et al. 2010, Eq. 10)

The potential outcome without treatment for unit at time follows:

where:

  • = common time trend (constant factor loading)
  • = observed covariates (time-invariant or unaffected by treatment), with time-varying coefficients
  • = vector of unobserved unit-specific factors
  • = vector of time-varying factor loadings (common factors)
  • = zero-mean idiosyncratic shocks

This model generalizes the standard panel data fixed-effects model. The difference-in-differences / fixed-effects restriction (constant over time) is obtained by restricting to be time-invariant. The linear factor model allows to change over time, so the “parallel trends” assumption of DiD is a special case.

Connection to DiD

DiD assumes (constant factor loadings), so unobserved confounders affect all units equally across time. The linear factor model relaxes this: each unit has its own loading on common factors that may drift over time. Synthetic control handles this by matching on the trajectory, not just the level.

The Bias Bound

Theorem: Bias Bound for Synthetic Controls (Abadie, Diamond, and Hainmueller 2010)

Under the linear factor model, suppose a synthetic control with weights reproduces the characteristics of the treated unit:

where and include pre-intervention outcomes and predictors. Then for , the bias of is bounded by a function that is:

  • Inversely proportional to (the number of pre-treatment periods)
  • Increasing in (the size of the donor pool)
  • Controlled by the quality of fit:

Implication: A large alone does not guarantee low bias — the synthetic control must also achieve a close pre-treatment fit. Conversely, a close fit with small may still produce substantial bias if the idiosyncratic transitory shocks are large.

Key practical implications of the bias bound:

  1. Long pre-treatment windows are valuable. More pre-treatment periods impose more matching constraints on .
  2. Imperfect fit is a warning sign. If , Abadie, Diamond, and Hainmueller (2010) advise against using synthetic controls.
  3. Large donor pools can increase bias. A large gives more flexibility to fit the pre-treatment data but may introduce interpolation biases between the treated unit and distant donor units.

Sparsity: The Geometry of Synthetic Controls

One of synthetic control’s most important properties is sparsity — typically, only a small number of donor units receive nonzero weight.

Theorem: Sparsity of Synthetic Controls (Abadie 2021, Section 3.2)

When falls outside the convex hull of the columns of , and the columns of are in general quadratic position, the synthetic control is unique and sparse — with the number of nonzero weights bounded by (the number of predictors in ).

Geometric interpretation: The synthetic control is the projection of onto the convex hull of . Since is typically outside this hull (curse of dimensionality), the projection touches the hull at a face with at most vertices.

Sparsity is a feature, not a bug:

  • Interpretability: The synthetic counterfactual is a named weighted average of specific donor units (e.g., 42% Austria + 22% United States + …)
  • Transparency: The contribution of each donor unit is explicit, allowing subject-matter evaluation of the counterfactual’s plausibility
  • Contrast with regression: Regression weights are dense (all units contribute), unrestricted (can be negative), and allow extrapolation — obscuring potential biases

The V Matrix and Variable Selection

The synthetic control optimization (Eq. 7 in Abadie 2021) requires choosing a positive definite matrix that weights the relative importance of each predictor:

Definition: Cross-Validation for V Matrix Selection

Split the pre-intervention periods into a training period and a validation period (with as a default). Then:

  1. For each candidate , compute weights using training period data only
  2. Evaluate the MSPE of the resulting synthetic control on the validation period:
  1. Select that minimizes the validation-period MSPE
  2. Use for estimation

Simple alternatives to cross-validation:

  • : rescale all predictors to unit variance (equivalent to standardizing)
  • Equal weights : appropriate when predictors are on similar scales

Pre-Intervention Outcomes as Predictors

Including pre-intervention outcome values of in and is often the single most important variable selection decision. Pre-treatment outcomes are powerful predictors of post-treatment outcomes (via the factor model), and they are automatically absorbed into in the factor model framework. However, including many individual time-period outcomes (rather than summary measures) can lead to overfitting during the training period. Use aggregate summaries (means, subperiod averages) as default.

Contrast: Synthetic Control vs. Regression

PropertySynthetic ControlRegression
Weight constraints, Unconstrained
ExtrapolationPrecluded by convex combinationAllowed (negative weights)
SparsityYes (bounded by )No (dense weights)
Fit transparencyExplicit: Hidden: regression forces exactly
Bias when units dissimilarVisible in large Hidden by extrapolation
Pre-analysis planWeights registerable before outcomes observedCannot preregister

The regression estimator forces a perfect fit of the covariates () even when the untreated units are completely dissimilar to the treated unit, allowing extrapolation. Regression weights in table 3 of Abadie (2021) include negative values for four OECD countries in the German reunification example — synthetic control weights in table 2 are all nonnegative.

See Also