7-Synthetic Control Methods
Utrecht School of Economics
2026
Benefits:
Costs:
How would you assess the effect of immigration on local wages, with data?
In 1980 Fidel Castro announced that any Cuban who wanted to leave the country could do so.
Cubans in the US helped to organize the Mariel Boatlift, which brought 125,000 Cubans to Miami in a few months.
Miami’s labor force increased by 7% in a few months.
Card (1990) used this event to study the effect of an inflow of low-skilled workers on local wages.
The analysis compares the evolution of wages in Miami with the evolution of wages in other cities.
Data from the Current Population Survey, on wages and employment before and after the Mariel Boatlift in Miami and in a set of control cities.
Card finds no evidence of a negative effect of the Mariel Boatlift on wages in Miami.
Neither for white nor for African-American workers.
Card (1990, p. 249)
Does this method remind you of something we have seen before?
→ Difference-in-Differences! DiD also compares a treated unit to a control group before and after an intervention. SCM generalises this: rather than choosing the control group ad hoc, it constructs one as an optimally weighted average of donor units.
Card’s choice of control cities was arbitrary.
The synthetic control method removes this arbitrariness.
The method constructs a synthetic control group for the treated unit that is a weighted average of the donor pool.
The weights are chosen to minimize the difference between the treated unit and the synthetic control group in the pre-treatment period.
Synthetic control methods are usually used for a single treated unit (a single school, firm, country, etc.).
Our goal: reproduce the counterfactual of a treated unit by finding the combination of untreated units that best resembles the treated unit before the intervention in terms of the values of \(k\) relevant covariates (predictors of the outcome of interest)
Method selects weighted average of all potential comparison units that best resembles the characteristics of the treated unit(s) - called the synthetic control
Convex hull means synth is a weighted average of units which means the counterfactual is a collection of comparison units that on average track the treatment group over time.
Constraints on the model use non-negative weights which does not allow for extrapolation
Makes explicit the contribution of each comparison unit to the counterfactual
Formalizing the way comparison units are chosen has direct implications for inference
Subjective researcher bias kicked down to the model selection stage
Significant diversity at the moment as to how to principally select models - from machine learning to modifications - as well as estimation and software
Part of the purpose of this procedure is to reduce subjective researcher bias
Ferman, Pinto and Possebom (2020) suggest specific specifications and report all of them
Suppose that we observe \(J + 1\) units in periods \(1, 2, . . . , T\)
Unit “one” is exposed to the intervention of interest (that is, “treated”) during periods \(T_0 + 1, . . . , T\)
The remaining \(J\) are an untreated reservoir of potential controls (a “donor pool”)
Key assumptions: no anticipation of the treatment before \(T_0\); no spillovers from the treated unit to the donor pool (SUTVA)
The synthetic control is a weighted average of the \(J\) potential controls
Let \(W = (w_2, \ldots, w_{J+1})′\) with \(w_j \geq 0\) for \(j = 2, \ldots, J + 1\) and \(w_2 + \cdots + w_{J+1} = 1\). Each value of \(W\) represents a potential synthetic control
Let \(X_1\) be a \((k \times 1)\) vector of pre-intervention characteristics for the treated unit. Similarly, let \(X_0\) be a \((k \times J)\) matrix which contains the same variables for the unaffected units.
Step 1: For a given predictor-weight matrix \(V\) (diagonal, \(k \times k\)), \(W^*(V) = (w^*_2, \ldots, w^*_{J+1})′\) minimises \(\|X_1 − X_0 W\|_V\), subject to the weight constraints
Step 2: \(V\) is then chosen to minimise the pre-treatment mean squared prediction error (MSPE): \[ \sum_{t=1}^{T_0} \bigg (Y_{1t} - \sum_{j=2}^{J+1}w_j^*(V)\,Y_{jt}\bigg )^2 \]
Choice of \(v_1, . . . , v_k\) can be based on:
Abadie, Diamond, and Hainmueller (2010) use the synthetic control method to estimate the effect of Proposition 99 on per capita cigarette consumption in California.
| Variables | Real California | Synthetic Calif. | Avg. of 38 Control States |
|---|---|---|---|
| Ln(GDP per capita) | 10.08 | 9.86 | 9.86 |
| Percent aged 15–24 | 17.40 | 17.40 | 17.29 |
| Retail price | 89.42 | 89.41 | 87.27 |
| Beer consumption per capita | 24.28 | 24.20 | 23.75 |
| Cigarette sales per capita 1988 | 90.10 | 91.62 | 114.20 |
| Cigarette sales per capita 1980 | 120.20 | 120.43 | 136.58 |
| Cigarette sales per capita 1975 | 127.10 | 126.99 | 132.81 |
All variables except lagged cigarette sales are averaged for the 1980–1988 period. Beer consumption is averaged 1984–1988.
We only observe 2 points per year: Hard to determine whether the observed difference between the treated unit and the synthetic control is due to the treatment or to chance
ADH (2010) use randomization inference to test the null hypothesis that the treatment had no effect on the outcome
To assess significance, we calculate exact p-values under Fisher’s sharp null using a test statistic equal to after to before ratio of RMSPE
If we were to assign the intervention at random, the probability of obtaining a post/pre-Proposition 99 RMSPE ratio as extreme as the one observed in California is 0.026.
Remember the Mariel Boatlift? Peri and Yasenov (2019) use the synthetic control method to estimate the effect of the Mariel Boatlift on the Miami labor market.
Findings: no evidence of a negative effect of the Mariel Boatlift on wages in Miami.
SCM is a powerful tool, but it is not a panacea. It requires:
Motivation: When a single unit is treated (a state, country, firm), DiD with an ad hoc control group is arbitrary. SCM constructs the counterfactual optimally.
The method: A synthetic control is a weighted average of donor units, with weights chosen to minimise the pre-treatment gap in outcomes and predictors.
Two-step estimation: Unit weights \(W^*\) and predictor weights \(V\) are jointly optimised — \(V\) governs how predictors are compared, \(W^*\) governs how donors are blended.
Inference: Standard asymptotics are unavailable (\(J\) small, \(T\) moderate). Fisher permutation/randomisation inference — applying SCM to each donor as a placebo — yields an exact \(p\)-value.
Validity: Requires no anticipation, SUTVA, and that the treated unit lies in the convex hull of the donor pool (no extrapolation).
Limitations: Researcher degrees of freedom at the model-selection stage; works best with many pre-treatment periods and a clearly defined single treated unit.