Causal Estimands

Summary

Causal estimands are the target quantities in causal inference — specific comparisons of potential outcomes that answer “what is the effect of the treatment?” The main estimands differ by what population they average over and whether they condition on covariates. The choice of estimand is a scientific question, not a statistical one.

Overview

Causal effects are defined as contrasts of potential outcomes under different treatment conditions for the same unit. Since only one potential outcome is ever observed per unit, causal effects are fundamentally about counterfactuals. The choice of estimand should be driven by the scientific question at hand.

Individual Treatment Effect (ITE)

Definition: Individual Treatment Effect (ITE)

For unit , the individual treatment effect is:

the difference in potential outcomes under treatment vs. control for the same unit.

The ITE is never directly observable (fundamental problem of causal inference). Population-level estimands average over ITEs in various ways.

Sample Average Treatment Effect (SATE)

Definition: Sample Average Treatment Effect (SATE)

The average ITE over the observed sample of units:

The SATE is a function of potential outcomes of the specific sample. It is non-random given the sample, though it involves missing potential outcomes.

Conditional Average Treatment Effect (CATE)

Definition: Conditional Average Treatment Effect (CATE)

The average treatment effect for all units with covariate value :

where for .

The CATE captures treatment effect heterogeneity — how the average effect varies across covariate subgroups. Estimating as a function of is a central goal in modern causal inference.

Population Average Treatment Effect (PATE)

Definition: Population Average Treatment Effect (PATE)

Averaging the CATE (or ITE) over a target population :

  • The PATE is a function of the distribution of potential outcomes in a population.
  • In observational studies where the target population is the population from which the sample is drawn, PATE is typically the estimand of interest.
  • In randomized experiments, SATE is often the primary estimand.

SATE vs. PATE distinction

Both ITE and CATE are important for characterizing treatment effect heterogeneity, but they are obviously different. They are sometimes conflated in the literature.

  • SATE = average of ITEs over the specific sample
  • PATE = average of ITEs over the population distribution

Mixed Average Treatment Effect (MATE)

Definition: Mixed Average Treatment Effect (MATE)

Replace the population distribution in the PATE with the empirical distribution of covariates in the sample:

where evaluated at parameter .

  • The MATE is a convenient approximation to the PATE: it conditions on the observed values rather than integrating over the population distribution.
  • Most Bayesian causal inference in practice focuses on the MATE (rather than PATE or SATE).
  • The distinction: PATE has the largest uncertainty; SATE has the smallest; MATE is in between.

Summary Table

EstimandFormulaPopulationKey feature
ITEUnit Never observed; target of imputation
SATESampleNon-random given sample
CATESubgroup Captures heterogeneity
PATEPopulationRequires population distribution
MATESample (empirical )Most used in Bayesian CI

Principal Causal Effects

In complex assignment mechanisms (e.g., instrumental variables), one may define stratum-specific effects:

Definition: Principal Causal Effects

For compliance stratum (compliers, always-takers, never-takers, defiers), the stratum-specific effect is:

These are called principal causal effects.

See Instrumental Variables and Principal Stratification for the full IV/compliance framework.

Connections

See Also