Causal Estimands

Summary

Causal estimands are the target quantities in causal inference — specific comparisons of potential outcomes that answer “what is the effect of the treatment?” The main estimands differ by what population they average over and whether they condition on covariates. The choice of estimand is a scientific question, not a statistical one.

Overview

Causal effects are defined as contrasts of potential outcomes under different treatment conditions for the same unit. Since only one potential outcome is ever observed per unit, causal effects are fundamentally about counterfactuals. The choice of estimand should be driven by the scientific question at hand.

Individual Treatment Effect (ITE)

Definition: Individual Treatment Effect (ITE)

For unit $i$ , the individual treatment effect is:
$τ_{i} \equiv Y_{i} (1) - Y_{i} (0)$
the difference in potential outcomes under treatment vs. control for the same unit.

The ITE is never directly observable, because only one of $Y_{i} (1)$ and $Y_{i} (0)$ is observed for each unit (the fundamental problem of causal inference). Population-level estimands average over ITEs in various ways.
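The definition can be made concrete with a small toy table. This is a minimal sketch with made-up numbers: the treatment indicator `z` and the consistency rule $Y_{i} = Z_{i} Y_{i} (1) + (1 - Z_{i}) Y_{i} (0)$ are standard assumptions that this note does not state explicitly, and both potential outcomes are visible only because the data are simulated.

```python
# Hypothetical toy data: both potential outcomes are listed only because this
# is a simulation; in real data one of them is always missing.
y0 = [3.0, 5.0, 2.0, 4.0]  # Y_i(0): outcome under control
y1 = [4.0, 5.0, 5.0, 3.0]  # Y_i(1): outcome under treatment
z = [1, 0, 1, 0]           # Z_i: treatment actually received (assumed notation)

# Individual treatment effects tau_i = Y_i(1) - Y_i(0).
ite = [a - b for a, b in zip(y1, y0)]

# Observed outcome: the potential outcome revealed by the assignment.
y_obs = [zi * a + (1 - zi) * b for zi, a, b in zip(z, y1, y0)]

# Missing (counterfactual) outcome: the one the assignment did not reveal.
y_missing = [(1 - zi) * a + zi * b for zi, a, b in zip(z, y1, y0)]

print(ite)        # [1.0, 0.0, 3.0, -1.0]
print(y_obs)      # [4.0, 5.0, 5.0, 4.0]
print(y_missing)  # [3.0, 5.0, 2.0, 3.0]
```

Computing `ite` requires both `y_obs` and `y_missing` for every unit, which is why the ITE has to be imputed rather than observed.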

Sample Average Treatment Effect (SATE)

Definition: Sample Average Treatment Effect (SATE)

The average ITE over the observed sample of $N$ units:
$τ^{S} \equiv N^{-1} \sum_{i=1}^{N} τ_{i} = N^{-1} \sum_{i=1}^{N} [Y_{i} (1) - Y_{i} (0)]$

The SATE is a function of the potential outcomes of the specific sample. Given the sample it is a fixed (non-random) quantity, but it is unknown because it involves the missing potential outcomes.
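As an illustration, the sketch below computes the SATE for a hypothetical four-unit sample and compares it with the difference-in-means estimate under every assignment that treats exactly two units. The data and the completely randomized design are assumptions made for the example, not something specified in this note.

```python
from itertools import combinations
from statistics import mean

# Hypothetical potential outcomes for N = 4 units (known only in a simulation).
y0 = [3.0, 5.0, 2.0, 4.0]  # Y_i(0)
y1 = [4.0, 5.0, 5.0, 3.0]  # Y_i(1)
n = len(y0)

# SATE: a fixed number once the sample's potential outcomes are fixed.
sate = mean(a - b for a, b in zip(y1, y0))


def diff_in_means(treated):
    """Difference in observed means when the units in `treated` are treated."""
    t = [y1[i] for i in range(n) if i in treated]
    c = [y0[i] for i in range(n) if i not in treated]
    return mean(t) - mean(c)


# Enumerate every assignment that treats exactly 2 of the 4 units.
estimates = [diff_in_means(set(s)) for s in combinations(range(n), 2)]
avg_estimate = mean(estimates)

print(sate)          # 0.75
print(estimates)     # [1.5, 0.0, 0.0, 1.5, 1.5, 0.0]
print(avg_estimate)  # 0.75
```

The SATE stays at 0.75 whatever the assignment; only the estimate varies. In this toy design the estimates average to the SATE across assignments, although no single assignment recovers it exactly.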

Conditional Average Treatment Effect (CATE)

Definition: Conditional Average Treatment Effect (CATE)

The average treatment effect for all units with covariate value $X_{i} = x$ :
$τ (x) \equiv E [Y_{i} (1) - Y_{i} (0) ∣ X_{i} = x] = μ_{1} (x) - μ_{0} (x)$
where $μ_{z} (x) \equiv E [Y_{i} (z) ∣ X_{i} = x]$ for $z = 0, 1$ .

The CATE captures treatment effect heterogeneity — how the average effect varies across covariate subgroups. Estimating $τ (x)$ as a function of $x$ is a central goal in modern causal inference.
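For a discrete covariate, $τ (x)$ is just a difference of subgroup means. The sketch below treats a hypothetical four-unit table as the whole population, so the group means play the role of $μ_{0} (x)$ and $μ_{1} (x)$; the covariate labels and outcome values are invented for illustration.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical finite population: (x, Y(0), Y(1)) for each unit.
units = [
    ("young", 3.0, 4.0),
    ("young", 5.0, 5.0),
    ("old", 2.0, 5.0),
    ("old", 4.0, 3.0),
]


def cate_by_group(units):
    """Return {x: mu_1(x) - mu_0(x)} for each covariate value x."""
    groups = defaultdict(list)
    for x, y0, y1 in units:
        groups[x].append((y0, y1))
    result = {}
    for x, pairs in groups.items():
        mu0 = mean(p[0] for p in pairs)  # mu_0(x)
        mu1 = mean(p[1] for p in pairs)  # mu_1(x)
        result[x] = mu1 - mu0
    return result


cate = cate_by_group(units)
print(cate)  # {'young': 0.5, 'old': 1.0}
```

The two subgroups have different average effects even though the individual effects inside each subgroup also differ, which is the sense in which the CATE summarizes heterogeneity at the covariate level rather than the unit level.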

Population Average Treatment Effect (PATE)

Definition: Population Average Treatment Effect (PATE)

Averaging the CATE (or ITE) over the covariate distribution $F (x; θ_{X})$ of a target population:
$τ^{P} \equiv E [Y_{i} (1) - Y_{i} (0)] = E [τ (X_{i})]$

The PATE is a function of the distribution of potential outcomes in a population.
In observational studies where the target population is the population from which the sample is drawn, PATE is typically the estimand of interest.
In randomized experiments, SATE is often the primary estimand.
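With a discrete covariate, the identity $τ^{P} = E [τ (X_{i})]$ reduces to a weighted sum $\sum_{x} τ (x) P (X = x)$. The sketch below uses hypothetical CATE values and hypothetical covariate shares to show how the average depends on which distribution supplies the weights.

```python
# Hypothetical CATEs for a binary covariate, with hypothetical covariate shares.
cate = {"young": 0.5, "old": 1.0}          # tau(x)
pop_share = {"young": 0.75, "old": 0.25}   # P(X = x) in the target population
sample_share = {"young": 0.5, "old": 0.5}  # shares in a non-representative sample


def average_effect(cate, weights):
    """Weighted average of tau(x) using the given covariate distribution."""
    return sum(cate[x] * weights[x] for x in cate)


pate = average_effect(cate, pop_share)
sample_weighted = average_effect(cate, sample_share)

print(pate)             # 0.625
print(sample_weighted)  # 0.75
```

The same $τ (x)$ yields different averages under the two sets of weights, so the target population has to be fixed before the PATE is well defined.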

SATE vs. PATE distinction

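Both ITE and CATE are important for characterizing treatment effect heterogeneity, but they are different quantities: the ITE is a contrast for a single unit, whereas the CATE is an average over all units with the same covariate value. They are sometimes conflated in the literature.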

SATE = average of ITEs over the specific sample

PATE = average of ITEs over the population distribution

Mixed Average Treatment Effect (MATE)

Definition: Mixed Average Treatment Effect (MATE)

Replace the population distribution $F (x; θ_{X})$ in the PATE with the empirical distribution $\hat{F}_{X}$ of covariates in the sample:
$τ^{M} \equiv (β_{1} - β_{0})^{\prime} \bar{X} = N^{-1} \sum_{i=1}^{N} τ (X_{i}; θ_{Y})$
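where $τ (x; θ_{Y})$ is the CATE $τ (x)$ evaluated at the outcome-model parameter $θ_{Y}$. The first expression corresponds to a linear outcome model, $μ_{z} (x) = β_{z}^{\prime} x$, with $\bar{X}$ the sample mean of the covariates.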

The MATE is a convenient approximation to the PATE: it conditions on the observed $X$ values rather than integrating over the population distribution.
Most Bayesian causal inference in practice focuses on the MATE (rather than PATE or SATE).
The distinction: PATE has the largest uncertainty; SATE has the smallest; MATE is in between.
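The two expressions for $τ^{M}$ can be checked numerically. This is a minimal sketch assuming a linear outcome model $μ_{z} (x) = β_{z}^{\prime} x$, which is one reading of the $(β_{1} - β_{0})^{\prime} \bar{X}$ form. The covariate rows and coefficient values are hypothetical and stand in for a single value of $θ_{Y}$.

```python
from statistics import mean

# Hypothetical covariate rows (intercept, x) and linear-model coefficients.
X = [[1.0, 2.0], [1.0, 4.0], [1.0, 6.0]]
beta0 = [1.0, 0.5]  # coefficients of mu_0(x), assumed for illustration
beta1 = [2.0, 1.0]  # coefficients of mu_1(x), assumed for illustration


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def tau(x):
    """CATE under the assumed linear model: mu_1(x) - mu_0(x)."""
    return dot(beta1, x) - dot(beta0, x)


# MATE as the sample average of tau(X_i; theta_Y).
mate_avg = mean(tau(x) for x in X)

# MATE as (beta_1 - beta_0)' X-bar.
x_bar = [mean(col) for col in zip(*X)]
beta_diff = [b1 - b0 for b1, b0 in zip(beta1, beta0)]
mate_linear = dot(beta_diff, x_bar)

print(mate_avg, mate_linear)  # 3.0 3.0
```

In a Bayesian analysis one would presumably repeat this calculation for each posterior draw of the coefficients, holding the observed `X` fixed, to obtain a posterior distribution for the MATE.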

Summary Table

Estimand	Formula	Population	Key feature
ITE	$Y_{i} (1) - Y_{i} (0)$	Unit $i$	Never observed; target of imputation
SATE	$N^{- 1} \sum_{i} τ_{i}$	Sample	Non-random given sample
CATE	$μ_{1} (x) - μ_{0} (x)$	Subgroup $X = x$	Captures heterogeneity
PATE	$E [τ (X_{i})]$	Population	Requires population distribution
MATE	$N^{- 1} \sum_{i} τ (X_{i}; θ_{Y})$	Sample (empirical $X$ )	Most used in Bayesian CI

Principal Causal Effects

In settings with complex assignment mechanisms (e.g., instrumental variables), one may define stratum-specific effects:

Definition: Principal Causal Effects

For compliance stratum $U_{i} \in \{co, at, nt, df\}$ (compliers, always-takers, never-takers, defiers), the stratum-specific effect is:
$τ_{u} \equiv E [Y_{i} (1) - Y_{i} (0) ∣ U_{i} = u]$
These are called principal causal effects.
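A small sketch may help fix ideas. It assumes the usual convention that the stratum is determined by the pair of potential treatment values $(D_{i} (0), D_{i} (1))$ under each assignment; this notation is not defined in this note. The units and outcomes are hypothetical, and both potential values are visible only because the data are simulated.

```python
from collections import defaultdict
from statistics import mean

# Assumed mapping from (D(0), D(1)) to compliance stratum.
STRATUM = {(0, 1): "co", (1, 1): "at", (0, 0): "nt", (1, 0): "df"}

# Hypothetical units: (D(0), D(1), Y(0), Y(1)), with Y indexed by assignment.
units = [
    (0, 1, 3.0, 5.0),  # complier
    (0, 1, 2.0, 3.0),  # complier
    (1, 1, 4.0, 4.0),  # always-taker
    (0, 0, 1.0, 1.0),  # never-taker
]


def principal_effects(units):
    """Average Y(1) - Y(0) within each compliance stratum present in the data."""
    effects = defaultdict(list)
    for d0, d1, y0, y1 in units:
        effects[STRATUM[(d0, d1)]].append(y1 - y0)
    return {u: mean(v) for u, v in effects.items()}


tau_u = principal_effects(units)
print(tau_u)  # {'co': 1.5, 'at': 0.0, 'nt': 0.0}
```

With real data the stratum label is only partially observed, since only one of the two potential treatment values is seen for each unit.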

See Instrumental Variables and Principal Stratification for the full IV/compliance framework.

Connections

Potential Outcomes Framework — the setup that defines these estimands
General Structure of Bayesian CI — Bayesian inference for these estimands via posterior imputation
Frequentist Causal Estimation — frequentist estimators targeting PATE/SATE/CATE
Bayesian Outcome Models — outcome model $μ_{z} (x)$ used to estimate CATE
