T-Learner and Minimax Rate

Summary

The T-learner (two-learner) fits separate base learners for the treatment and control response functions, then estimates CATE as their difference. It achieves the minimax optimal rate for CATE estimation under standard smoothness conditions. However, it is suboptimal for unbalanced treatment groups — the smaller group’s response function is estimated with higher variance, which propagates into the CATE estimate.

Definition and Algorithm

Definition: T-Learner

Step 1 (first stage): Fit separate response functions on each arm:

Step 2: Estimate CATE as:

Minimax Rate Theorem

Theorem 1: Minimax Rate of T-Learner (Künzel et al. 2019)

For a family of superpopulations from (where controls base function smoothness and controls CATE smoothness), there exist base learners for the T-learner such that:

where is the total number of units, is the number of treated units, and is a constant.

Interpretation: The T-learner rate is limited by the smaller of the two sample sizes ( treated vs. control). When groups are balanced, both converge at rate , which is minimax optimal if the response functions and CATE have the same smoothness.

Key limitation: If the treatment group is much smaller (), the T-learner is limited by — it cannot exploit the large control group to improve estimation of the treatment response.

When T-Learner Fails

Consider an experiment where:

  • Control group: observations
  • Treatment group: observations
  • True CATE: constant

The T-learner fits on only 100 observations → high variance → the CATE estimate inherits that variance. The large control group provides no benefit.

The X-learner addresses precisely this failure mode — see X-Learner.

Properties

Key advantage:

  • Completely separates treatment and control estimation → no interference between groups
  • Theorem 1 guarantees minimax optimality when groups are balanced and

Key weakness:

  • Suboptimal when (or vice versa): limited by smaller group’s sample size
  • Cannot exploit cross-group information (unlike X-learner)

Connections

See Also