T-Learner and Minimax Rate
Summary
The T-learner (two-learner) fits separate base learners for the treatment and control response functions, then estimates CATE as their difference. It achieves the minimax optimal rate for CATE estimation under standard smoothness conditions. However, it is suboptimal for unbalanced treatment groups — the smaller group’s response function is estimated with higher variance, which propagates into the CATE estimate.
Definition and Algorithm
Definition: T-Learner
Step 1 (first stage): Fit separate response functions on each arm:
Step 2: Estimate CATE as:
Minimax Rate Theorem
Theorem 1: Minimax Rate of T-Learner (Künzel et al. 2019)
For a family of superpopulations from (where controls base function smoothness and controls CATE smoothness), there exist base learners for the T-learner such that:
where is the total number of units, is the number of treated units, and is a constant.
Interpretation: The T-learner rate is limited by the smaller of the two sample sizes ( treated vs. control). When groups are balanced, both converge at rate , which is minimax optimal if the response functions and CATE have the same smoothness.
Key limitation: If the treatment group is much smaller (), the T-learner is limited by — it cannot exploit the large control group to improve estimation of the treatment response.
When T-Learner Fails
Consider an experiment where:
- Control group: observations
- Treatment group: observations
- True CATE: constant
The T-learner fits on only 100 observations → high variance → the CATE estimate inherits that variance. The large control group provides no benefit.
The X-learner addresses precisely this failure mode — see X-Learner.
Properties
Key advantage:
- Completely separates treatment and control estimation → no interference between groups
- Theorem 1 guarantees minimax optimality when groups are balanced and
Key weakness:
- Suboptimal when (or vice versa): limited by smaller group’s sample size
- Cannot exploit cross-group information (unlike X-learner)
Connections
- Extends Metalearners for CATE framework
- Limitation motivates X-Learner design
- Rate result uses def-family families
See Also
- Metalearners for CATE — framework
- S-Learner — simpler single-model approach
- X-Learner — overcomes T-learner’s unbalanced group limitation