Within-Between Persons Causal Inference
Summary
The within/between-persons distinction matters for causal inference because different types of confounders operate at each level. Between-persons data from randomized experiments recovers average causal effects. Within-persons (fixed-effects) data eliminates time-invariant confounders but not time-varying ones. Neither level is universally superior — the right choice depends on the estimand and the confounding structure of the substantive problem.
Overview
A between-persons association compares how people who differ in their average X differ in their average Y. A within-persons association compares how individual ‘s Y changes when ‘s X changes from its usual level.
These can be statistically independent and can even have opposite signs — a classic Simpson’s paradox situation in longitudinal data.
Between-Persons Data and Average Causal Effects
Theorem: Between-Persons Data Under Randomization (pp. 2–3)
Individual causal effect: . Average treatment effect (ATE):
Under randomization (exchangeability: ):
Between-persons comparisons from randomized experiments recover the ATE without requiring longitudinal data. The consistency assumption () and the exchangeability induced by randomization are sufficient.
Going beyond experiments requires strong assumptions — CIA (Conditional Independence Assumption), IV (Instrumental Variables), etc. — that may or may not be defensible.
Why Within-Persons Data Is Not Necessary
- Randomized experiments are between-persons comparisons and do recover causal effects
- Multiple observations per person are not required for causal inference in the average sense
- The value of longitudinal data is in enhancing causal claims (removing certain confounders, tracking dynamics), not in being necessary for them
Why Within-Persons Data Is Not Sufficient
Problem: Time-Varying Confounders Survive Within-Person Designs
Within-persons designs (fixed effects, within-person mean centering) control for time-invariant confounders whose effects do not change over the study duration.
But time-varying confounders — events, moods, social circumstances that fluctuate within a person — survive demeaning. They can confound within-person associations even after all between-person variance is removed.
Example (talkativeness and happiness ):
- Social events on day increase both and
- The fixed-effects model demeans each person’s and , removing stable extraversion
- But social events (within-person, time-varying) still confound the residual association
Why Within-Persons Data Can Be Very Helpful
-
Removes time-invariant confounders: Stable personality traits, demographics, genetics — a large and important class of potential confounders — are eliminated by fixed-effects demeaning without having to measure them
-
Individual-level causal effects: Longitudinal within-person data enables estimation of individual-level effects for specific people, not just averages
-
Mechanism and dynamics: Reciprocal dynamics, mediation pathways, and temporal ordering of cause and effect can only be studied with longitudinal data
Confounding at Each Level
| Confounding Type | Between-Persons | Within-Persons (FE) |
|---|---|---|
| Time-invariant traits (stable personality) | Confounds (unless randomized) | Controlled |
| Time-varying circumstances (social events, moods) | Partially (with covariate adjustment) | Not controlled |
| Reciprocal dynamics () | Not directly modeled | Not directly modeled (requires CLPM/DPM) |
The Noncausal Research Question
Not all longitudinal research needs to be causal. Descriptive questions — “what is the age trajectory of happiness on average in this population?” — are legitimate and important. The distinction:
- Descriptive/predictive: Associations and trajectories as they are
- Causal: What would happen if we intervened to change X?
Problems arise when researchers implicitly claim causal interpretations for noncausal analyses, or use causal language without causal designs.
Connections
- Potential Outcomes Framework provides the formal basis for individual and average causal effects
- The Selection Problem is the core challenge at the between-persons level (without randomization)
- Directed Acyclic Graphs make the confounding structure explicit for each level
- Fixed-Effects Model is the canonical implementation of within-persons causal analysis
- Cross-Lagged and Dynamic Panel Models extend within-persons to lagged reciprocal effects
See Also
- Fixed-Effects Model — the within-persons approach, assumptions, and limitations
- Estimands in Longitudinal Research — how to define the right target before choosing a level
- Within-Between Persons Distinction - Overview — paper overview