Table 2 Fallacy
Summary
The Table 2 Fallacy is the practice of presenting regression confounders (nuisance parameters) alongside the treatment of interest and interpreting their coefficients as total causal effects. Because the adjustment set that identifies the treatment effect almost never simultaneously identifies the confounders themselves, such interpretations are statistically and causally invalid — and often severely biased.
Overview
Named by Westreich and Greenland (2013), the Table 2 Fallacy describes a pervasive practice in social science, epidemiology, economics, and political science: a researcher fits a multivariable regression to estimate the causal effect of treatment on outcome while adjusting for a set of confounders . The results table then presents all coefficients — not just ‘s — with stars, confidence intervals, and causal-sounding language. Subsequent papers cite these confounder coefficients as theoretically meaningful contributions.
The fallacy is pernicious because it appears rigorous. The model is correctly specified for identifying ; the mistake is in assuming that same specification also identifies for each confounder.
Table 2 Fallacy
The Table 2 Fallacy is the erroneous practice of presenting and interpreting regression coefficients for confounders (nuisance parameters) included in an adjustment set as if they were valid total-effect estimates. The fallacy arises because the adjustment set chosen to identify the treatment effect does not generally satisfy the identification conditions for any other causal path in the same model.
The Core Argument
What Regression Adjustment Does
When the goal is to estimate the causal effect of on , we select a valid adjustment set that satisfies the backdoor criterion relative to the path (see DAGs and Causal Identification). Under this condition and the Stable Unit Treatment Value Assumption (SUTVA), regression adjustment consistently estimates:
Critically, is a sacrifice made to block backdoor paths into . Its members are chosen to satisfy the backdoor criterion for — not for .
Why Confounders Cannot Be Jointly Identified
Consider the DAG in the Nafa (2022) example: are measured; are unobserved. The adjustment set blocks all backdoor paths for .
But paths like and are confounded by biasing paths and . Since and are unobserved, it is mathematically impossible to simultaneously identify both and without additional identifying assumptions.
Non-Joint Identification of Adjustment Sets
Let be a valid adjustment set for the causal path in a DAG . For any , the same adjustment set does not generally satisfy the backdoor criterion for the path . Identification of requires a separate, valid adjustment set for that path, which may require conditioning on variables not included in and leaving others out.
In the simple confounded DAG (, ), the biasing path cannot be blocked without measuring . If is unobserved, ‘s coefficient in any regression of on and is biased for ‘s causal effect — regardless of how large is.
The Logic Applies Regardless of Inferential Framework
Nafa (2022) notes that the problem is not Bayesian versus frequentist. A correctly specified Bayesian model with weakly informative priors will still return posterior distributions for confounder coefficients that systematically miss the true value when the path is unidentified. Larger samples make matters worse: as , the posterior concentrates around the biased value with increasing certainty.
Downstream Consequences
- Scientific literature accumulation of false knowledge: Papers citing a biased confounder coefficient as a causal estimate create a chain of pseudo-scientific claims.
- Magnitude errors compound: Subsequent meta-analyses, effect-size aggregations, and theory-building built on Table 2 estimates inherit and amplify the original bias.
- Graduate training propagation: Teaching students to “interpret every coefficient” instills poor practices that propagate through academic generations.
What the Simulations Show
See Nuisance Parameter Bias Simulation. When the unobserved confounder is correlated with :
- 90% credible intervals for ‘s coefficient capture the true value only 0–1% of the time
- Coverage rates for nuisance parameters , , are 6–29% even under the favorable condition where and are independent
- These error rates do not improve with sample size — they get worse as grows from 2,500 to 10,000
When Multiple Identification Is Possible
There is one way to escape the fallacy: adopt additional identifying assumptions that permit joint identification. If one can defend the assumption that is conditionally independent of (e.g., through a natural experiment, instrumental variables, or domain-specific arguments), then both and may be jointly identified using the adjustment set (excluding from the adjustment set for its own identification).
This requires:
- A separate DAG analysis for each causal path of interest
- Explicit identification strategy for each path
- Separate defense of the required assumptions
Connections
- DAGs and Causal Identification — The backdoor criterion defines when an adjustment set is valid for a specific path; the Table 2 Fallacy is the error of assuming one valid set works for all paths simultaneously
- Logic of Regression Adjustment — What the adjustment set actually identifies: the PATE for the primary treatment only
- Nuisance Parameter Bias Simulation — Empirical demonstration of the magnitude of bias under this fallacy
- Potential Outcomes Framework — The potential outcomes definition of causal effects underlying the fallacy
- Garden of Forking Paths — Related methodological failure: implicit multiple comparisons from data-contingent analysis
- Researcher Degrees of Freedom — Related failure mode: the flexibility in choosing which confounders to “interpret”
See Also
- Bayesian Propensity Score Weighting — Correct approach: use DAG to select adjustment set for the treatment, then focus interpretation solely on the treatment effect
- The Selection Problem — The fundamental challenge that makes adjustment necessary and limits what can be identified
- Observational vs Experimental Methods in Advertising — Activity bias as a case where confounders (activity) cause 10–1000x overestimates when treated as if identified