Table 2 Fallacy

Summary

The Table 2 Fallacy is the practice of presenting regression confounders (nuisance parameters) alongside the treatment of interest and interpreting their coefficients as total causal effects. Because the adjustment set that identifies the treatment effect almost never simultaneously identifies the confounders themselves, such interpretations are statistically and causally invalid — and often severely biased.

Overview

Named by Westreich and Greenland (2013), the Table 2 Fallacy describes a pervasive practice in social science, epidemiology, economics, and political science: a researcher fits a multivariable regression to estimate the causal effect of treatment on outcome while adjusting for a set of confounders . The results table then presents all coefficients — not just ‘s — with stars, confidence intervals, and causal-sounding language. Subsequent papers cite these confounder coefficients as theoretically meaningful contributions.

The fallacy is pernicious because it appears rigorous. The model is correctly specified for identifying ; the mistake is in assuming that same specification also identifies for each confounder.

Table 2 Fallacy

The Table 2 Fallacy is the erroneous practice of presenting and interpreting regression coefficients for confounders (nuisance parameters) included in an adjustment set as if they were valid total-effect estimates. The fallacy arises because the adjustment set chosen to identify the treatment effect does not generally satisfy the identification conditions for any other causal path in the same model.

The Core Argument

What Regression Adjustment Does

When the goal is to estimate the causal effect of on , we select a valid adjustment set that satisfies the backdoor criterion relative to the path (see DAGs and Causal Identification). Under this condition and the Stable Unit Treatment Value Assumption (SUTVA), regression adjustment consistently estimates:

Critically, is a sacrifice made to block backdoor paths into . Its members are chosen to satisfy the backdoor criterion for — not for .

Why Confounders Cannot Be Jointly Identified

Consider the DAG in the Nafa (2022) example: are measured; are unobserved. The adjustment set blocks all backdoor paths for .

But paths like and are confounded by biasing paths and . Since and are unobserved, it is mathematically impossible to simultaneously identify both and without additional identifying assumptions.

Non-Joint Identification of Adjustment Sets

Let be a valid adjustment set for the causal path in a DAG . For any , the same adjustment set does not generally satisfy the backdoor criterion for the path . Identification of requires a separate, valid adjustment set for that path, which may require conditioning on variables not included in and leaving others out.

In the simple confounded DAG (, ), the biasing path cannot be blocked without measuring . If is unobserved, ‘s coefficient in any regression of on and is biased for ‘s causal effect — regardless of how large is.

The Logic Applies Regardless of Inferential Framework

Nafa (2022) notes that the problem is not Bayesian versus frequentist. A correctly specified Bayesian model with weakly informative priors will still return posterior distributions for confounder coefficients that systematically miss the true value when the path is unidentified. Larger samples make matters worse: as , the posterior concentrates around the biased value with increasing certainty.

Downstream Consequences

  1. Scientific literature accumulation of false knowledge: Papers citing a biased confounder coefficient as a causal estimate create a chain of pseudo-scientific claims.
  2. Magnitude errors compound: Subsequent meta-analyses, effect-size aggregations, and theory-building built on Table 2 estimates inherit and amplify the original bias.
  3. Graduate training propagation: Teaching students to “interpret every coefficient” instills poor practices that propagate through academic generations.

What the Simulations Show

See Nuisance Parameter Bias Simulation. When the unobserved confounder is correlated with :

  • 90% credible intervals for ‘s coefficient capture the true value only 0–1% of the time
  • Coverage rates for nuisance parameters , , are 6–29% even under the favorable condition where and are independent
  • These error rates do not improve with sample size — they get worse as grows from 2,500 to 10,000

When Multiple Identification Is Possible

There is one way to escape the fallacy: adopt additional identifying assumptions that permit joint identification. If one can defend the assumption that is conditionally independent of (e.g., through a natural experiment, instrumental variables, or domain-specific arguments), then both and may be jointly identified using the adjustment set (excluding from the adjustment set for its own identification).

This requires:

  • A separate DAG analysis for each causal path of interest
  • Explicit identification strategy for each path
  • Separate defense of the required assumptions

Connections

See Also