Forking Paths and Bayesian Approaches
Summary
The Garden of Forking Paths problem is fundamentally about frequentist p-values being invalid when analysis is data-contingent. Bayesian and multilevel approaches offer a principled alternative by regularizing estimates and naturally accounting for multiplicity.
Why P-Values Are Vulnerable
The p-value’s interpretation depends on the sampling distribution of the test statistic under repetition of the same procedure. But if the procedure itself changes with the data (), the standard sampling distribution is wrong. The p-value is computed as if the test were pre-specified when it wasn’t.
The Bayesian Alternative
As Gelman & Loken note at the end of their paper, once we abandon the p-value framework:
We can take the observed result as data and update beliefs using Bayes’ theorem.
For example, Bem’s ESP result (53.1% hit rate, ) can be reanalyzed: with a reasonable prior, the posterior probability of a meaningful effect is far lower than the p-value suggests.
Hierarchical Models as Natural Regularization
Hierarchical Models provide a structural solution to the multiple comparisons problem:
- Partial pooling: estimates for many groups are automatically shrunk toward the grand mean
- Groups with less data are regularized more heavily
- This is equivalent to an implicit multiplicity correction — but derived from the model structure, not an ad hoc penalty
- The James-Stein phenomenon: pooled estimates dominate unpooled ones
Practical Recommendations
- Use multilevel models when studying effects across many groups or conditions
- Report uncertainty: full posterior intervals convey strength of evidence better than p-values
- Pre-register analyses to reduce (but not eliminate) forking paths
- Regularize: priors that incorporate domain knowledge prevent extreme estimates from noisy data
See Also
- Garden of Forking Paths — the problem statement
- Researcher Degrees of Freedom — the sources of analytic flexibility
- Multiple Testing Corrections — the frequentist alternative (Bonferroni, FDR) for multiplicity
- Hierarchical Models — the Bayesian structural solution via partial pooling
- Model Checking — evaluating whether the model is adequate
- The Experimental Ideal — pre-registration as a complementary (frequentist) safeguard
- Asymptotics and Frequentist Connections — how Bayesian and frequentist inference relate formally
- Power Analysis and Sample Size — under-powered studies amplify forking paths; Bayesian posterior simulation as an alternative to power calculations
- Pre-registration and Open Science - Overview — how to actually pre-register (the practice this note recommends)