Robbins Formula and Poisson Empirical Bayes
Summary
Robbins’ formula is the foundational nonparametric empirical Bayes result: in the Poisson model, the Bayes posterior mean of an unobserved rate can be expressed entirely through the marginal density of the observed counts, with no need to specify the prior . For an observed count , the EB estimate of the true rate is , where is replaced by the empirical frequency of counts across the many parallel cases. This is the most general expression of Efron’s theme that the prior can be learned from the experience of others.
Overview
Herbert Robbins was the principal developer of the program explicitly named “empirical Bayes.” His branch aimed to show how a frequentist, by exploiting the marginal distribution of pooled data across many parallel cases, could achieve full Bayesian efficiency — without ever specifying the prior. Where Stein’s branch (the James-Stein Estimator) is parametric (estimate one hyperparameter ), Robbins’ Poisson construction is nonparametric: the entire posterior-mean function is read off from observed marginal frequencies.
The setup: many independent units, unit has an unknown Poisson rate drawn from an unknown prior , and we observe a count . We want the Bayes estimate for a unit observed to have count .
Main Content
Poisson marginal density ^poisson-marginal
With prior and likelihood , the marginal probability of observing count is
Crucially, is directly estimable from the data as the observed fraction of units having count .
Robbins' formula (Poisson empirical Bayes) ^robbins-formula
The Bayes posterior mean of given an observed count depends on the prior only through the marginal :
Proof idea. Using , the numerator of the posterior mean equals , while the denominator is . The prior cancels out entirely.
Empirical Bayes estimator ^robbins-eb-estimator
Replace the unknown marginal by the empirical frequencies across the parallel units:
No parametric form for the prior is ever assumed — the “prior may exist only as a motivational device.” This is the purest realization of the empirical Bayes principle (see Empirical Bayes - Overview).
The same marginal-density logic powers the parametric (Gaussian) branch: there the marginal lets one estimate via , giving the James-Stein Estimator. Robbins’ version is more general because it estimates the whole posterior-mean curve rather than a single hyperparameter.
Examples
Insurance claims (classic Robbins application)
Suppose a large portfolio of auto-insurance policyholders each have an unknown accident rate , and last year we observed claims for policyholder . To predict next year’s expected claims for someone who filed claims, Robbins’ formula gives . If , , policyholders, then a customer with claims has predicted rate , and one with claim has — each shrunk relative to the raw count, learned purely from the marginal counts.
Relation to Efron's batting data
While Efron’s worked Chapter 1 example (18 baseball players, James-Stein Estimator) uses the Gaussian/parametric branch, it shares Robbins’ driving idea: each player’s prediction is improved by exploiting the marginal distribution of all players’ early-season averages rather than that player’s data alone.
Connections
- Empirical Bayes - Overview — the broader EB program and the marginal-density view this specializes.
- James-Stein Estimator — the parametric Gaussian counterpart; estimates one hyperparameter from the marginal.
- Empirical Bayes Interpretation of Shrinkage — both branches are instances of estimating the prior from data.
- Multiple Comparisons - Bayesian Perspective — Robbins’ testing branch (effects piling up at ) connects to large-scale multiplicity.