Monsters and Mixtures

Summary

Chapters 9–11 of Statistical Rethinking cover generalized linear models (GLMs) through the lens of maximum entropy, then extend to “monster” models: zero-inflated Poisson, beta-binomial, gamma-Poisson (negative binomial), and ordered categorical outcomes.

Maximum Entropy and GLMs (Ch 9)

Why use exponential family distributions? Because they are the maximum entropy distributions for given constraints:

Constraint	MaxEnt Distribution	Link
Known mean and variance	Gaussian	Identity
Two outcomes, constant $p$	Binomial	Logit
Count of events, constant rate	Poisson	Log

Nature Loves Entropy

Exponential family distributions arise naturally because there are more ways to produce them than any other distribution with the same constraints. Using them is not an assumption about mechanism — it’s the least informative choice.

The GLM framework:

y_{i} \sim Distribution (θ_{i})

f (θ_{i}) = α + β x_{i}

where $f$ is the link function that maps the linear model to the natural parameter.

Counting and Classification (Ch 10)

Binomial Regression (Logistic)

Model binary or proportion outcomes
Logit link: $lo g \frac{p _{i}}{1 - p _{i}} = α + β x_{i}$
Interpret on log-odds scale; exponentiate for odds ratios

Poisson Regression

Model counts when there’s no known maximum
Log link: $lo g λ_{i} = α + β x_{i}$
Offset term for varying exposure: $lo g λ_{i} = lo g τ_{i} + α + β x_{i}$

Monsters and Mixtures (Ch 11)

Ordered Categorical (Ordinal)

Cumulative logit model: each threshold gets its own intercept
$Pr (y_{i} \leq k) = logit^{- 1} (α_{k} - ϕ_{i})$

Zero-Inflated Poisson

A mixture: with probability $p$ the outcome is always 0 (never even attempts); with probability $1 - p$ it follows a Poisson process.

Pr (y = 0) = p + (1 - p) e^{- λ}

Pr (y = k, k > 0) = (1 - p) \frac{λ ^{k} e ^{- λ}}{k !}

Over-Dispersed Models

When variance exceeds what the simple model predicts:

Beta-binomial: continuous mixture of binomial probabilities
Gamma-Poisson (negative binomial): continuous mixture of Poisson rates

These are the observational-level equivalents of multilevel models — they model unexplained heterogeneity without explicitly modeling groups.

Second Brain

Explorer

Monsters and Mixtures

Monsters and Mixtures

Maximum Entropy and GLMs (Ch 9)

Counting and Classification (Ch 10)

Binomial Regression (Logistic)

Poisson Regression

Monsters and Mixtures (Ch 11)

Ordered Categorical (Ordinal)

Zero-Inflated Poisson

Over-Dispersed Models

See Also

Graph View

Table of Contents

Backlinks