Probability and Bayesian Inference
Summary
Chapter 1 of BDA3 establishes the three steps of Bayesian data analysis and the notation used throughout the book. Bayesian inference treats all unknowns as random variables with probability distributions.
The Three Steps of Bayesian Data Analysis
- Set up a full probability model — a joint distribution for all observable and unobservable quantities, consistent with domain knowledge
- Condition on observed data — compute the posterior distribution
- Evaluate model fit — check whether the model is adequate via posterior predictive checks and sensitivity analysis
Core Formula
Bayes’ theorem is the foundation:
where is the prior, is the likelihood, and is the posterior.
Key Concepts
- Notation: joint density — conditioning on the model is always implicit
- Prediction: posterior predictive distribution
- Law of total variance:
- Computation: simulation-based inference — draw samples from the posterior and use sample statistics as estimates (see Introduction to Bayesian Computation)
- Uses R and Stan for computation (Appendix C)
See Also
- Single-Parameter Models — first concrete applications of Bayes’ theorem
- Hierarchical Models — modeling with multiple levels of uncertainty
- Bayesian Workflow - Overview — the full iterative workflow beyond these three steps
- Overfitting and Information Criteria — KL divergence and the information-theoretic score criteria are built on the probability framework here