Glossary

Modeling Terms

Adstock: A parametric transformation that models the carry-over effect of advertising: today's media exposure influences sales today and over a decaying window of subsequent periods. Common forms include geometric (single decay rate) and Weibull (more flexible curve). See the Technical Guide for the math and the transforms/adstock.py module for the implementation.
Saturation: A parametric transformation capturing diminishing marginal returns to media spend — each additional dollar at high spend levels adds less than each additional dollar at low spend levels. Common forms include Hill, logistic, and Michaelis-Menten curves. Implemented in transforms/saturation.py.
MMM (Marketing Mix Model): A regression-style model that decomposes an outcome metric (revenue, conversions) into contributions from media spend, price, distribution, seasonality, and other drivers. The framework's MMMs are Bayesian and use saturation + adstock transforms per channel. See: Modeling Guide, Technical Guide
MFF (Master Flat File): The framework's input data format — a single long-format table with one row per period (and optionally geography, product, campaign, outlet, creative) per variable: a VariableName column says which series the row belongs to (the KPI, a media channel, a control) and a VariableValue column holds its value. Loaded via MFFLoader; column mappings and variable configuration are validated against a schema (MFFConfig) before fitting.
Hierarchical model: A model where parameters are themselves drawn from a higher-level distribution — for example, channel-level coefficients drawn from a category-level prior. Pooling tightens estimates for data-poor channels by borrowing strength from data-rich siblings. Critical for portfolio MMMs with many channels.
ROI / mROI: Return on investment — outcome generated per unit of spend (e.g., $1.40 of revenue per $1 of media spend). Marginal ROI (mROI) is the return on the next dollar at the current spend level — the relevant quantity for budget allocation under saturation.
ROAS (Return on Ad Spend): Revenue attributed to a channel divided by its spend — a ROAS of 2.1 means $2.10 of revenue per $1 spent. Distinct from profit-based ROI, which nets out margin and costs; the docs quote revenue ROAS unless stated otherwise. As with ROI, the decision-relevant quantity under saturation is the marginal ROAS of the next dollar, not the average.
Baseline / contribution: The two halves of a decomposition. The baseline is the outcome the model says would have occurred with no media — trend, seasonality, price, distribution, and other non-media drivers. A channel's contribution is the incremental outcome attributed to it; baseline plus contributions sum to the model's predicted outcome. Contributions are posterior distributions, not single numbers — report intervals.
Counterfactual: The unobserved outcome that would have occurred under a different action — e.g., "what would sales have been if we had spent zero on TV last quarter?" Computing counterfactuals is the central task of causal MMM analysis. See analysis.py and the Causal Inference guide.
Decomposition: Attributing each unit of outcome to a specific driver (channel, baseline, seasonality). Bayesian decomposition is a posterior distribution over contributions, not a single number; report intervals, not just means.
Cross-effects: When a marketing action on one product or channel affects outcomes for another (cannibalisation, halo effects, multi-pack pull-through). Ignoring cross-effects inflates per-channel ROI. Modeled via the multivariate extension; see Technical Guide.
Halo effect: A positive cross-effect: marketing for one product or channel lifts outcomes for another (e.g., advertising a new variant raising sales of the original). The mirror image of cannibalisation. Modeled as a cross-effect in the multivariate extension; ignoring halo understates portfolio-level value just as ignoring cannibalisation overstates it.

Bayesian Inference

Bayes' rule: $p(\theta \mid y) \propto p(y \mid \theta)\, p(\theta)$. Posterior beliefs about an unknown parameter $\theta$ are proportional to the likelihood of the observed data times the prior. The mathematical engine behind every model in the framework.
Prior: $p(\theta)$ — a probability distribution encoding what we believe about a parameter before seeing the current data. Priors are how we regularise; defensible priors beat flat priors. See the Bayesian Workflow guide.
Likelihood: $p(y \mid \theta)$ — the probability of observing the data given a candidate value of the parameter. Combined with the prior to form the posterior.
Posterior: $p(\theta \mid y)$ — the updated belief about a parameter after seeing the data. The posterior is what every Bayesian inference produces. Summarised by mean, median, and credible interval.
Conjugate prior: A prior whose posterior is in the same family. The Gaussian-Gaussian conjugate update (precisions add; means are precision-weighted) is the workhorse of the calibration loop.
Credible interval (CI): An interval that contains the parameter with a stated posterior probability (e.g., 95% CI). Distinct from a frequentist confidence interval — a credible interval is a direct probabilistic statement about the parameter. The default reporting unit in this framework.
HDI (Highest Density Interval): The narrowest credible interval at a given probability mass — the set of most probable parameter values. Useful when the posterior is skewed or multi-modal.
ROPE (Region of Practical Equivalence): A pre-specified range around a null value within which differences are considered practically meaningless. Decision rule: if the HDI lies entirely outside the ROPE, accept the effect; if entirely inside, reject; if overlapping, undecided.
Posterior predictive check (PPC): Simulating new data from the posterior and comparing it to the observed data. The basic smoke test for whether a Bayesian model is even capable of producing the kind of data we see. ArviZ provides standard plots; see Bayesian Workflow.
$\hat{R}$ (R-hat): A convergence diagnostic for MCMC chains. $\hat{R} \approx 1.0$ means chains have mixed; $\hat{R} > 1.01$ is a red flag. Always check before interpreting any posterior.
ESS (Effective Sample Size): The number of independent posterior samples your MCMC chain is effectively worth. Low ESS means high autocorrelation and unreliable summary statistics. Aim for ESS in the thousands for any quantity you'll report.
Divergence: A NUTS-sampler failure mode signalling that the posterior geometry has features the sampler can't navigate (typically funnels in hierarchical models). Even one divergence invalidates inference; reparameterise rather than ignore.

Measurement & Experiments

Measurement loop: The framework's operating rhythm: measurement as a cycle, not a one-time report. Fit the model (T₀) → find where uncertainty is most expensive (T₁: EIG and EVOI) → run the experiment that buys the most learning (T₂: pre-registered tests) → feed the result back into the model (T₃: calibrated refit) → reallocate budget with the sharper answer (T₄) → re-evaluate as information decays (T₅, then the cycle repeats). The Program page of the web app tracks where you are in the cycle. See: Calibration Loop, Platform Overview
EIG (Expected Information Gain): The expected reduction in posterior entropy from running an experiment. Measures how much we'd learn. Defined as the expected KL divergence from prior to posterior. Under Gaussian-Gaussian: $\mathrm{EIG} = \tfrac{1}{2}\log(1 + \sigma_{\text{prior}}^2 / \sigma_{\text{exp}}^2)$. Used to prioritise the experimental portfolio. See Calibration Loop.
EVOI (Expected Value of Information): The expected dollar improvement in budget decision quality from running an experiment. Measures how much an answer would change what we do. EIG and EVOI together drive the 2x2 priority map. EVPI is the upper bound.
EVPI (Expected Value of Perfect Information): The expected dollar gain from resolving all uncertainty about a parameter before deciding — the ceiling on what any experiment about it can be worth. The EVOI of a real (noisy, finite) experiment is always below its EVPI. Useful as a screen: if a channel's EVPI is small, no test of that channel is worth running.
Calibration: Using experimental causal estimates to anchor MMM parameters so that model-implied ROIs are consistent with experimental truth. Three mechanisms: soft prior, likelihood augmentation, and hierarchical pooling. See Calibration Loop.
Geo-lift / matched-market test: An experiment that withholds (or boosts) media spend in a set of geographic markets and compares outcomes against matched control markets. Produces a causal estimate for the tested channel that's independent of the rest of the marketing plan.
TWFE / DiD (Difference-in-Differences): A panel-data estimator that identifies treatment effects from differential changes between treated and control units across pre/post periods. The standard estimator for geo-lift readouts; outputs a point estimate and standard error that plug directly into the Bayesian update via the bridge equation.
SCM (Synthetic Control Method): An estimator that constructs a counterfactual control unit as a weighted combination of donor pool units, then attributes the post-treatment gap to the intervention. Useful when no clean matched control exists. Augmented Synthetic Control (ASC) extends with outcome model bias correction.
MDE (Minimum Detectable Effect): The smallest effect size an experiment can detect at a given power and significance level. Computed before launch from sample size, baseline variance, and design choices. Drives geo selection and test duration.
ITT (Intent-to-Treat): Analyse subjects according to their assigned arm regardless of compliance. The default and pre-registered analysis for geo-lift to avoid post-hoc adjustments biased toward results that look good.
Pre-registration: Locking in the experimental design, estimator, and primary outcome before the test runs. Eliminates the temptation to choose the analysis that produces the most favourable answer (a major source of bias).
Submodularity: A property of set functions: marginal value diminishes as the set grows. The EIG of a portfolio of experiments is submodular — meaning greedy selection is provably within $1 - 1/e \approx 63\%$ of optimum, and the test program naturally caps at 2-3 experiments per cycle.

Causal Inference

DAG (Directed Acyclic Graph): A diagram of cause-effect relationships among variables. Edges point from cause to effect; no cycles. The basic representation of a causal model. The framework's dag_model_builder validates DAGs before allowing model fits. See Causal Inference.
Confounder: A variable that causes both the treatment and the outcome — e.g., promo periods that drive both media spend and sales. Confounders must be included in an MMM; omitting them biases the media coefficient.
Mediator: A variable on the causal path from media to outcome (e.g., brand awareness mediates between TV and purchase). Including a mediator as a control absorbs the very effect you're trying to measure. Use a NestedMMM to model mediation explicitly.
Collider: A variable caused by both the treatment and the outcome. Conditioning on a collider opens a spurious path and biases estimates — the opposite mistake from omitting a confounder.
do-operator: $\mathrm{do}(X = x)$ denotes the interventional distribution: the world in which $X$ is set to $x$ by external intervention, breaking incoming causal arrows. Distinct from conditioning $X = x$ (which observes without intervening).
Backdoor criterion: A graphical condition for selecting a control set that blocks all non-causal paths between treatment and outcome. The standard tool for identifying causal effects from observational data.
Incrementality: The portion of an outcome that media actually caused — sales that happened because the campaign ran and would not have happened otherwise (the gap between the observed outcome and its counterfactual). Incrementality is the quantity both the MMM and lift experiments estimate, which is what makes them comparable and calibratable against each other. See: Causal Inference, Calibration Loop
Specification shopping: Re-running models with different choices (variables, transforms, lags) until one produces the desired result. Inflates apparent precision and biases conclusions. The framework's emphasis on pre-specified analyses is a direct response.

Data & Operations

Frequentist: The school of statistics built around long-run frequency: probabilities are limits of repeated trials, parameters are fixed, and inference is via confidence intervals and p-values. Most production geo-lift estimators are frequentist; their outputs feed Bayesian calibration through the bridge equation.
NumPyro: A JAX-backed probabilistic programming library used by the framework as a fast alternative sampler to native PyMC. Select it on the model config — ModelConfigBuilder().bayesian_numpyro() (in fact the default) — not via a fit() argument; the framework then passes nuts_sampler="numpyro" to pm.sample internally. Measured speedup: ~3x wall-clock vs sequential PyMC chains at equal draws on a 7-channel national model (down from ~6x under PyMC 5, whose native sampler was slower; narrower still against parallel PyMC chains); see nbs/validation/runtime_benchmark.ipynb.
NutPie: A Rust-backed NUTS sampler available as another fast PyMC backend. Sometimes faster than NumPyro for small-to-medium models; benchmark on your data.
MAPE (Mean Absolute Percentage Error): Standard predictive accuracy metric: average of $|y_t - \hat{y}_t| / y_t$. Reported on a held-out test window; used as the gate for whether a new model spec is at least as good as the baseline.
ARQ: Async job queue used by the API service for long-running model fits. Workers run via arq worker.WorkerSettings against a Redis backend.
Agent workspace: The AI analyst-assistant built into the React web app (the /workspace page). A chat session whose assistant has tools to validate and explore data, configure and fit models, analyze ROI and decompositions, plan and pre-register experiments, run Python in a sandboxed kernel, search an uploaded knowledge base, and generate branded client reports. Works with Anthropic, OpenAI, Google (Gemini), Vertex AI, and local LM Studio language-model providers. See: Platform Overview
Pre-period: The historical window used to estimate baseline geo-week variance and calibrate experiment designs before a test launches. Powers the noise estimate $\sigma_{\text{exp}}^2$ used in EIG/EVOI.
OLS (Ordinary Least Squares): The classical frequentist regression estimator. Many incumbent MMMs are OLS-based; the framework's recommended migration path is to fit a Bayesian MMM in parallel and only decommission OLS after two cycles of comparable answers under audit.
Uncertainty: The width of the posterior. Critical to report alongside any point estimate — a narrow interval is information; a wide interval is also information. See Interpreting Results for how to communicate it.
Variable selection: The process of deciding which controls to include. The framework's stance: confounders always, mediators never, colliders never. Apply selection only to precision controls (variables that don't bias but improve fit). See Variable Selection.

Mathematical Symbols

Notation used throughout the technical guides:

Symbol	Meaning
$\theta_k$	True causal ROI / elasticity for channel $k$
$\mu_k, \sigma_k^2$	MMM posterior mean and variance for channel $k$
$\tau_k$	Same parameter as $\theta_k$ viewed through the experimental lens
$\hat{y}_k, \hat{\tau}_k$	Experimental estimate of channel-$k$ effect
$\sigma_{\text{exp},k}^2$	Experimental noise variance per design
$b_k$	Budget allocated to channel $k$
$s_k$	Channel-$k$ spend share
$p(\theta)$	Prior distribution
$p(y \mid \theta)$	Likelihood
$p(\theta \mid y)$	Posterior distribution
$\mathcal{N}(\mu, \sigma^2)$	Gaussian distribution with mean $\mu$ and variance $\sigma^2$
$H[p]$	Differential entropy of distribution $p$
$\mathrm{KL}(p \,\\|\, q)$	Kullback-Leibler divergence from $q$ to $p$
$I(\theta; \hat{y})$	Mutual information between parameter and observation
$\mathrm{do}(X = x)$	Causal intervention setting $X$ to $x$
$\hat{R}$	MCMC convergence diagnostic (R-hat)
$\mathrm{ESS}$	Effective sample size
$\lambda_{\text{decay}}$	Calibration information decay rate

Modeling Terms

Bayesian Inference

Measurement & Experiments

Causal Inference

Data & Operations

Mathematical Symbols

Missing a term?