Aggregation of Relations
Summary
Market response models are almost always estimated on aggregate data (stores, markets, or national), while the underlying behavioral processes occur at the individual level. This note covers the conditions under which aggregate models correctly recover individual parameters, and when aggregation bias arises.
The Aggregation Problem
Aggregation Bias
Aggregation bias occurs when the functional form of an individual-level response function does not carry over to the aggregate level. If individual has response , the aggregate generally cannot be written as unless is linear or strong distributional assumptions hold.
Conditions for Exact Aggregation
Exact Aggregation (Linear Case)
For the linear model , summing over individuals:
The aggregate OLS on and recovers the same . Exact aggregation holds for linear models.
For nonlinear forms (log-log, logistic, etc.), exact aggregation fails except in special cases. The aggregate elasticity is a weighted average of individual elasticities, and the weights depend on the distribution of across the population.
Approximate Aggregation (Second-Order Correction)
For a nonlinear function , a second-order Taylor expansion around the mean gives:
The correction term is:
- Zero for linear (exact aggregation)
- Negative for concave (aggregate overestimates mean individual response)
- Positive for convex (aggregate underestimates mean individual response)
Marketing Implications
Aggregation and Advertising Response
If individual advertising response is concave (diminishing returns at the individual level), the aggregate response function is also concave but may appear more linear. The slope at the aggregate mean understates diminishing returns — i.e., the aggregate model overestimates the return to further advertising at the margin.
This matters for budget optimization: optima derived from aggregate models may differ from those that maximize individual-level expected utility.
Cross-Sectional Heterogeneity and Random Coefficients
When varies across individuals (heterogeneous response), the aggregate model picks up an average effect. The random coefficients model (see Functional Forms in Marketing) explicitly models this heterogeneity:
where . The aggregate-level regression recovers but conceals the distribution of individual effects, which matters for targeted marketing decisions.
Temporal vs. Cross-Sectional Aggregation
| Type | Mechanism | Key Bias |
|---|---|---|
| Temporal (weekly → monthly) | Carryover effects compressed | Retention rate inflated (Clarke 1976) |
| Cross-sectional (store → market) | Heterogeneous responses averaged | Elasticity biased toward mean |
| Functional-form | Nonlinear aggregated | Second-order correction term |
See Carryover Effects and Distributed Lags for temporal aggregation corrections.
Cross-Links
- Functional forms: Functional Forms in Marketing
- Temporal aggregation bias: Carryover Effects and Distributed Lags
- Hierarchical models for heterogeneity: Hierarchical Linear Models
- Random coefficients: Functional Forms in Marketing