Aggregation of Relations

Summary

Market response models are almost always estimated on aggregate data (stores, markets, or national), while the underlying behavioral processes occur at the individual level. This note covers the conditions under which aggregate models correctly recover individual parameters, and when aggregation bias arises.

The Aggregation Problem

Aggregation Bias

Aggregation bias occurs when the functional form of an individual-level response function does not carry over to the aggregate level. If individual has response , the aggregate generally cannot be written as unless is linear or strong distributional assumptions hold.

Conditions for Exact Aggregation

Exact Aggregation (Linear Case)

For the linear model , summing over individuals:

The aggregate OLS on and recovers the same . Exact aggregation holds for linear models.

For nonlinear forms (log-log, logistic, etc.), exact aggregation fails except in special cases. The aggregate elasticity is a weighted average of individual elasticities, and the weights depend on the distribution of across the population.

Approximate Aggregation (Second-Order Correction)

For a nonlinear function , a second-order Taylor expansion around the mean gives:

The correction term is:

  • Zero for linear (exact aggregation)
  • Negative for concave (aggregate overestimates mean individual response)
  • Positive for convex (aggregate underestimates mean individual response)

Marketing Implications

Aggregation and Advertising Response

If individual advertising response is concave (diminishing returns at the individual level), the aggregate response function is also concave but may appear more linear. The slope at the aggregate mean understates diminishing returns — i.e., the aggregate model overestimates the return to further advertising at the margin.

This matters for budget optimization: optima derived from aggregate models may differ from those that maximize individual-level expected utility.

Cross-Sectional Heterogeneity and Random Coefficients

When varies across individuals (heterogeneous response), the aggregate model picks up an average effect. The random coefficients model (see Functional Forms in Marketing) explicitly models this heterogeneity:

where . The aggregate-level regression recovers but conceals the distribution of individual effects, which matters for targeted marketing decisions.

Temporal vs. Cross-Sectional Aggregation

TypeMechanismKey Bias
Temporal (weekly → monthly)Carryover effects compressedRetention rate inflated (Clarke 1976)
Cross-sectional (store → market)Heterogeneous responses averagedElasticity biased toward mean
Functional-formNonlinear aggregatedSecond-order correction term

See Carryover Effects and Distributed Lags for temporal aggregation corrections.