BLP Demand Estimation - Overview

Summary

The Berry, Levinsohn, and Pakes (1995) model is the workhorse estimator for differentiated-products demand. It specifies a random-coefficients (mixed) logit demand system that allows flexible substitution patterns, addresses price endogeneity via instruments, and can be paired with a supply side (Bertrand-Nash markups) to recover marginal costs. Conlon & Gortmaker (2020) review the modern literature, derive a slightly different formulation amenable to fixed effects and optimal instruments, and collect concrete best practices implemented as defaults in the PyBLP Python package.

Overview

Empirical supply-and-demand models for differentiated products are central to the New Empirical Industrial Organization (NEIO) literature, used for merger evaluation, valuing new goods, and studying two-sided markets. The BLP approach scales to many products and uses both aggregate and disaggregate data.

The model is “simple to understand, challenging to estimate.” At its core it is a nonlinear change of variables from observed market shares to mean utilities . After this change of variables, BLP reduces to a linear IV regression (demand alone) or a two-equation linear IV problem (supply and demand). The difficulty is that the parameters governing the nonlinear change are unknown, producing a non-linear, non-convex GMM optimization with a simulated objective.

The paper organizes its contribution around the major tasks of the BLP estimator: solving the fixed point (share inversion), optimization, numerical integration, instrument construction, and solving counterfactual pricing equilibria. Its headline empirical findings (via Monte Carlo) differ from prior literature: multiple local optima appear rare in well-identified problems, and good finite-sample performance is achievable even in small samples, especially when optimal instruments are used together with supply-side restrictions.

Main Content

The BLP problem at a glance ^blp-pipeline

  1. Demand utility (random-coefficients logit): — see Random Coefficients Logit Model.
  2. Share inversion: invert observed shares to mean utilities, , via the BLP contraction mapping.
  3. Linear index: , with structural error .
  4. GMM moments: using instruments for endogenous price — see GMM Estimation and Instruments for Price Endogeneity.
  5. Optional supply side: Bertrand-Nash FOCs give markups, recover marginal cost , add supply moments — see Supply Side and Markups.
  6. Estimation: nested fixed-point (NFXP) GMM with numerical integration and gradient-based optimization — see Numerical Integration and Optimization in PyBLP.

Parameter partition ^theta-partition

The parameter vector is split into three parts:

  • (): linear demand parameters .
  • (): linear supply parameters .
  • (): the nonlinear parameters — the price coefficient and the parameters governing heterogeneous tastes. These are common to both supply and demand and govern the endogenous objects.

The NFXP algorithm concentrates out the linear and searches only over the nonlinear parameters , so the Hessian is only and large numbers of (linear) fixed effects are “essentially free.”

What is novel in Conlon & Gortmaker (2020) ^novelty

  • A reformulation of the BLP problem (placing on the LHS of the linear system) that supports simultaneous supply and demand with high-dimensional fixed effects and analytic gradients.
  • A characterization of the feasible approximation to optimal instruments (Amemiya 1977 / Chamberlain 1987) that makes exclusion and cross-equation restrictions explicit, paralleling Berry & Haile (2014).
  • Concrete, benchmarked best practices: SQUAREM / Levenberg-Marquardt for the inner loop, Gauss-Hermite product rules (sparse grids / scrambled Halton in high dimensions) for integration, gradient-based optimization with box constraints and tight tolerances, and the log-sum-exp trick for numerical stability.

Examples

A canonical setup is the automobile or ready-to-eat cereal application (e.g., BLP 1995; Nevo 2000b/2001):

  • Markets are model-years or city-quarters; products are car models or cereal brands; the outside good is “buy nothing.”
  • Characteristics : horsepower, fuel economy, size (cars), or sugar/brand dummies (cereal); price is endogenous.
  • A random coefficient on a characteristic (e.g. ) generates realistic substitution: consumers who like big cars substitute toward other big cars when one’s price rises, breaking IIA.
  • The Conlon-Gortmaker Monte Carlo baseline: markets, firms , products per firm , structural errors with , ; demand parameters , ; outside shares .

Connections

See Also