Regression and the CEF

Summary

The population regression function is the best linear approximation to the conditional expectation function (CEF). This relationship holds regardless of whether the CEF is actually linear, giving regression a robust interpretation even when functional form assumptions fail.

The CEF

The conditional expectation function $E [Y_{i} ∣ X_{i}]$ is:

The best predictor of $Y_{i}$ given $X_{i}$ (minimizes mean squared error)
A function that decomposes any random variable: $Y_{i} = E [Y_{i} ∣ X_{i}] + ε_{i}$ where $ε_{i}$ is uncorrelated with any function of $X_{i}$

Three Justifications for Regression

1. Linear CEF Theorem

If the CEF is linear, then the population regression function equals the CEF.

2. Best Linear Predictor Theorem

The regression function $X_{i}^{'} β$ is the best linear predictor of $Y_{i}$ given $X_{i}$ in the MMSE sense.

3. Regression-CEF Theorem (the key one)

Even when the CEF is nonlinear, regression provides the MMSE linear approximation to it:

β = ar g b min E {(E [Y_{i} ∣ X_{i}] - X_{i}^{'} b)^{2}}

Regression Anatomy

The coefficient on regressor $k$ in a multivariate regression:

β_{k} = \frac{C o v ( Y _{i} , x ~ _{ki} )}{V ( x ~ _{ki} )}

where $\tilde{x}_{ki}$ is the residual from regressing $x_{ki}$ on all other covariates. This is the Frisch-Waugh result: each multivariate coefficient equals the bivariate coefficient after “partialling out” other variables.

Robust Standard Errors

The heteroskedasticity-consistent (robust) covariance matrix:

E [X_{i} X_{i}^{'}]^{- 1} E [X_{i} X_{i}^{'} e_{i}^{2}] E [X_{i} X_{i}^{'}]^{- 1}

Always Use Robust Standard Errors

Since regression approximates a possibly nonlinear CEF, heteroskedasticity is the natural state of affairs. Robust and conventional standard errors that differ by more than 30% may indicate a problem.

Saturated Models

A saturated model has a separate parameter for every possible covariate combination — it fits the CEF perfectly and is inherently linear. Example: with two dummies $x_{1}, x_{2}$ , the saturated model includes both main effects and their interaction.

Second Brain

Explorer

Regression and the Conditional Expectation Function

Regression and the CEF

The CEF

Three Justifications for Regression

1. Linear CEF Theorem

2. Best Linear Predictor Theorem

3. Regression-CEF Theorem (the key one)

Regression Anatomy

Robust Standard Errors

Saturated Models

See Also

Graph View

Table of Contents

Backlinks