Model Testing and Specification

Summary

After estimation, models must be validated: coefficients tested for significance, overall fit assessed, and specification errors detected. This note covers F-tests, t-tests, the RESET general misspecification test, seven types of specification error, and detection procedures.

Significance Tests

F-Test for Model Significance

Tests whether all slope coefficients are jointly zero:

Reject when .

t-Test for Individual Coefficients

Under . Two-sided implies rejection. For advertising elasticities, use one-sided test ().

RESET Test for General Misspecification

Ramsey RESET

The Regression Equation Specification Error Test (Ramsey 1969):

  1. Estimate the original model; save fitted values
  2. Add powers of fitted values as regressors:
  3. Test their joint significance via F-test (Eq 5.40)

Significant RESET statistic indicates general misspecification: wrong functional form, omitted nonlinear terms, or structural breaks. Does not identify the specific misspecification.

Seven Specification Error Types

#Error TypeDetection MethodConsequence for OLS
1Omitted variableCompare with/without; theoryBiased, inconsistent
2Irrelevant variable (over-specification)t-test; BICInefficient but unbiased
3Wrong functional formRESET; Box-Cox; nested testsBiased
4Measurement error in Compare IV vs. OLSAttenuation bias
5Autocorrelation in DW test; Ljung-Box QInefficient; SE wrong
6HeteroscedasticityBreusch-Pagan; White testSE wrong (OLS valid)
7SimultaneityHausman testBiased, inconsistent

Errors 1, 4, and 7 cause inconsistency; errors 2, 5, 6 cause inefficiency but not bias.

Restricted Least Squares

When theory imposes restrictions (e.g., homogeneity, adding-up conditions in share models), restricted OLS imposes these as linear constraints :

The restrictions can be tested via an F-test on the incremental RSS from imposing them.

Nested vs. Non-Nested Tests

  • Nested: test whether a special case (linear) fits as well as the general (ADBUDG) via F-test on restrictions
  • Non-nested: compare log-log vs. linear using Davidson-MacKinnon J-test or AIC/BIC

Detecting Autocorrelation

The Durbin-Watson (DW) statistic tests AR(1) residual autocorrelation:

: no autocorrelation; : positive AR; : negative AR.

For higher-order autocorrelation or models with lagged dependent variables: use the Ljung-Box Q statistic on residuals at multiple lags.

Box-Cox Test for Functional Form

To test linear vs. log-linear form, the Box-Cox transformation (Eq 5 in Ch.5 context):

MLE over with (linear) or (log-linear). Confidence interval on indicates whether the data prefer linear or log transformation.

Related to Box-Cox variance stabilization in ARIMA — see Single Marketing Time Series.