Practical Issues in Simulation Estimation

Summary

Implementing simulation-based estimators requires attention to several practical concerns: using common random numbers to ensure convergence of the iterative optimizer, variance reduction techniques (antithetic variates, control variates) to reduce Monte Carlo noise, careful selection of the auxiliary model for indirect inference, appropriate step sizes for numerical derivatives, and understanding the trade-off between simulation size and computational cost. This note covers implementation guidance from both Liesenfeld & Breitung (1998) and Oh & Patton (2011).

Common Random Numbers

Common Random Numbers Are Essential for Convergence

At every iteration step of the optimization over , the criterion function is estimated via simulations. For convergence of the iterative algorithm, it is critical to use common random numbers: the same set of simulated random variables is used to generate simulated values for every value of during the optimization.

If new random draws were made at each iteration, the randomness would introduce additional noise into the objective function surface, and the algorithm would fail to converge (Hendry, 1984).

For the reduced form :

  1. Before optimization: draw and store for
  2. During optimization: for each candidate , compute using the stored draws
  3. The criterion function then varies smoothly in (modulo the inherent non-smoothness from indicator functions in EDF-based moments)

Variance Reduction Techniques

The overall variance of simulation-based estimators consists of two components:

  1. Irreducible component: the variance the estimator would have if based on the exact criterion function
  2. Monte Carlo sampling variance: additional variance from evaluating the criterion function by simulation

The first component is irreducible; the second can be reduced by increasing or by variance reduction techniques.

Antithetic Variates

Definition: Antithetic Variates

To estimate a quantity by simulations, construct two negatively correlated estimates and such that their average has lower variance than either individual estimate.

Implementation: If the reduced form error term has a symmetric distribution around zero:

  • Compute using simulated values
  • Compute using (same draws, opposite sign)

The two estimates are negatively correlated, and their average has reduced variance. The additional computing cost is negligible (one extra pass through the model with pre-generated draws).

Control Variates

Definition: Control Variates

The control variate technique uses two components for the final Monte Carlo estimate of a quantity :

  1. The natural Monte Carlo estimate
  2. An estimate created from the same set of simulated random numbers as , with known expectation and positive correlation with

The control variate estimate is:

Under suitable conditions, .

Application to indirect inference (Calzolari, Di Iorio, and Fiorentini, 1998): For the parameter-based indirect inference estimator, the control variate adjusts by the difference , where is the auxiliary model estimate from observed data and is estimated from simulated data using as the parameter vector. Monte Carlo experiments show that combining indirect inference with control variates substantially reduces the Monte Carlo sampling variance, especially for continuous-time models.

Auxiliary Model Selection (Indirect Inference)

For Indirect Inference, the choice of auxiliary model determines efficiency. Two approaches:

Strategy 1: Simple, Close Auxiliary Model

Choose a tractable model that captures the salient features of the structural model:

Structural ModelNatural Auxiliary ModelRationale
Stochastic volatilityGARCHBoth capture volatility clustering
CIR interest rateDiscrete-time approximationBoth model mean reversion
Jump-diffusionGARCH with fat tailsBoth produce leptokurtic returns

Advantages: Simple to implement, few auxiliary parameters, stable estimation.

Disadvantages: May miss features of the structural model, leading to efficiency loss.

Strategy 2: Data-Dependent SNP Model

Use the SNP auxiliary model, increasing its dimension with sample size:

Advantages: Asymptotically efficient; captures all features of the data.

Disadvantages:

  • Over-parameterized SNP models can lead to substantial efficiency loss in small samples
  • Choosing the SNP dimension (, , , ) requires model selection (AIC/BIC)
  • Computational cost is higher

Over-Parameterization

Andersen, Chung, and Sørensen (1998) find evidence that score generators based on an over-parameterized SNP model lead to a substantial loss of efficiency, especially in smaller samples. Substituting an ARCH-type scale function for the polynomial scale in the SNP model improves efficiency because it directly captures the autocorrelation in variance typical of financial data.

Step Size for Numerical Derivatives

When estimating the asymptotic covariance matrix of the SMM estimator, the Jacobian must be estimated by numerical differentiation (see Proposition 3).

Step-Size Guidelines (Oh & Patton, 2011)

The step size must satisfy:

  1. (consistency)
  2. (convergence rate requirement)

Practical rule: For sample size :

Lower bound ()Recommended
2500.0630.1
1,0000.0320.01 – 0.1
5,0000.0140.01 – 0.05

Warning: Standard numerical differentiation defaults (e.g., MATLAB’s , or forward-difference ) are catastrophically too small for this application. Using these defaults can produce coverage rates as low as 2% for a nominal 95% confidence interval (see step-size sensitivity results).

Simulation Size ( or ) Trade-offs

The number of simulations affects both efficiency and computation time:

(or )Variance InflationWhen to Use
GMM varianceNever (too noisy)
Quick preliminary analysis
Standard practice
(Oh & Patton)Recommended for copula SMM
Final results if computation permits
(= GMM)Infeasible but useful benchmark

For the factor copula model, Oh and Patton use . The 4% efficiency loss relative to is negligible compared to the ~20-40% loss from using moments rather than the likelihood.

Small Sample Properties of Indirect Inference

Andersen, Chung, and Sørensen (1998) conduct a comprehensive Monte Carlo study of the EMM estimator for the stochastic volatility model:

  • EMM vs. GMM: EMM is generally more efficient than standard GMM
  • EMM vs. MLE: Likelihood-based estimators are generally more efficient, but EMM approaches their efficiency as sample size increases
  • Key practical finding: Substituting an ARCH-type scale function for the polynomial scale in the SNP model improves efficiency — it more directly captures the autocorrelation in variance implied by the SV model
  • Over-parameterized SNP specifications lose efficiency in small samples

Implementation Checklist

For practitioners implementing simulation-based estimation:

  • Common random numbers: Draw and fix before optimization begins
  • Sufficient simulations: Use (or ) to keep variance inflation below 5%
  • Appropriate step size: Set between and for numerical derivatives; never use software defaults
  • Bootstrap for : Use iid bootstrap replications
  • Weight matrix: Identity matrix is simple and stable; efficient weight matrix gives J-test but may be numerically unstable
  • Convergence: Check that the optimizer converges from multiple starting values
  • Variance reduction: Consider antithetic variates if the model has symmetric errors

Connections

See Also

Sources

  • tdb136.pdf — Liesenfeld & Breitung (1998), Section 6
  • Oh_Patton_SMM_copulas_nov11.pdf — Oh & Patton (2011), Sections 2.4, 3
  • Hendry, D.F. (1984), “Monte Carlo Experimentation in Econometrics,” Handbook of Econometrics Vol. 2
  • Calzolari, G., F. Di Iorio, and G. Fiorentini (1998), “Control Variates for Variance Reduction in Indirect Inference,” The Econometrics Journal, forthcoming