Practical Issues in Simulation Estimation
Summary
Implementing simulation-based estimators requires attention to several practical concerns: using common random numbers to ensure convergence of the iterative optimizer, variance reduction techniques (antithetic variates, control variates) to reduce Monte Carlo noise, careful selection of the auxiliary model for indirect inference, appropriate step sizes for numerical derivatives, and understanding the trade-off between simulation size and computational cost. This note covers implementation guidance from both Liesenfeld & Breitung (1998) and Oh & Patton (2011).
Common Random Numbers
Common Random Numbers Are Essential for Convergence
At every iteration step of the optimization over , the criterion function is estimated via simulations. For convergence of the iterative algorithm, it is critical to use common random numbers: the same set of simulated random variables is used to generate simulated values for every value of during the optimization.
If new random draws were made at each iteration, the randomness would introduce additional noise into the objective function surface, and the algorithm would fail to converge (Hendry, 1984).
For the reduced form :
- Before optimization: draw and store for
- During optimization: for each candidate , compute using the stored draws
- The criterion function then varies smoothly in (modulo the inherent non-smoothness from indicator functions in EDF-based moments)
Variance Reduction Techniques
The overall variance of simulation-based estimators consists of two components:
- Irreducible component: the variance the estimator would have if based on the exact criterion function
- Monte Carlo sampling variance: additional variance from evaluating the criterion function by simulation
The first component is irreducible; the second can be reduced by increasing or by variance reduction techniques.
Antithetic Variates
Definition: Antithetic Variates
To estimate a quantity by simulations, construct two negatively correlated estimates and such that their average has lower variance than either individual estimate.
Implementation: If the reduced form error term has a symmetric distribution around zero:
- Compute using simulated values
- Compute using (same draws, opposite sign)
The two estimates are negatively correlated, and their average has reduced variance. The additional computing cost is negligible (one extra pass through the model with pre-generated draws).
Control Variates
Definition: Control Variates
The control variate technique uses two components for the final Monte Carlo estimate of a quantity :
- The natural Monte Carlo estimate
- An estimate created from the same set of simulated random numbers as , with known expectation and positive correlation with
The control variate estimate is:
Under suitable conditions, .
Application to indirect inference (Calzolari, Di Iorio, and Fiorentini, 1998): For the parameter-based indirect inference estimator, the control variate adjusts by the difference , where is the auxiliary model estimate from observed data and is estimated from simulated data using as the parameter vector. Monte Carlo experiments show that combining indirect inference with control variates substantially reduces the Monte Carlo sampling variance, especially for continuous-time models.
Auxiliary Model Selection (Indirect Inference)
For Indirect Inference, the choice of auxiliary model determines efficiency. Two approaches:
Strategy 1: Simple, Close Auxiliary Model
Choose a tractable model that captures the salient features of the structural model:
| Structural Model | Natural Auxiliary Model | Rationale |
|---|---|---|
| Stochastic volatility | GARCH | Both capture volatility clustering |
| CIR interest rate | Discrete-time approximation | Both model mean reversion |
| Jump-diffusion | GARCH with fat tails | Both produce leptokurtic returns |
Advantages: Simple to implement, few auxiliary parameters, stable estimation.
Disadvantages: May miss features of the structural model, leading to efficiency loss.
Strategy 2: Data-Dependent SNP Model
Use the SNP auxiliary model, increasing its dimension with sample size:
Advantages: Asymptotically efficient; captures all features of the data.
Disadvantages:
- Over-parameterized SNP models can lead to substantial efficiency loss in small samples
- Choosing the SNP dimension (, , , ) requires model selection (AIC/BIC)
- Computational cost is higher
Over-Parameterization
Andersen, Chung, and Sørensen (1998) find evidence that score generators based on an over-parameterized SNP model lead to a substantial loss of efficiency, especially in smaller samples. Substituting an ARCH-type scale function for the polynomial scale in the SNP model improves efficiency because it directly captures the autocorrelation in variance typical of financial data.
Step Size for Numerical Derivatives
When estimating the asymptotic covariance matrix of the SMM estimator, the Jacobian must be estimated by numerical differentiation (see Proposition 3).
Step-Size Guidelines (Oh & Patton, 2011)
The step size must satisfy:
- (consistency)
- (convergence rate requirement)
Practical rule: For sample size :
Lower bound () Recommended 250 0.063 0.1 1,000 0.032 0.01 – 0.1 5,000 0.014 0.01 – 0.05 Warning: Standard numerical differentiation defaults (e.g., MATLAB’s , or forward-difference ) are catastrophically too small for this application. Using these defaults can produce coverage rates as low as 2% for a nominal 95% confidence interval (see step-size sensitivity results).
Simulation Size ( or ) Trade-offs
The number of simulations affects both efficiency and computation time:
| (or ) | Variance Inflation | When to Use |
|---|---|---|
| GMM variance | Never (too noisy) | |
| Quick preliminary analysis | ||
| Standard practice | ||
| (Oh & Patton) | Recommended for copula SMM | |
| Final results if computation permits | ||
| (= GMM) | Infeasible but useful benchmark |
For the factor copula model, Oh and Patton use . The 4% efficiency loss relative to is negligible compared to the ~20-40% loss from using moments rather than the likelihood.
Small Sample Properties of Indirect Inference
Andersen, Chung, and Sørensen (1998) conduct a comprehensive Monte Carlo study of the EMM estimator for the stochastic volatility model:
- EMM vs. GMM: EMM is generally more efficient than standard GMM
- EMM vs. MLE: Likelihood-based estimators are generally more efficient, but EMM approaches their efficiency as sample size increases
- Key practical finding: Substituting an ARCH-type scale function for the polynomial scale in the SNP model improves efficiency — it more directly captures the autocorrelation in variance implied by the SV model
- Over-parameterized SNP specifications lose efficiency in small samples
Implementation Checklist
For practitioners implementing simulation-based estimation:
- Common random numbers: Draw and fix before optimization begins
- Sufficient simulations: Use (or ) to keep variance inflation below 5%
- Appropriate step size: Set between and for numerical derivatives; never use software defaults
- Bootstrap for : Use iid bootstrap replications
- Weight matrix: Identity matrix is simple and stable; efficient weight matrix gives J-test but may be numerically unstable
- Convergence: Check that the optimizer converges from multiple starting values
- Variance reduction: Consider antithetic variates if the model has symmetric errors
Connections
- Implementation details for Method of Simulated Moments, Indirect Inference, and Efficient Method of Moments
- Step-size guidance directly affects Proposition 3 variance estimation
- Monte Carlo evidence in SMM Copula Simulation and Application validates these practical recommendations
See Also
- Simulation-Based Estimation - Overview — theoretical context
- Method of Simulated Moments — core MSM theory
- SMM Copula Asymptotic Theory — where step-size requirements arise formally
- SMM Copula Simulation and Application — empirical validation
- SMM Weighting Matrix and Inference — step-size guidance for numerical Jacobians in the parameter Σ̂ computation
- SMM Python Implementation — Python code illustrating the eps step-size issue in scipy L-BFGS-B and the common random numbers pattern
- Brock-Mirman Model - SMM Estimation Exercise — structural macro estimation example illustrating common random numbers and R-choice in practice
- SMM Estimation of Factor Copulas — high-dimensional application where R=25T and the step-size guidance is directly applied
Sources
- tdb136.pdf — Liesenfeld & Breitung (1998), Section 6
- Oh_Patton_SMM_copulas_nov11.pdf — Oh & Patton (2011), Sections 2.4, 3
- Hendry, D.F. (1984), “Monte Carlo Experimentation in Econometrics,” Handbook of Econometrics Vol. 2
- Calzolari, G., F. Di Iorio, and G. Fiorentini (1998), “Control Variates for Variance Reduction in Indirect Inference,” The Econometrics Journal, forthcoming