The method of moments
The method of moments before 1982
One-dimensional parameter
Consider an example where
If we focus on the first moment
You could have chosen to use other moments, for example, the second moment:
Observe that one moment was needed to form an estimator for a one-dimensional parameter.
Two-dimensional parameters
Normal distribution with unknown mean and variance
Consider an example where
Applying the idea behind the method of moments, we have
Best linear predictor
I will first introduce the idea of a best linear predictor, which may be thought of as an extension of best prediction I discussed when we were looking into unbiasedness.
Suppose we have two random variables
Consider a prediction rule of the form
As a result, we have
Therefore, the optimal prediction rule under a minimum MSE criterion is
To use this optimal prediction rule in practice, we need to estimate
Replace the population mean with the sample mean in order to obtain
These quantities may be familiar to students who have studied finding the “line of best fit”.
Observe again that you need at least two
The setup for the method of moments
All of our examples featured equations of the form
The function
The sample counterparts of the moment conditions are given by
Why use the method of moments if maximum likelihood is available?
Sometimes it is easier to compute method of moments estimators than maximum likelihood estimators. They also seem to be natural estimators because sometimes there is a tangible link between the moments of a random variable and a parameter is available.
More importantly, the method of moments does not exploit all the information stemming from the full distribution of the data. This may feel strange, especially in the example where we have IID random variables from
Finally, the method of moments estimator can be used as an initial starting point when maximizing the log-likelihood. Recall that the approximate quadraticity of the log-likelihood allows us to write
The method of moments: key idea as to why it works
We focus on the case where
In a similar spirit as what we did with the score function, we take a first-order Taylor series approximation of
If you can establish that
Computation
Some programming is also needed to use the method of moments, especially if you have to use it from scratch. Just like what we did in maximum likelihood, you have to set up the moment equations and then simultaneously solve for the roots of those equations. For one-dimensional problems, the function uniroot()
could be used. But the construction of standard errors is not automatic! For multi-dimensional problems, the package nleqslv
may be useful. I have used it to calculate rough estimates by method of moments, but I have not used it to construct standard errors. Finally, there is a whole framework proposed by the R package momentfit
meant to have something similar to an automatic procedure to generate estiimates, standard errors, construct confidence intervals, and test claims.
The method of moments after 1982
Economic models typically have parameters of interest and can be tied to moment restrictions, especially when one accounts for uncertainty. Just think back to simpler economic models without uncertainty. For example, the first-order condition for cost minimization ties parameters involving the production technology to observable quantities such as labor, capital, wages, and the interest rate. In more complicated models, some version of marginal cost being equal to marginal benefit holds on average. Economic models also generate a lot of moment restrictions which far exceed the dimension of the parameters.
Hansen (1982) shows how to actually exploit extra moment restrictions available. The idea is to put weights on the moments and allow for correlation between moment restrictions! The underlying idea can be understood in terms of our IID
What I described leads to the generalized method of moments (GMM) estimator. At least in the IID
If you want to learn more about the discovery and how it relates to economics and finance, read the background information provided by the Nobel Prize website.