🧪Estimation and inference
Last updated
Last updated
Econometricians build models for various reasons, as outlined on the page What is econometrics?. Up until now, we have discussed random variables and their statistical properties. However, in most cases, we do not know these (exact) properties. For example, you might have data at your disposal and by plotting the data as a histogram, it looks like it's normally distributed. Even though you cannot know the distribution for sure, assume you have convinced yourself that the normal distribution is accurate. In that case, you still do not know the mean and variance , also called distributional parameters, of this distribution. Thus, we have to estimate these parameters.
As another example, suppose that the econometrician hypothesizes that two random variables and are related in the following way:
,
where is the dependent variable, the independent variable and the error term. This model is called the linear regression model, as it assumes that the two random variables are linearly related. The error term takes into account that the relationship between and is not perfectly linear (i.e. if we collect data, it does not lie perfectly on a line), but is rather scattered around a line. The only quantities we did not yet discuss are and , which we call model parameters or model coefficients. They are almost always unknown to the econometrician and thus have to be estimated.
To estimate both types of parameters mentioned above, we can employ estimation methods such as ordinary least squares, maximum likelihood and methods of moments. These estimation methods are derived under specific sets of assumptions, which makes them appropriate in different contexts. During the minor Applied Econometrics, you will learn more about this.
Using our estimates of the parameters and their corresponding standard errors, we can perform inference. This typically consists of performing statistical tests on coefficients. Revisiting the example of the linear regression model, we might want to test the null hypothesis (i.e. there is no relation betweenand ) versus the alternative hypothesis that there is a relationship, i.e. . This is called hypothesis testing.
Statistical tests can be interpreted in different ways. An econometrician typically sets a significance level beforehand (usually or ). To draw conclusions on whether to "reject" or "fail to reject" the null hypothesis, one has to compare the significance level to the p-value of the test. As an alternative to testing, we can also compute confidence intervals (around coefficients), which represents a range of estimates for an unknown parameter.