Handbook of Regression Analysis With Applications in R. Samprit Chatterjee

Чтение книги онлайн.

Читать онлайн книгу Handbook of Regression Analysis With Applications in R - Samprit Chatterjee страница 19

Handbook of Regression Analysis With Applications in R - Samprit  Chatterjee

Скачать книгу

a model is overspecified. A predictor that does not add significantly to model fit should have an estimated slope coefficient that is not significantly different from images, and is thus identified by a small images‐statistic. So, for example, in the analysis of home prices in Section 1.4, the regression output on page 17 suggests removing number of bedrooms, lot size, and property taxes from the model, as all three have insignificant images‐values.

      The images‐tests and images‐test of Section 1.3.3 are special cases of a general formulation that is useful for comparing certain classes of models. It might be the case that a simpler version of a candidate model (a subset model) might be adequate to fit the data. For example, consider taking a sample of college students and determining their college grade point average (images), Scholastic Aptitude Test (SAT) evidence‐based reading and writing score (images), and SAT math score (images). The full regression model to fit to these data is

equation

      Instead of considering reading and math scores separately, we could consider whether images can be predicted by one variable: total SAT score, which is the sum of images and images. This subset model is

equation

      with images. This equality condition is called a linear restriction, because it defines a linear condition on the parameters of the regression model (that is, it only involves additions, subtractions, and equalities of coefficients and constants).

      The question about whether the total SAT score is sufficient to predict grade point average can be stated using a hypothesis test about this linear restriction. As always, the null hypothesis gets the benefit of the doubt; in this case, that is the simpler restricted (subset) model that the sum of images and images is adequate, since it says that only one predictor is needed, rather than two. The alternative hypothesis is the unrestricted full model (with no conditions on images). That is,

equation

      versus

equation

      These hypotheses are tested using a partial images‐test. The images‐statistic has the form

      An alternative form for the images‐test above might make clearer what is going on here:

equation

      That is, if the strength of the fit of the full model (measured by images) isn't much larger than that of the subset model, the images‐statistic is small, and we do not reject the subset model; if, on the other hand, the difference in images values is large (implying that the fit of the full model is noticeably stronger), we do reject the subset model in favor of the full model.

      The

Скачать книгу