Introduction to Linear Regression Analysis. Douglas C. Montgomery



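      To obtain a 100(1 − α) percent CI on E(y|x0), first note that $\hat{\mu}_{y|x_0} = \hat{\beta}_0 + \hat{\beta}_1 x_0$ is a normally distributed random variable because it is a linear combination of the observations yi. The variance of $\hat{\mu}_{y|x_0}$ is

$$\operatorname{Var}(\hat{\mu}_{y|x_0}) = \operatorname{Var}(\hat{\beta}_0 + \hat{\beta}_1 x_0) = \operatorname{Var}\left[\bar{y} + \hat{\beta}_1 (x_0 - \bar{x})\right] = \frac{\sigma^2}{n} + \frac{\sigma^2 (x_0 - \bar{x})^2}{S_{xx}} = \sigma^2 \left[\frac{1}{n} + \frac{(x_0 - \bar{x})^2}{S_{xx}}\right]$$

      since (as noted in Section 2.2.4) $\operatorname{Cov}(\bar{y}, \hat{\beta}_1) = 0$. Thus, the sampling distribution of

$$\frac{\hat{\mu}_{y|x_0} - E(y|x_0)}{\sqrt{MS_{Res}\left(\dfrac{1}{n} + \dfrac{(x_0 - \bar{x})^2}{S_{xx}}\right)}}$$

      is t with n − 2 degrees of freedom. Consequently, a 100(1 − α) percent CI on the mean response at the point x = x0 is

$$\hat{\mu}_{y|x_0} - t_{\alpha/2,\,n-2} \sqrt{MS_{Res}\left(\frac{1}{n} + \frac{(x_0 - \bar{x})^2}{S_{xx}}\right)} \le E(y|x_0) \le \hat{\mu}_{y|x_0} + t_{\alpha/2,\,n-2} \sqrt{MS_{Res}\left(\frac{1}{n} + \frac{(x_0 - \bar{x})^2}{S_{xx}}\right)} \tag{2.43}$$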
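      As a minimal sketch of how the interval on the mean response could be computed, the function below fits the simple linear regression by least squares and returns the limits, that is, the fitted value plus or minus the t critical value times the square root of MS_Res(1/n + (x0 − x̄)²/S_xx). The function name `mean_response_ci`, the toy data, and the choice to pass the t critical value in as an argument (read from a t table, to keep the sketch free of external libraries) are all assumptions made for illustration, not part of the text.

```python
from math import sqrt


def mean_response_ci(x, y, x0, t_crit):
    """Return (lower, fitted, upper) for a CI on E(y|x0).

    t_crit is the upper alpha/2 point of the t distribution with
    n - 2 degrees of freedom, supplied by the caller.
    """
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n

    # Corrected sums of squares and cross products.
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

    # Least-squares slope and intercept.
    b1 = s_xy / s_xx
    b0 = y_bar - b1 * x_bar

    # Residual mean square on n - 2 degrees of freedom.
    ss_res = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
    ms_res = ss_res / (n - 2)

    # Point estimate of the mean response and the interval half-width.
    mu_hat = b0 + b1 * x0
    half = t_crit * sqrt(ms_res * (1 / n + (x0 - x_bar) ** 2 / s_xx))
    return mu_hat - half, mu_hat, mu_hat + half


# Hypothetical toy data, not taken from the book.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]
T_CRIT = 3.182  # t_{0.025, 3} from a standard t table (n - 2 = 3)

for x0 in (3.0, 5.0):
    lower, fitted, upper = mean_response_ci(x, y, x0, T_CRIT)
    print(f"x0 = {x0}: {lower:.3f} <= E(y|x0) <= {upper:.3f}")
```

      With these toy data the interval at x0 = 3 (the mean of x) should come out narrower than the one at x0 = 5.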

      Note that the width of the CI for E(y|x0) is a function of x0. The interval width is a minimum for $x_0 = \bar{x}$ and widens as $|x_0 - \bar{x}|$ increases. Intuitively this is reasonable, as we would expect our best estimates of y to be made at x values near the center of the data and the precision of estimation to deteriorate as we move to the boundary of the x space.
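      The dependence of the width on x0 can be written out explicitly. The symbol W for the interval width is introduced here only for illustration; everything else follows from the variance of the estimated mean response, which is σ²[1/n + (x0 − x̄)²/S_xx].

```latex
W(x_0) = 2\, t_{\alpha/2,\,n-2} \sqrt{MS_{Res}\left(\frac{1}{n} + \frac{(x_0 - \bar{x})^2}{S_{xx}}\right)},
\qquad
W(\bar{x}) = 2\, t_{\alpha/2,\,n-2} \sqrt{\frac{MS_{Res}}{n}},
\qquad
\frac{W(x_0)}{W(\bar{x})} = \sqrt{1 + \frac{n\,(x_0 - \bar{x})^2}{S_{xx}}}
```

      The ratio depends only on n, S_xx, and the distance of x0 from x̄, so the relative widening is the same whatever the confidence level or the value of MS_Res.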

      Example 2.6 The Rocket Propellant Data

      Evaluating Eq. (2.43) for these data at $x_0 = \bar{x}$ gives

$$2086.230 \le E(y \mid \bar{x}) \le 2176.571$$

       TABLE 2.6 Confidence Limits on E(y|x0) for Several Values of x0

Lower Confidence Limit    x0              Upper Confidence Limit
2438.919                  3               2593.821
2341.360                  6               2468.481
2241.104                  9               2345.836
2136.098                  12              2227.942
2086.230                  x̄ = 13.3625     2176.571
2024.318                  15              2116.822
1905.890                  18              2012.351
1782.928                  21              1912.412
1657.395                  24              1815.045
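
      The limits in Table 2.6 can be checked against the behavior described for the interval width. The sketch below uses only the tabulated limits. It assumes that each interval is symmetric about the fitted value (so the midpoints should fall on a straight line) and that the row with the narrowest interval is the one at x0 = x̄; its x0 is left as `None` and recovered from the midpoints. The helper names are hypothetical.

```python
# Rows of Table 2.6 as (lower limit, x0, upper limit).
# x0 is None for the row assumed to sit at x0 = x_bar; it is recovered below.
rows = [
    (2438.919, 3.0, 2593.821),
    (2341.360, 6.0, 2468.481),
    (2241.104, 9.0, 2345.836),
    (2136.098, 12.0, 2227.942),
    (2086.230, None, 2176.571),
    (2024.318, 15.0, 2116.822),
    (1905.890, 18.0, 2012.351),
    (1782.928, 21.0, 1912.412),
    (1657.395, 24.0, 1815.045),
]


def midpoint(row):
    """Center of the interval, i.e. the fitted mean response."""
    return (row[0] + row[2]) / 2


def half_width(row):
    """Half of the interval width."""
    return (row[2] - row[0]) / 2


# Straight line through the midpoints of the first and last rows with known x0.
known = [r for r in rows if r[1] is not None]
first, last = known[0], known[-1]
slope = (midpoint(last) - midpoint(first)) / (last[1] - first[1])
intercept = midpoint(first) - slope * first[1]

# Largest departure of any known midpoint from that line.
max_dev = max(abs(midpoint(r) - (intercept + slope * r[1])) for r in known)

# The narrowest interval should be the one at x_bar; back out its x0.
narrowest = min(rows, key=half_width)
x_bar_est = (midpoint(narrowest) - intercept) / slope

print(f"line through midpoints: {intercept:.2f} + ({slope:.2f}) x")
print(f"largest midpoint deviation from the line: {max_dev:.4f}")
print(f"x0 implied for the narrowest interval: {x_bar_est:.4f}")
for r in rows:
    label = "x_bar" if r[1] is None else f"{r[1]:g}"
    print(f"x0 = {label:>5}  half-width = {half_width(r):.3f}")
```

      The half-widths printed by the loop grow on both sides of the narrowest row, which is the widening with |x0 − x̄| noted above.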

      Equation (2.43) points out that the issue of extrapolation is much more subtle; the further the x value is from the center of the data, the more variable our estimate of E(y|x0). Please note, however, that nothing “magical” occurs at the boundary of the x space. It is not reasonable to think that the prediction is wonderful at the observed data value most remote from the center of the data and completely awful just beyond it. Clearly, Eq. (2.43) points out that we should be concerned about prediction quality as we approach the boundary and that as we move beyond this boundary, the prediction may deteriorate rapidly. Furthermore, the farther we move away from the original region of x space, the more likely it is that equation or model error will play a role in the process.
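
      The point that nothing special happens at the edge of the data can be illustrated numerically. The sketch below evaluates sqrt(1/n + (x0 − x̄)²/S_xx), the standard error of the estimated mean response in units of σ, at points inside, at, and beyond the range of a hypothetical set of x values. The design (x = 1, 2, ..., 10) and the function name `se_factor` are assumptions for illustration only, and the factor reflects sampling variability alone; it says nothing about equation or model error.

```python
from math import sqrt


def se_factor(x0, x):
    """Standard error of the estimated mean response at x0, in units of sigma."""
    n = len(x)
    x_bar = sum(x) / n
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    return sqrt(1 / n + (x0 - x_bar) ** 2 / s_xx)


# Hypothetical design: observed x values run from 1 to 10.
x = [float(i) for i in range(1, 11)]

# Center, interior, boundary, just beyond the boundary, and far beyond it.
for x0 in (5.5, 9.0, 10.0, 10.5, 15.0, 30.0):
    print(f"x0 = {x0:5.1f}  factor = {se_factor(x0, x):.3f}")
```

      The factor changes smoothly across x0 = 10, the largest observed value in this hypothetical design, and keeps growing as x0 moves farther out.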

      This is not the same thing as saying “never extrapolate.” Engineers and economists routinely use prediction equations to forecast a variable of interest one or more time periods in the future. Strictly speaking, this forecast is an extrapolation. Equation (2.43) supports such use of the prediction equation. However, Eq. (2.43) does not support using the regression model to forecast many periods in the future. Generally, the greater the extrapolation, the higher is the chance of equation error or model error impacting the results.

      The probability statement associated with the CI (2.43) holds only when a single CI on the mean response is to be constructed. A procedure for constructing several CIs that, considered jointly, have a specified confidence
