Introduction to Linear Regression Analysis. Douglas C. Montgomery

Чтение книги онлайн.

Читать онлайн книгу Introduction to Linear Regression Analysis - Douglas C. Montgomery страница 32

Introduction to Linear Regression Analysis - Douglas C. Montgomery

Скачать книгу

to imply that a straight line passing through the origin should be fit to the data. A no-intercept regression model often seems appropriate in analyzing data from chemical and other manufacturing processes. For example, the yield of a chemical process is zero when the process operating temperature is zero.

      (2.48) image

      Given n observations (yi, xi), i = 1, 2, …, n, the least-squares function is

ueqn47-1

      The only normal equation is

      (2.49) image

      and the least-squares estimator of the slope is

      The estimator of in47-1 is unbiased for β1, and the fitted regression model is

      (2.51) image

      The estimator of σ2 is

      (2.52) image

      with n − 1 degrees of freedom.

      Making the normality assumption on the errors, we may test hypotheses and construct confidence and prediction intervals for the no-intercept model. The 100(1 − α) percent CI on β1 is

      (2.53) image

      The 100(1 − α) percent prediction interval on a future observation at x = x0, say y0, is

image image

      The scatter diagram sometimes provides guidance in deciding whether or not to fit the no-intercept model. Alternatively we may fit both models and choose between them based on the quality of the fit. If the hypothesis β0 = 0 cannot be rejected in the intercept model, this is an indication that the fit may be improved by using the no-intercept model. The residual mean square is a useful way to compare the quality of fit. The model having the smaller residual mean square is the best fit in the sense that it minimizes the estimate of the variance of y about the regression line.

      Generally R2 is not a good comparative statistic for the two models. For the intercept model we have

ueqn49-1

      Note that R2 indicates the proportion of variability around in49-1 explained by regression. In the no-intercept case the fundamental analysis-of-variance identity (2.32) becomes

ueqn49-2

      so that the no-intercept model analogue for R2 would be

ueqn49-3

Скачать книгу