Applied Regression Modeling. Iain Pardoe


      So, the probability that the sample mean, $M_Y$, is greater than 291.703 is 0.10.

      1.4.2 Central limit theorem—t‐version

      For example, the following table shows critical values (i.e., horizontal axis values or percentiles) and tail areas for a t‐distribution with 29 degrees of freedom:

      Probabilities (tail areas) and percentiles (critical values) for a t‐distribution with 29 degrees of freedom.

Upper‐tail area               0.1     0.05    0.025   0.01    0.005   0.001
Critical value of $t_{29}$    1.311   1.699   2.045   2.462   2.756   3.396
Two‐tail area                 0.2     0.1     0.05    0.02    0.01    0.002

      Compared with the corresponding table for the normal distribution in Section 1.2, the critical values are slightly larger in this table.
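The comparison with the normal distribution can be checked numerically. The sketch below is an illustration rather than part of the original text: it copies the t‐table values above, computes the matching standard normal critical values with Python's standard library, and recomputes each t tail area by integrating the t density with Simpson's rule. The helper names `t_pdf` and `t_upper_tail` are made up for this example.

```python
import math
from statistics import NormalDist

# Values copied from the t-table above (29 degrees of freedom).
upper_tail_areas = [0.1, 0.05, 0.025, 0.01, 0.005, 0.001]
t29_critical = [1.311, 1.699, 2.045, 2.462, 2.756, 3.396]

# Standard normal critical values for the same upper-tail areas.
z_critical = [NormalDist().inv_cdf(1 - a) for a in upper_tail_areas]


def t_pdf(x, df):
    """Density of a t-distribution with df degrees of freedom."""
    log_const = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                 - 0.5 * math.log(df * math.pi))
    return math.exp(log_const - (df + 1) / 2 * math.log1p(x * x / df))


def t_upper_tail(x, df, steps=2000):
    """Upper-tail area beyond x >= 0, using Simpson's rule on [0, x]."""
    h = x / steps  # steps must be even for Simpson's rule
    total = t_pdf(0, df) + t_pdf(x, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3


for area, t, z in zip(upper_tail_areas, t29_critical, z_critical):
    print(f"area {area:<6} t_29 {t:.3f}  normal {z:.3f}  "
          f"recomputed t_29 tail {t_upper_tail(t, 29):.4f}")
```

Each t critical value should come out larger than its normal counterpart, and each recomputed tail area should agree with the tabulated one up to the rounding of the critical values.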

      We will use the t‐distribution from this point on because it will allow us to use an estimate of the population standard deviation (rather than having to assume this value). A reasonable estimate to use is the sample standard deviation, $s_Y$. Since we will be using an estimate of the population standard deviation, we will be a little less certain about our probability calculations, which is why the t‐distribution needs to be a little more spread out than the normal distribution, to adjust for this extra uncertainty. This extra uncertainty will be of particular concern when we are not too sure if our sample standard deviation is a good estimate of the population standard deviation (i.e., in small samples). So, it makes sense that the degrees of freedom increase as the sample size increases. In this particular application, we will use the t‐distribution with $n-1$ degrees of freedom in place of a standard normal distribution in the following t‐version of the central limit theorem.

      Suppose that a random sample of $n$ data values, represented by $Y_1, Y_2, \ldots, Y_n$, comes from a population that has a mean of $\mathrm{E}(Y)$. Imagine taking a large number of random samples of $n$ data values and calculating the mean and standard deviation for each sample. As before, we will let $M_Y$ represent the imagined list of repeated sample means, and similarly, we will let $S_Y$ represent the imagined list of repeated sample standard deviations. Define

$$
t = \frac{M_Y - \mathrm{E}(Y)}{S_Y/\sqrt{n}}.
$$

      Under very general conditions, $t$ has an approximate t‐distribution with $n-1$ degrees of freedom. The two differences from the normal version of the central limit theorem that we used before are that the repeated sample standard deviations, $S_Y$, replace an assumed population standard deviation, $\mathrm{SD}(Y)$, and that the resulting sampling distribution is a t‐distribution (not a normal distribution).
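The "imagined list" of repeated samples can be mimicked by simulation. The following is a minimal sketch under stated assumptions, not the book's own code: it draws repeated samples of 30 values from a normal population with a mean of 280 and a standard deviation of 50 (both chosen here purely for illustration), computes the t‐statistic for each sample, and reports how often it exceeds 1.311, the tabulated critical value for an upper‐tail area of 0.10 with 29 degrees of freedom.

```python
import math
import random

random.seed(1)

n = 30                 # sample size, so the t-distribution has n - 1 = 29 df
pop_mean = 280.0       # ASSUMED population mean (illustrative)
pop_sd = 50.0          # ASSUMED population sd, used only to generate data
replications = 10000   # number of imagined repeated samples

t_values = []
for _ in range(replications):
    sample = [random.gauss(pop_mean, pop_sd) for _ in range(n)]
    m = sum(sample) / n                                  # sample mean
    s = math.sqrt(sum((y - m) ** 2 for y in sample) / (n - 1))  # sample sd
    # The population sd is never used here; the sample sd takes its place.
    t_values.append((m - pop_mean) / (s / math.sqrt(n)))

share_above = sum(t > 1.311 for t in t_values) / replications
print(f"share of t-values above 1.311: {share_above:.3f}")
```

The printed share should be close to 0.10, up to simulation noise. Swapping `random.gauss` for a draw from a skewed population is one way to probe how far the "very general conditions" stretch.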

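$$
\begin{aligned}
\Pr(t_{29} < 1.311) &= 0.90, \\
\Pr\left(\frac{M_Y - \mathrm{E}(Y)}{S_Y/\sqrt{30}} < 1.311\right) &= 0.90, \\
\Pr(M_Y < 292.893) &= 0.90.
\end{aligned}
$$

      Thus, the 90th percentile of the sampling distribution of the sample mean, $M_Y$, is 292.893 (to three decimal places).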

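      Turning this around, what is the probability that the sample mean, $M_Y$, is greater than 292.893?

$$
\Pr(M_Y > 292.893) = \Pr\left(\frac{M_Y - \mathrm{E}(Y)}{S_Y/\sqrt{30}} > 1.311\right) = \Pr(t_{29} > 1.311) = 0.10.
$$

      So, the probability that $M_Y$ is greater than 292.893 is 0.10.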
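The arithmetic behind these two probability statements can be sketched in a few lines. The population mean of 280 and the sample standard deviation of 53.8656 used below are assumptions: neither value appears in this excerpt, and they were chosen because together they reproduce the 292.893 figure. The sample size of 30 follows from the 29 degrees of freedom, and 1.311 comes from the table above.

```python
import math

n = 30                  # sample size implied by 29 degrees of freedom
t_crit = 1.311          # upper-tail area 0.10 for t_29, from the table above
pop_mean = 280.0        # ASSUMED population mean, not stated in this excerpt
sample_sd = 53.8656     # ASSUMED sample standard deviation, not stated here

# Estimated standard deviation of the sample mean.
standard_error = sample_sd / math.sqrt(n)

# Rearranging (sample mean - pop_mean) / standard_error < t_crit
# for the sample mean gives the 90th percentile.
percentile_90 = pop_mean + t_crit * standard_error

# Turning it around: standardize 292.893 and compare with the critical value.
t_stat = (292.893 - pop_mean) / standard_error

print(f"90th percentile of sample means: {percentile_90:.3f}")
print(f"t-statistic for 292.893: {t_stat:.3f}")
```

Under these assumptions the percentile rounds to 292.893, and standardizing 292.893 gives back a t‐statistic of about 1.311, whose upper‐tail area in the table is 0.10.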

      So far, we have focused on the sampling distribution of sample means, $M_Y$, but what we would really like to do is infer what the observed sample mean, $m_Y$, tells us about the population mean, $\mathrm{E}(Y)$. Thus, while the preceding calculations have been useful for building up intuition about sampling distributions and manipulating probability statements, their main purpose has been to prepare the ground for
