Medical Statistics. David Machin
Чтение книги онлайн.
Читать онлайн книгу Medical Statistics - David Machin страница 35
![Medical Statistics - David Machin Medical Statistics - David Machin](/cover_pre843611.jpg)
Figure 4.10 The Normal probability distribution.
The Normal distribution (Figure 4.10), is completely described by two parameters: one, μ, represents the population mean or centre of the distribution and the other, σ, the population standard deviation. The formula for the Normal distribution is given as Eq. (4.3). Populations with small values of the standard deviation σ have a distribution concentrated close to the centre, μ; those with large standard deviation have a distribution widely spread along the measurement axis (Figure 4.11).
Figure 4.11 Probability distribution functions of the Normal distributions with different means and standard deviations. (a) Effect of changing mean (μ2 > μ1). (b) Effect of changing SD (σ2 > σ1).
There are infinitely many Normal distributions depending on the values of μ and σ. The Standard Normal distribution has a mean of zero and a variance (and standard deviation) of one and a shape as shown in Figure 4.10. The formula is given as Eq. (4.4) in Section 4.9. If the random variable X has a Normal distribution with mean, μ and standard deviation, σ, then the standardised Normal deviate
The areas under the Standard Normal distribution curve have been tabulated in Table T1 in the appendix and some examples in Table 4.1. In column (i), the table gives for a positive value of Z, (that is the number of standard deviations above the mean of zero), the area under the Normal curve to the right of this value. The same value is obtained for the area below the same numerical, but negative, value −Z. Column (ii) gives the combination of these two equal areas. Using Figure 4.12 or Table 4.1, we can note that much of the area (68%) of the probability is between −1 and +1 SD, the large majority (95%) between −2 and +2 SD, and almost all (99%) between −3 and +3.
Table 4.1 Selected probabilities associated with the Normal distribution.
Standardised deviate | Probability of greater deviation | |
---|---|---|
Z = (X − μ)/σ | (i) Area in one direction | (ii) Area both directions |
0 | 0.5000 | 1.0000 |
1.000 | 0.1590 | 0.3170 |
1.645 | 0.0500 | 0.0100 |
1.960 | 0.0250 | 0.0500 |
2.000 | 0.0230 | 0.0460 |
2.576 | 0.0050 | 0.0100 |
3.000 | 0.0013 | 0.0027 |
Figure 4.12 Areas (percentages of total probability) under the standard Normal curve. (a) 31.7% of observations lie outside the mean ± 1 SD. (b) 4.6% of observations lie outside the mean ± 2SD.
As can be seen from Table 4.1, using Z values of 1.96 (that is, 1.96 SD away from the mean) then exactly 95% of the Normal distribution lies between
Changing the multiplier 1.96 to 2.58, exactly 99% of the Normal distribution lies in the corresponding interval.
How Do We Use the Normal Distribution?
The Normal probability distribution can be used to calculate the probability of different values occurring. We could be interested in the probability of being within 1 SD of the mean (or outside it). We can use a Normal distribution table, which tells us the probability of being outside this value.
Illustrative Example – Normal Distribution – Birthweights
Using the birthweight data from the O'Cathain et al. (2002) study let us assume that the birthweight for new born babies has a Normal distribution with a mean of 3.4 kg and a standard deviation of 0.6 kg. So, what is the probability of giving birth to baby with a birthweight of 4.5 kg or higher?
Since birthweight is assumed to follow a Normal distribution, with mean of 3.4 kg and SD of 0.6 kg, we therefore know that approximately 68% of birthweights will lie between 2.8 and 4.0 kg and about 95% of birthweights will lie between 2.2 and 4.6 kg. Using Figure 4.13 we can see that a birthweight of 4.5 kg is between one and two standard deviations away from the mean.
Figure 4.13 Normal distribution curve for birthweight with a mean of 3.4 kg and SD of 0.6 kg.
First