Applied Regression Modeling. Iain Pardoe





       Optional—technical details of QQ‐plots

      For the purposes of this book, the technical details of QQ‐plots are not too important. For those that are curious, however, a brief description follows. First, calculate a set of \(n\) equally spaced percentiles (quantiles) from a standard normal distribution. For example, if the sample size, \(n\), is 9, then the calculated percentiles would be the 10th, 20th, ..., 90th. Then construct a scatterplot with the \(n\) observed data values ordered from low to high on the vertical axis and the calculated percentiles on the horizontal axis. If the two sets of values are similar (i.e., if the sample values closely follow a normal distribution), then the points will lie roughly along a straight line. To facilitate this assessment, a diagonal line that passes through the first and third quartiles is often added to the plot. The exact details of how a QQ‐plot is drawn can differ depending on the statistical software used (e.g., sometimes the axes are switched or the diagonal line is constructed differently).
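
      The construction described above can be sketched in a few lines of Python. This is a minimal illustration, not the procedure of any particular statistical package: the function name `qq_points` and the nine sample values are hypothetical, and the percentiles are spaced as i/(n + 1) so that a sample of 9 gives the 10th, 20th, ..., 90th percentiles.

```python
from statistics import NormalDist


def qq_points(sample):
    """Return (theoretical quantile, ordered data value) pairs for a normal QQ-plot."""
    n = len(sample)
    ordered = sorted(sample)  # observed values, low to high (vertical axis)
    std_normal = NormalDist(mu=0.0, sigma=1.0)
    # n equally spaced percentiles i/(n+1); for n = 9: 0.10, 0.20, ..., 0.90.
    probs = [i / (n + 1) for i in range(1, n + 1)]
    # Standard normal quantiles for those percentiles (horizontal axis).
    theoretical = [std_normal.inv_cdf(p) for p in probs]
    return list(zip(theoretical, ordered))


# Hypothetical sample of nine values (illustrative only, not the book's data).
sample = [310.0, 255.0, 199.5, 289.9, 340.0, 265.0, 275.0, 230.0, 299.0]
points = qq_points(sample)
for quantile, value in points:
    print(f"{quantile:7.3f}  {value:7.1f}")
```

      Plotting these pairs with the theoretical quantiles on the horizontal axis gives the scatterplot described above; points that fall close to a straight line suggest that the sample is roughly normal.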

      We saw in Section 1.2 that a normal distribution model fits the home prices example reasonably well. However, we can see from Figure 1.1 that a standard normal distribution is inappropriate here, because a standard normal distribution has a mean of 0 and a standard deviation of 1, whereas our sample data have a mean of 278.6033 and a standard deviation of 53.8656. We therefore need to consider more general normal distributions with a mean that can take any value and a standard deviation that can take any positive value (standard deviations cannot be negative).

      Let \(Y\) represent the population values (sale prices in our example) and suppose that \(Y\) is normally distributed with mean (or expected value), \(\mathrm{E}(Y)\), and standard deviation, \(\mathrm{SD}(Y)\). This textbook uses this notation with familiar Roman letters in place of the traditional Greek letters, \(\mu\) (mu) and \(\sigma\) (sigma), which, in the author's experience, are unfamiliar and awkward for many students. We can abbreviate this normal distribution as \(\mathrm{Normal}(\mathrm{E}(Y), \mathrm{SD}(Y)^2)\), where the first number is the mean and the second number is the square of the standard deviation (also known as the variance). Then the population standardized \(Z\)‐value,

\[ Z = \frac{Y - \mathrm{E}(Y)}{\mathrm{SD}(Y)}, \]

      has a standard normal distribution with mean 0 and standard deviation 1. In symbols,

\[ Y \sim \mathrm{Normal}(\mathrm{E}(Y), \mathrm{SD}(Y)^2) \iff Z = \frac{Y - \mathrm{E}(Y)}{\mathrm{SD}(Y)} \sim \mathrm{Normal}(0, 1^2). \]
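
      The standardization result can be checked numerically. The sketch below is illustrative only: it treats the sample mean and standard deviation quoted earlier (278.6033 and 53.8656) as if they were the population values, which is an assumption made purely for the demonstration, and the helper name `standardize` is hypothetical. For each sale price it compares the cumulative probability under the general normal distribution with the cumulative probability of the standardized value under the standard normal distribution.

```python
from statistics import NormalDist


def standardize(y, mean, sd):
    """Convert a value y to a standardized value: subtract the mean, divide by the SD."""
    return (y - mean) / sd


# Assumption for illustration: use the quoted sample statistics as population values.
mean, sd = 278.6033, 53.8656
general = NormalDist(mu=mean, sigma=sd)   # Normal(mean, sd^2)
standard = NormalDist(mu=0.0, sigma=1.0)  # Normal(0, 1^2)

results = []
for y in (200.0, 278.6033, 350.0):
    z = standardize(y, mean, sd)
    # The two cumulative probabilities should agree up to rounding error.
    results.append((y, z, general.cdf(y), standard.cdf(z)))
    print(f"y = {y:8.4f}  z = {z:7.4f}  "
          f"Pr(Y <= y) = {general.cdf(y):.6f}  Pr(Z <= z) = {standard.cdf(z):.6f}")
```

      The two probability columns match for every value, which is what it means for the standardized value to follow a standard normal distribution.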

      We are now ready to make a probability statement for the home prices example. Suppose that we would consider a home as being too expensive to buy if its sale price is higher than $380,000. What is the probability of finding such an expensive home in our housing market? In other words, if we were to randomly select one home from the population of all homes, what is the probability that it has a sale price higher than $380,000? To answer this question, we need to make a number of assumptions. We have already decided that it is probably safe to assume that the population of sale prices (\(Y\)) could be normal, but we do not know the mean, \(\mathrm{E}(Y)\), or the standard deviation, \(\mathrm{SD}(Y)\), of the population of home prices. For now, let us assume that \(\mathrm{E}(Y) = 280\) and \(\mathrm{SD}(Y) = 50\) (fairly close to the sample mean of 278.6033 and sample standard deviation of 53.8656). (We will be able to relax these assumptions later in this chapter.) From the theoretical result above, \(Z = (Y - 280)/50\) has a standard normal distribution with mean 0 and standard deviation 1.

      Next, to find the probability that a randomly selected \(Y\) is greater than 380, we perform some standard algebra on probability statements. In particular, if we write “the probability that \(a\) is bigger than \(b\)” as “\(\Pr(a > b)\),” then we can make changes to \(a\) (such as adding, subtracting, multiplying, and dividing other quantities) as long as we do the same thing to \(b\). It is perhaps easier to see how this works by example:

\[ \Pr(Y > 380) = \Pr\!\left(\frac{Y - 280}{50} > \frac{380 - 280}{50}\right) = \Pr(Z > 2.00). \]
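
      The same calculation can be carried out numerically. The sketch below is a minimal illustration that takes a population mean of 280 and a standard deviation of 50 (in thousands of dollars) as assumed values, standardizes the threshold of 380, and evaluates the upper-tail probability in two ways: through the standardized value and directly from the unstandardized normal distribution. The variable names are hypothetical.

```python
from statistics import NormalDist

# Assumed population values, in thousands of dollars.
mean, sd = 280.0, 50.0
threshold = 380.0

# Apply the same operations to both sides of "Y > 380":
# subtract the mean, then divide by the standard deviation.
z = (threshold - mean) / sd

# Upper-tail probability from the standard normal distribution.
p_via_z = 1.0 - NormalDist().cdf(z)

# The same probability computed directly from Normal(280, 50^2).
p_direct = 1.0 - NormalDist(mu=mean, sigma=sd).cdf(threshold)

print(f"z = {z:.2f}")
print(f"Pr(Z > {z:.2f}) = {p_via_z:.4f}")
print(f"Pr(Y > {threshold:.0f}) = {p_direct:.4f}")
```

      Both routes give a probability of about 0.023, so under these assumptions only around 2 in 100 homes would have a sale price above 380.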
