Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP. Bhisham C. Gupta

Чтение книги онлайн.

Читать онлайн книгу Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP - Bhisham C. Gupta страница 53

Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP - Bhisham C. Gupta

Скачать книгу

was focused on only univariate statistics because we were interested in studying a single characteristic of a subject. In all the examples we considered, the variable of interest was either qualitative or quantitative. We now study cases involving two variables; this means examining two characteristics of a subject. The two variables of interest could be either qualitative or quantitative, but here we will consider only variables that are quantitative.

      Example 2.9.1 (Cholesterol level and systolic blood pressure) The cholesterol level and the systolic blood pressure of 10 randomly selected US males in the age group 40–50 years are given in Table 2.1. Construct a scatter plot of this data and determine if there is any association between the cholesterol levels and systolic blood pressures.

      A numerical measure of association between two numerical variables is called the Pearson correlation coefficient, named after the English statistician Karl Pearson (1857–1936). Note that a correlation coefficient does not measure causation. In other words, correlation and causation are different concepts. Causation causes correlation, but not necessarily the converse. The correlation coefficient between two numerical variables in a set of sample data is usually denoted by r, and the correlation coefficient for population data is denoted by the Greek letter images (rho). The correlation coefficient r based on n pairs of images, say images is defined as

      or

      (2.9.2)equation

       Table 2.9.1 Cholesterol levels and systolic BP of 10 randomly selected US males.

Subject 1 2 3 4 5 6 7 8 9 10
Cholesterol (x) 195 180 220 160 200 220 200 183 139 155
Systolic BP (y) 130 128 138 122 140 148 142 127 116 123

Scatterplot of systolic blood pressure versus cholesterol level displaying a positive slope line with 10 scattered circle markers.

      Perfect association means that if we know the value of one variable, then the value of the other variable can be determined without any error. The other special case is when images, which does not mean that there is no association between the two variables, but rather that there is no linear association between the two variables. As a general rule, the linear association is weak, moderate, or strong when the absolute value of images is less than 0.3, between 0.3 and 0.7, or greater than 0.7, respectively. For instance, if (2.9.1) is computed for the data in Table 2.9.1, then images. Hence, we can conclude that the association between the two variables X and Y is strong.

      MINITAB:

      1 Enter the pairs of data in columns C1 and C2. Label the columns X and Y.

      2 From the Menu bar select Graph Scatterplot. This prompts a dialog box to appear on the screen. In this dialog box, select scatterplot With Regression and click OK. This prompts the following dialog box to appear:In this dialog box, under the X and Y variables, enter the columns in which you have placed the data. Use the desired options and click OK. The Scatter plot

Скачать книгу