Applied Biostatistics for the Health Sciences. Richard J. Rossi

Чтение книги онлайн.

Читать онлайн книгу Applied Biostatistics for the Health Sciences - Richard J. Rossi страница 29

Applied Biostatistics for the Health Sciences - Richard J. Rossi

Скачать книгу

style="font-size:15px;">      3 Smoke is a nominal qualitative variable.

      4 No. Cigarettes is a quantitative variable.

      5 Age is a quantitative variable.

      6 No. Physical Activity is a quantitative variable.

      7 Health is an ordinal qualitative variable.

      A quantitative variable can also be classified as either a discrete variable or a continuous variable. A quantitative variable is a discrete variable when it can take on a finite or a countable number of values; a quantitative variable is a continuous variable when it can take on any value in one or more intervals. Note that the values that a discrete variable can take on are distinct, isolated, and can be counted. In the previous example, the variables Age, No. Cigarettes, and No. Physical Activity are discrete variables. A counting variable is a specialized discrete variable that simply counts how many times a particular event has occurred. The values a counting variables can take on are the values 0,1,2,3,…,∞. For example, in the Framingham Heart Study the variables No. Cigarettes and No. Physical Activity are counting variables.

       Example 2.5

      The following variables are all counting variables:

      1 The number of cancer patients in remission following treatment at a hospital.

      2 The number of laboratory mice that survive in an experiment.

      3 The number of white blood cells in a 10 ml blood sample.

       Example 2.6

      The following variables are continuous variables that might be measured on a discrete measurement scale:

      1 Body temperature since it is usually measured in tenths of degrees.

      2 Lung capacity since it is a volume and is usually measured in cubic centimeters.

      3 Tumor size since it is measured as a depth in tenths of centimeters.

      It is important that a variable truly reflects the characteristic being studied. A variable is said to be a valid variable when the measurements on the variable truly represent the characteristic the variable is supposed to be measuring. The validity of a variable depends on the characteristic being measured and the measuring device being used to measure the characteristic. When a characteristic of a unit is subjective in nature, it will be difficult to measure the characteristic accurately, and in this case, the validity of any variables used to measure this subjective characteristic is usually questionable.

       Example 2.7

      The intelligence of an individual is a subjectively measured characteristic. There are many tests that have been developed to measure intelligence. For example, the Fagan test measures the amount of time an infant spends inspecting a new object and compares this time with the time spent inspecting a familiar object (Fagan and Detterman, 1992). The validity of the Fagan test as a measure of intelligence, however, has been questioned by several scientists who have studied the relationship between intelligence and the Fagan test scores.

      Figure 2.1 Different types of classifications for variables.

      2.1.3 Multivariate Data

      A multivariate data set often consists of a mixture of qualitative and quantitative variables. For example, in a biomedical study, several variables that are commonly measured are a subject’s age, race, gender, height, and weight. When data have been collected, the multivariate data set is generally stored in a spreadsheet with the columns containing the data on each variable and the rows of the spreadsheet containing the observations on each subject in the study.

      Figure 2.2 Weight-by-age chart for girls in the NHANES study.

       Example 2.8

      In the article “The validity of self-reported weight in US adults: a population based cross-sectional study” published in BMC Public Health (Villanueva, 2001), the author reported the results of a study on the validity of self-reported weight. The data set used in the study was a multivariate data set with response variable being the difference between the self-reported weight and the actual weight of an individual. The explanatory variables in this study were gender, age, race–ethnicity, highest educational attainment, level of activity, and perception of the individuals’ current weight.

      2.2

Скачать книгу