Statistics in Nutrition and Dietetics. Michael Nelson

Чтение книги онлайн.

Читать онлайн книгу Statistics in Nutrition and Dietetics - Michael Nelson страница 22

Statistics in Nutrition and Dietetics - Michael Nelson

Скачать книгу

is the attention paid to confounding factors and bias. Confounding factors are associated with both the exposure and the outcome. Suppose that we were comparing a group of cases who had recently had their first heart attack with a group of controls who had never had a heart attack. Say that we were interested in knowing if higher oily fish consumption was associated with lower rates of heart attack. We could measure oily fish consumption using a food frequency questionnaire that asked about usual diet. Suppose we found that it was lower among the cases. Is this good evidence that higher oily fish consumption is protective against having a heart attack? What if the cases, on average, were 10 years older than the controls, and that younger people tended to have higher levels of oily fish in their diet. This could explain the apparent association of higher oily fish consumption with decreased risk of heart attack. In this case, age would be referred to as a confounding factor. Confounding factors need to be associated with both the exposure and the outcome that we are interested in. We could match our cases with controls of the same age. Alternatively, we could use a statistical approach that took age into account in the analysis. The most common confounding factors – things like age, gender, social class, and education – need to be controlled for when comparing one group with another. Other factors such as smoking, disease status (e.g. diabetes), or body mass index (BMI) may also be taken into account, but these may be explanatory factors or factors in the causal pathway rather than true confounders.

      Bias is a problem associated with measuring instruments or interviewers. Systematic bias occurs when everyone is measured with an instrument that always gives an answer that is too high or too low (like an inaccurate weighing machine). Bias can be constant (every measurement is inaccurate by the same amount) and or proportional (the size of the error is proportional to the size of the measurement, e.g. the more you weigh the greater the inaccuracy in the measurement). Bias is a factor that can affect any study and should be carefully controlled.

      Other types of bias mean that the information that we obtain is influenced by the respondent’s ability to give us accurate information. Subjects who are overweight or obese, for example, or who have higher levels of dietary restraint, tend to under‐report their overall food consumption, especially things like confectionery or foods perceived as ‘fatty’. Subjects who are more health‐conscious may over‐report their fruit and vegetable consumption because they regard these foods as ‘healthy’ and want to make a good impression on the interviewer. In these instances, making comparisons between groups becomes problematic because the amount of bias is related to the type of individual which may in turn be related to their disease risk.

      Dealing with issues such as confounding, residual confounding, factors in the causal pathway, and different types of bias are fully addressed in epidemiological textbooks [9, 10].

      

      First of all, a few definitions are needed:

       Statistic – a numerical observation

       Statistics – numerical facts systematically collected (also the science of the analysis of data)

       Data – what you collect (the word ‘data’ is plural – the singular is ‘datum’ – so we say ‘the data are…’ not ‘the data is…’)

       Results – a summary of your data

      1.7.1 Data Are What You Collect, Results Are What You Report

      No one else is as interested in your data as you are. You must love your data, look after them carefully (think of the cuddly statistician), and cherish each observation. You must make sure that every observation collected is accurate, and that when the data are entered into a spreadsheet, they do not contain any errors. When you have entered all your data, you need to ‘clean’ your data, making sure that there are no rogue values, and that the mean and the distribution of values is roughly what you were expecting. Trapping the errors at this stage is essential. There is nothing worse than spending days or weeks undertaking detailed statistical analysis and preparing tables and figures for a report, only to discover that there are errors in your data set, meaning that you have to go back and do everything all over again.

      TIP

       Allow adequate time in your project to clean the data properly. This means

       Check for values that are outside the range of permitted values.

       Look at the distributions of variables and check for extreme values. Some extreme values may be genuine. Others may be a result of ‘fat finger’ syndrome (like typing an extra zero and ending up with 100 rather than 10 as a data point).

       Understand how to use ‘missing values’ in SPSS. These help you to identify gaps in the data and how to handle them (for example, the difference between ‘I don’t know’, Not Applicable, or missing measurement).

      1.7.2 Never Present Endless Detailed Tables Containing Raw Data

      It is your job as a scientist to summarize data in a coherent form (in tables, graphs, and figures), tell an interesting story about the relationships between the variables you have measured, and interpret the results intelligently for your reader, using appropriate statistical analyses.

      Of course, you need to keep accurate records of observations, and make sure that your data set (spreadsheet) is stored securely and that you have backup copies of everything. Bulking up a report with tables of raw data is bad practice, however. No one will read them.

      Chapter 15 provides lots of examples about how to summarize data to make the presentation of results interesting. It also shows how to present results according to the type of audience you are talking to. If I am presenting new results about the impact of school food on attainment to scientific colleagues, I will include lots of information about the methods that I used to identify my samples, make observations, and analyze the data, as well as details about the findings themselves. My scientific colleagues will need enough information to be confident that my data are unbiased, that I have used the right analytical approaches, and that the results are statistically significant. This is the same basic approach that I will take when I am writing a paper for submission to a peer‐reviewed journal. In contrast, if I am presenting results on the same topic to a group

Скачать книгу