Statistics in Nutrition and Dietetics. Michael Nelson
Чтение книги онлайн.
Читать онлайн книгу Statistics in Nutrition and Dietetics - Michael Nelson страница 22
Bias is a problem associated with measuring instruments or interviewers. Systematic bias occurs when everyone is measured with an instrument that always gives an answer that is too high or too low (like an inaccurate weighing machine). Bias can be constant (every measurement is inaccurate by the same amount) and or proportional (the size of the error is proportional to the size of the measurement, e.g. the more you weigh the greater the inaccuracy in the measurement). Bias is a factor that can affect any study and should be carefully controlled.
Some types of bias may simply reduce our ability to detect associations between exposure and outcome. This is ‘noise in the system’. It means that there may be an association between exposure and outcome, but our data are too ‘noisy’ for us to be able to detect it. For example, we know that there is day‐to‐day and week‐to‐week variation in food and drink consumption. We need to try and collect sufficient information to be able to classify subjects according to their ‘usual’ consumption.
Other types of bias mean that the information that we obtain is influenced by the respondent’s ability to give us accurate information. Subjects who are overweight or obese, for example, or who have higher levels of dietary restraint, tend to under‐report their overall food consumption, especially things like confectionery or foods perceived as ‘fatty’. Subjects who are more health‐conscious may over‐report their fruit and vegetable consumption because they regard these foods as ‘healthy’ and want to make a good impression on the interviewer. In these instances, making comparisons between groups becomes problematic because the amount of bias is related to the type of individual which may in turn be related to their disease risk.
Dealing with issues such as confounding, residual confounding, factors in the causal pathway, and different types of bias are fully addressed in epidemiological textbooks [9, 10].
1.7 DATA, RESULTS, AND PRESENTATION
First of all, a few definitions are needed:
Statistic – a numerical observation
Statistics – numerical facts systematically collected (also the science of the analysis of data)
Data – what you collect (the word ‘data’ is plural – the singular is ‘datum’ – so we say ‘the data are…’ not ‘the data is…’)
Results – a summary of your data
1.7.1 Data Are What You Collect, Results Are What You Report
No one else is as interested in your data as you are. You must love your data, look after them carefully (think of the cuddly statistician), and cherish each observation. You must make sure that every observation collected is accurate, and that when the data are entered into a spreadsheet, they do not contain any errors. When you have entered all your data, you need to ‘clean’ your data, making sure that there are no rogue values, and that the mean and the distribution of values is roughly what you were expecting. Trapping the errors at this stage is essential. There is nothing worse than spending days or weeks undertaking detailed statistical analysis and preparing tables and figures for a report, only to discover that there are errors in your data set, meaning that you have to go back and do everything all over again.
TIP
Allow adequate time in your project to clean the data properly. This means
Check for values that are outside the range of permitted values.
Look at the distributions of variables and check for extreme values. Some extreme values may be genuine. Others may be a result of ‘fat finger’ syndrome (like typing an extra zero and ending up with 100 rather than 10 as a data point).
Understand how to use ‘missing values’ in SPSS. These help you to identify gaps in the data and how to handle them (for example, the difference between ‘I don’t know’, Not Applicable, or missing measurement).
If you find an unusual observation, check it with your supervisor or research colleagues. They may want to inspect your data to see that your observations are correct. Don’t try and hide an unusual observation (or worse still, ignore it, or leave it out of the data set without telling anyone). Always be frank and open about letting others inspect your data, especially if you or they think there may be something wrong. We all make mistakes. It is no great shame if there are some errors in the data that we missed and that someone else helpfully spots for us. Be thick‐skinned about this. The real embarrassment comes if we do lots of analysis and the errors in the data only come to light when we make a presentation of our results.
1.7.2 Never Present Endless Detailed Tables Containing Raw Data
It is your job as a scientist to summarize data in a coherent form (in tables, graphs, and figures), tell an interesting story about the relationships between the variables you have measured, and interpret the results intelligently for your reader, using appropriate statistical analyses.
Of course, you need to keep accurate records of observations, and make sure that your data set (spreadsheet) is stored securely and that you have backup copies of everything. Bulking up a report with tables of raw data is bad practice, however. No one will read them.
Chapter 15 provides lots of examples about how to summarize data to make the presentation of results interesting. It also shows how to present results according to the type of audience you are talking to. If I am presenting new results about the impact of school food on attainment to scientific colleagues, I will include lots of information about the methods that I used to identify my samples, make observations, and analyze the data, as well as details about the findings themselves. My scientific colleagues will need enough information to be confident that my data are unbiased, that I have used the right analytical approaches, and that the results are statistically significant. This is the same basic approach that I will take when I am writing a paper for submission to a peer‐reviewed journal. In contrast, if I am presenting results on the same topic to a group