Practical Statistics for Nursing and Health Care. Jim Fowler

Чтение книги онлайн.

Читать онлайн книгу Practical Statistics for Nursing and Health Care - Jim Fowler страница 11

Автор:
Жанр:
Серия:
Издательство:
Practical Statistics for Nursing and Health Care - Jim Fowler

Скачать книгу

Starting at the top left corner, you may read: 2, 3, 1, 5, 7, 5, 4 …; or 23, 15, 75, 48, …; or 231, 575, 485 …; or 23.1, 57.5, 48.5, 90.1, …; and so on, according to your needs. When you have obtained the numbers you need for your investigation, mark the place in pencil. Next time, carry on where you left off. It is possible that a random number will prescribe a subject (sampling unit) that has already been drawn. In this event, ignore the number and take the next random number. The purpose is to eliminate your prejudice as to which items should be selected for measurement. Unfortunately, observer bias, conscious or unconscious, is notoriously difficult to avoid when gathering data in support of a particular hunch!

      Second, many calculators and statistical software have a facility for generating random numbers. For example, within LibreOffice Calc spreadsheet typing ‘=RAND()’ within a cell and pressing <Enter> generates a random number between 0.0 and 1.0 in the form of a decimal fraction, e.g. 0.2771459. To generate more random decimal fractions use the mouse to drag the lower right corner of the cell containing the results of applying the ‘=RAND()’ function down the required number of rows. Please note that many spreadsheets have an auto‐update function whereby formulae are updated after each calculation. To avoid this copy the column of random numbers you have generated and then go to a new cell, right button click your mouse and select ‘Paste Special’ and tick the ‘Numbers’ box and then ‘OK.’ Once you have fixed the random decimal fractions you may use this to provide a set of integers, 2, 7, 7, 1 by multiplying by 10 and using the first digit only; or 27, 71, 45, … by multiplying by 100; or 277, 145; or 2.7, 7.1; and so on, according to your needs.

      Random sampling is the preferred approach to sampling. Although it does not guarantee that a representative sample is taken from the study population (due to sampling error, described in Section 10.1), it gives a better chance than any other method of achieving this.

      Systematic sampling has similarities with simple random sampling, in that the first subject in the sample is chosen at random and then every subsequent tenth or twentieth patient (for example) is chosen to cover the entire range of the population.

      Example 2.1 Systematic Sampling Interval Calculation

      What interval is required to select a systematic sample of size 20 from a population of 800?

      The required fixed interval is:

      Therefore, the first patient (‘sampling unit’) is selected at random (as described in Section 2.8) from among patients numbered 1–40. Suppose number 23 is selected. The sample then comprises patients 23, 63, 103, 143, …, 783.

      A disadvantage of systematic sampling occurs when the patients are listed in the population in some sort of periodic order, and thus we might inadvertently systematically exclude a subgroup of the population. For example, given a population of 800 patients listed by ‘first attendance’ at the clinic, and that over a 20‐week period, 40 patients registered per week, 20 during the daytime and 20 during the evening surgeries. If these patients were listed in the following order: Week 1 daytime patients, Week 1 evening patients, Week 2 daytime patients, …, Week 10 evening patients, then selecting patients 23, 63, …, 783 would result in a sample of evening clinic patients, and exclude all the daytime patients. It is possible that this could generate a biased, or unrepresentative, sample.

      An argument in favour of systematic sampling occurs when patients are listed in the population in chronological order, say, by date of first attendance at the GP practice. A systematic sample would yield units whose age distribution is more likely to perfectly represent the study population.

      Stratified sampling is effective when the population comprises a number of subgroups (or ‘sub‐populations’) that are thought to have an effect on the data being collected, such as male and female, different age groupings, or ethnic origin. These subgroups are called strata. A stratum (‘layer’) is defined as a collection of individuals (sampling units) that are as alike as possible. For example, the credibility of results from a study of breast cancer would be in doubt if the proportion of premenopausal patients differed between two samples selected for comparison. By defining two strata, namely ‘pre‐menopausal patients’ and ‘not pre‐menopausal patients’, this problem is avoided.

      Quota sampling differs from stratified sampling in that a simple random sample is not chosen from each stratum. Instead, the sample is obtained by using the most accessible patients, as long as they represent the identified subgroups. For example, if we require details relating to 20 women patients with asthma between 30 and 50 years of age, we do not identify all individuals satisfying these criteria in the population in order to take a simple random sample of these. Rather, we simply select the first 20 individuals who present themselves and fulfil these criteria.

      Quota sampling is so called because the number of sampling units (e.g. patients) required in a particular sample is referred to as the quota to be obtained. If making comparisons between different subgroups (e.g. adults and children), the sizes of the sample from each subgroup are usually decided to reflect the proportions in the population. For example, if there are twice as many adults as children in the available population, the quota of adults is twice as large as the children.

      The main problem with quota sampling is that accessible individuals may not be representative of the study population. Patients who attend at their GP practice regularly may be different from those who don't, or who are unable to attend through work or other commitments.

      Cluster sampling involves dividing the population into subgroups called clusters. However, unlike stratified sampling and quota sampling (in which the subjects in a particular stratum or subgroup are meant to be as alike as possible), the objective is to include in each cluster the various characteristics that the population might contain. The rationale for both stratified and quota sampling is the control of factors (e.g. age or sex differences) that are known (or suspected) to confound the response being investigated. In cluster sampling, the idea is not to have a homogeneous group, but one which is representative of the cluster through either a census (100% sample) or, more usually, by taking a representative sample of the cluster.

      Cluster sampling is commonly used when the population covers an area that can be divided by region (e.g. GP practices). A small number of these clusters is selected at random (using simple random sampling). Every subject in the chosen clusters is then included in the sample.

Скачать книгу