Genetic Analysis of Complex Disease. Группа авторов
Чтение книги онлайн.
Читать онлайн книгу Genetic Analysis of Complex Disease - Группа авторов страница 22
Introduction
Genetics is believed to contribute to the etiology of almost every human trait and condition. Even for infectious diseases that have been traditionally described as environmental, such as tuberculosis and HIV, genetic factors have been implicated either in the susceptibility to infection or in the severity of the condition (Bellamy 1998; Hill 1999; Bellamy et al. 2000; Gonzalez et al. 2001; Shields and Dell 2001). Understanding the role of genetics in disease etiology can allow development of successful therapies that improve the quality of life for affected individuals and their families. However, before embarking on expensive and labor‐intensive studies to identify the genetic factors involved in a particular condition, one should evaluate the evidence that genes contribute to that trait or condition.
For Mendelian disorders, such as sickle cell anemia, cystic fibrosis, and Duchenne muscular dystrophy, establishing a role for genetics is straightforward. These conditions have predictable, recognizable inheritance patterns, and their primary etiologies can be attributed to variations in single genes. The phenotypic expression of the condition is highly correlated with the genotype at the disease locus. For example, all individuals who carry two copies of the Hb S variant of the β‐globin gene are affected with sickle cell anemia (Neel 1949). While there is variation in the severity of the symptoms, all individuals who are homozygous for Hb S will exhibit some symptoms of the disorder. In contrast, complex disorders, such as cardiovascular disease, cancers, and psychiatric disorders, usually do not display distinct inheritance patterns. Such disorders have a significant genetic component but are caused by an intricate web of genetic and environmental interactions. As a result, establishing a role for genetics in the etiology of complex disorders can be more difficult. While the genotype at a single locus may contribute to the susceptibility to develop a condition, it is expected that other loci are involved, as well. That is, multiple loci may be interacting with each other or the environment to contribute to disease. Moreover, different loci may be contributing to disease susceptibility in different families. A classic example of a complex disease susceptibility gene is the association between the APOE locus and Alzheimer disease. Corder et al. (1993) demonstrated that individuals who carry the four allele of the APOE locus have a higher risk and earlier age of onset for late onset Alzheimer disease when compared with individuals who do not carry the four allele. Furthermore, this association is dose‐dependent. Individuals who have two copies of APOE‐4 are at greater risk for the condition than individuals who carry one copy of APOE‐4. Some Alzheimer families carry the APOE‐4 genotype and some do not. Thus, the APOE gene alone does not explain the etiology of the disease. In addition to the diminished correlation between genotype and phenotype, due to the high frequency of complex diseases in the general population, often one cannot be certain that two individuals within a family have developed the condition as a result of the same genetic liability or genetic heritability. For example, the family may exhibit bilineality (family history is present on both the maternal and paternal side). Consequently, each side of the family may contribute different genetic susceptibilities to the condition. Even in the absence of bilineality, there are complications. Some individuals in the family may express the condition as a result of exposure to an environmental insult. Therefore, it can be quite challenging to determine which family members carry the same genetic susceptibilities. For all these reasons, it is more difficult to establish a genetic basis for a complex disease than for a Mendelian disease.
However, even in the presence of such complexities, there are methods available to evaluate whether or not genetics plays an important role in the disease etiology. Those methods are the primary topic of this chapter and should be explored prior to embarking on more elaborate analyses such as genome‐wide association analyses or linkage analyses. Importantly, before considering any analysis, one must first consider study design.
Study Design
The study design process includes all clinical, molecular, bioinformatics, and analytic components of your study. A study population must be defined, as well as the process for selecting individuals from that population for study participation (ascertainment) and criteria for identifying individuals exhibiting the trait of choice (phenotype definition). One must determine the molecular technology to be used (e.g. genome‐wide association study versus rare variant gene burden testing) as well as establish the electronic process for storing and retrieving the clinical and molecular data, and the analysis methods to be used. Finally, one must ensure all ethical, legal, and social issues are addressed. Each component in the study design process should be given careful consideration, as the decisions one makes at each step will impact the conclusions that can be drawn from the results. Furthermore, many of the decisions in the study design process are inter‐related. For example, the ascertainment process will impact the analysis methods that can be performed. Since many of the steps in the study design process are covered in detail in other chapters of the book, they will not be discussed further here. Only the process of selecting a study population and an ascertainment scheme will be covered in this chapter and expanded upon in Chapter 4.
Selecting a Study Population
In a well‐designed study, the study population is clearly defined and individuals from the study population are systematically selected for participation. The two primary types of study population are population‐based and clinic‐based. A population‐based study is the gold standard for study design, but a clinic‐based population is a more realistic design for many genetic studies.
Population‐Based
Ideally, one will select a study population that is an unambiguous subset of a larger underlying population. For example, this could include individuals selected from a newborn state screening registry (e.g. determining the frequency of congenital hypothyroidism in individuals from the California Newborn Screening Program (Waller et al. 2000)), children who attend a particular school system (e.g. screening for FRAXA and FRAXE in a special needs population (Meadows et al. 1996)), and individuals in a local cancer registry (e.g. examination of the involvement of genetics to fallopian tube cancer in patients identified from the Ontario Cancer Registry (Aziz et al. 2001)). This epidemiologic approach to selecting a study population is much less susceptible to ascertainment bias than a clinic‐based study population, and conclusions made from analysis are usually extendable to the larger underlying population from which the study population was drawn. The disadvantage with this approach is that it may be difficult to achieve the desired patient sample size for rare disorders, and even for common disorders, it can be quite laborious to identify eligible participants. For example, for a condition with a frequency of 1/1 000, one would need to screen 100 000 individuals to identify even 100 cases, a rather small sample size.
Clinic‐Based
In reality, most genetic studies rely on clinic‐based study populations because they provide the investigators with faster access to patients. There are numerous examples of genetic studies relying on this design. Generally, patients are selected for participation from existing patient populations, such as a specialty clinic (e.g. examining the phenotypic heterogeneity of age‐related macular degeneration in patients selected from ophthalmology clinics at two academic centers (De La Paz et al. 1997)). Less effort is required for the initial identification of the patients in this approach compared with the population‐based approach. However, the disadvantage of a clinic‐based study population is that it can be quite challenging to determine the larger underlying population from which a clinic population was drawn. Identifying appropriate control subjects for clinic‐based studies can be difficult for this reason, as discussed later in this chapter. Most large, academic medical centers attract patients not only from the immediate geographic region, but often from all over the world. A further complication of clinic‐based samples is that patients ascertained in