Population Genetics. Matthew B. Hamilton
Чтение книги онлайн.
Читать онлайн книгу Population Genetics - Matthew B. Hamilton страница 25
The first example uses observed genotypes for the MN blood group, a single locus in humans that has two alleles (Table 2.4). First, we need to estimate the frequency of the M allele, using the notation that the estimated frequency of M is
(2.5)
(2.6)
Since
Using these allele frequencies allows calculation of the Hardy–Weinberg expected genotype frequency and number of individuals with each genotype, as shown in Table 2.4. In Table 2.4, we can see that the match between the observed and expected is not perfect, but we need some method to ask whether the difference is actually large enough to conclude that Hardy–Weinberg equilibrium does not hold in the sample of 1066 genotypes. Remember that any allele frequency estimate
Box 2.1 DNA profiling
The loci used for human DNA profiling are a general class of DNA sequence marker known as simple tandem repeat (STR), simple sequence repeat (SSR), or microsatellite loci. These loci feature tandemly repeated DNA sequences of one to six base pairs (bp) and often exhibit many alleles per locus and high levels of heterozygosity. Allelic states are simply the number of repeats present at the locus, which can be determined by electrophoresis of polymerase chain reaction (PCR) amplified DNA fragments. STR loci used in human DNA profiling generally exhibit Hardy–Weinberg expected genotype frequencies; there is evidence that the genotypes are selectively “neutral” (e.g. not affected by natural selection), and the loci meet the other assumptions of Hardy–Weinberg. STR loci are employed widely in population genetic studies and in genetic mapping (see reviews by Goldstein and Pollock 1997; McDonald and Potts 1997).
Figure 2.8 The original data for the DNA profile given in Table 2.2 and Problem Box2.1 obtained by capillary electrophoresis. The PCR oligonucleotide primers used to amplify each locus are labeled with a molecule that emits blue, green, or yellow light when exposed to laser light. Thus, the DNA fragments for each locus are identified by their label color as well as their size range in base pairs. Panel A shows a simulation of the DNA profile as it would appear on an electrophoretic gel (+ indicates the anode side). Blue, green, and yellow label the 10 DNA profiling loci, shown here in grayscale. The red DNA fragments are size standards with a known molecular weight used to estimate the size in base pairs of the other DNA fragments in the profile. Panel B shows the DNA profile for all loci and the size standard DNA fragments as a graph of color signal intensity by size of DNA fragment in base pairs. Panel C shows a simpler view of trace data for each label color independently with the individual loci labeled above the trace peaks. A few shorter peaks are visible in the yellow, green, and blue traces of Panel C that are not labeled as loci. These artifacts, called “pull up” peaks, are caused by intense signal from a locus labeled with another color (e.g. the yellow and blue peaks in the location of the green labeled amelogenin locus). A full color version of this figure is available on the textbook website.
This is an example of the DNA sequence found at a microsatellite locus. This sequence is the 24.1 allele from the fibrinogen alpha chain gene, or FGA locus (Genbank accession no. AY749636; see Figure 2.8). The integral repeat is the 4 bp sequence CTTT, and most alleles have sequences that differ by some number of full CTTT repeats. However, there are exceptions where alleles have sequences with partial repeats or stutters in the repeat pattern, for example, the TTTCT and CTC sequences imbedded in the perfect CTTT repeats. In this case, the 24.1 allele is 1 bp longer than the 24‐allele sequence.
GCCCCATAGGTTTTGAACTCACAGATTAAACTGTAACCAAAATAAAATTAGGCATTATTTACAAGCTAGTTT CTTT CTTT CTTT TTTCT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTC CTTC CTTC CTTT CTTC CTTT CTTT TTTGCTGGCA ATTACAGACAAATCAA |
Table 2.4 Expected numbers of each of the three MN blood group genotypes under the null hypotheses of Hardy–Weinberg. Genotype frequencies are based on a sample of 1066 Chukchi individuals, a native people of eastern Siberia (Roychoudhury and Nei 1988).
Frequency of M = |
|||
---|---|---|---|
Genotype | Observed | Expected number of genotypes | Observed – Expected |
MM | 165 |
|
−21.6 |
MN |