Welcome to the Genome. Michael Yudell
Чтение книги онлайн.
Читать онлайн книгу Welcome to the Genome - Michael Yudell страница 23
Hybridization doesn’t have to be perfect; only 60–70% of the two strands of a helix must match for the two strands to stick together.
Southern created a technique whereby a small piece of an organism’s genome can be arranged by size along an agarose gel. The technique then involves the transfer of DNA from gels onto nitrocellulose membranes. To detect where a specific gene is, a fragment of the gene of interest is labeled with radioactivity and then hybridized (attached) to the DNA on the nitrocellulose. Radioactive molecules hybridizing to the DNA on the filter emit particles that react with photographic film and can therefore be seen as a dark spot on the film. With his blot, Southern solved the problem of finding a genetic needle in a genomic haystack. Techniques were later developed to similarly isolate RNA molecules (jokingly named Northern blotting) and protein molecules (named Western blotting). (27)
Figure 2.4 DNA hybridization occurs when a single strand of a double helix finds a complementary strand to form a new double helix.
Credit: Exhibitions Department, American Museum of Natural History
RESEARCH MILESTONE 5: COPYING DNA
During the 1970s scientists improved upon the Southern blot and other gel electrophoresis methods. Southern’s method required a tremendous amount of DNA and thus a tremendous amount of laboratory labor. It also lacked the precision to see the location of individual bases. To get around this shortcoming, scientists developed methods to amplify or clone (meaning simply to copy) DNA.
Found in bacteria, the small, circular extrachromosomal DNA molecule known as a plasmid often times carries genes that confer antibiotic resistance to bacteria. Under normal conditions a plasmid can facilitate genetic exchange between different bacterial strains: A gene fragment from one bacterium is carried to and inserted into a chromosome by a plasmid. In 1973 scientists discovered that if you biochemically insert a target stretch of DNA into a plasmid and put the plasmid into a bacterial cell, such a cell makes thousands and perhaps millions of copies of the plasmid and hence the attached DNA. (28) This procedure, incorporated into sequencing technology, made it easier to make large amounts of a desired stretch of DNA. There are, however, two serious shortcomings of the use of plasmids. First, bacterial plasmids must be cultured. This is time‐consuming. Second, plasmids can take up only a small piece of DNA efficiently. If the DNA stretch picked up by a plasmid is too large, the plasmid is unable to make accurate copies.
Since the discovery of plasmids or what might be termed bacterial copying machines, other vehicles have been created that can copy larger pieces of DNA. The average limiting size of a plasmid is about 5000 bases. Phages, a specific class of viruses that infect bacteria and can be stably replicated by them, can carry about 15,000 bases; cosmids, an artificial cloning vector with a phage gene, can carry about 35,000 bases; bacterial artificial chromosomes (also known as BACs) can take over 100,000 bases of sequence; and yeast artificial chromosomes (also known as YACs) can take approximately 1,000,000. Although these microbial methods remain an important component of DNA sequencing and were central to the effort to sequence the human genome, they are all arduous ways to copy DNA. BAC‐copied DNA was used in the sequencing of the human genome. (29)
RESEARCH MILESTONE 6: SEQUENCING A VIRAL GENOME
By the 1970s advances in sequencing technology brought biology and genetics to the brink of the genomic revolution. The most important developments in sequencing technology occurred simultaneously in laboratories on opposite sides of the Atlantic. Two groups—biologist Walter Gilbert’s group at Harvard and Frederick Sanger’s group at Cambridge—exploited the chemistry of nucleic acids to come to the same brilliant idea. Unlike Edward Southern’s method, which revealed only the presence of DNA and genes, Gilbert’s and Sanger’s methods revealed the actual sequences of nucleotides along strands of DNA. The two methods had “complementary strengths,” and were used depending on what was to be sequenced. (30) The men shared the Nobel Prize in 1980 for this work. It was Sanger’s second Nobel. (31)
Sanger’s sequencing success rested on several premises. First, he knew that he could take a piece of DNA and synthesize its entire length with DNA polymerase. He was also aware of discoveries that showed that by using a class of nucleotides called chain terminators he could interrupt the synthesis of a DNA chain. These chain terminators come in four forms—terminator G, terminator C, terminator T, and terminator A—and when they were placed in a test tube with a DNA fragment and DNA polymerase and then placed on a gel, Sanger could determine the order of nucleotides in a given DNA fragment. He accomplished this by radioactively labeling the locations where the chain terminators stopped DNA synthesis at one of the four particular nucleotides. (32)
Sanger’s method of labeling fragments of DNA with radioactivity, using gel electrophoresis to separate the fragments, and using X‐ray film to visualize them quickly became commonplace in molecular biology laboratories and is still today the basis for gene sequencing. (33) In 1977, using his own method, Sanger himself accomplished the once unthinkable by completing the sequence of the entire genome of Phi‐X174, a virus that infects E. coli in the human digestive tract. Despite the fact that this virus was just over 5000 base pairs long, it took Sanger’s group years to sequence it. (34) By 2000 the Phi‐X174 genome could be sequenced in just a few hours.
The sequence itself revealed remarkable information about genes and gene structure. Among the most intriguing was the finding that even though there are 5386 nucleotides and nine proteins made from genes in the genome of Phi‐X174, calculations showed that there was not enough DNA to code for the proteins that the Phi‐X174 genome produced. This was confusing to scientists. The larger number of proteins than available DNA in PhiX was accounted for by some stretches of the genes in the PhiX genome coding for two or more different proteins by having one gene embedded in another. (35) This important finding is characteristic of many genomes, including the human genome. (36)
RESEARCH MILESTONE 7: THE ULTIMATE DNA COPYING TOOL
Few scientists have a moment of inspiration like the one that came to Kerry Mullis in 1983. According to Mullis, he was driving along a winding moonlit California mountain road when he thought up “a process that could make unlimited numbers of copies of genes.” As he drove, he designed the polymerase chain reaction (PCR) in his head. (37) PCR would soon become the newest and most advanced gene amplification technique, allowing for millions of copies of selected fragments of DNA to be made without plasmid cloning in as little as an hour, as opposed to the tedious vector‐based cloning that could take weeks or even months.
Mullis and his colleagues reasoned that four things were needed to make DNA: (i) a template (one of the strands of the target sequence from a double helix), (ii) the nucleotides (the basic building blocks of DNA—G, A, T, and C), (iii) primers (short single strands of DNA designed to find their base pair complements), and (iv) an enzyme, a DNA polymerase. They also recognized the key from previous work on DNA replication—that in order to replicate a specific region of DNA in a genome you would need to have two primers, one for each strand to be read in opposite directions. The distance between these primers would define the length of the sequence that this new method would amplify. (38)
With these basic tools and a simple but ingenious algorithm, Mullis created the three‐stage polymerase chain reaction. In the first step the temperature of DNA is raised to above 95–97 °C, a temperature at which the strands of a double helix come apart. Second, the temperature is lowered to 45–65 °C, which forces the primers to anneal or stick to the target region of DNA. Finally, at a temperature conducive to the DNA polymerase, the reaction is activated