DNA Technology and Genomics
DNA Technology and Genomics
- Biologists have developed techniques for artificial manipulation of DNA, cells, and organisms.
Challenges of Studying DNA
- The size of DNA molecules poses a significant challenge.
- Naturally occurring DNA is very long, and specific genes may comprise a small portion (e.g., 1/100,000) of the chromosome.
- There may only be a small difference in the surrounding nucleotides.
Restriction Enzymes
- Restriction enzymes are molecular scissors.
- They recognize DNA sequences 4-6 bp in length and cut DNA molecules at specific locations.
- In nature, these enzymes help protect bacteria and archaea from foreign DNA viral attacks.
- There are many different types of restriction enzymes.
- Each restriction enzyme is very specific and recognizes a short DNA sequence known as a restriction site.
- The DNA is cut at specific sites within the DNA strand.
- Restriction enzymes cut DNA in repeatable ways.
- Restriction enzymes that produce sticky ends are desirable.
- A mathematical problem associated with the use of restriction enzymes: the larger the genome, the more cut sites there will be.
PCR (Polymerase Chain Reaction)
- PCR is used to prepare a large quantity of DNA when only a small amount is present.
- It can quickly generate a large amount of DNA from a small amount.
- A 3-step cycle brings about a chain reaction that produces exponential growth of identical DNA molecules.
- A. The solution containing the piece of DNA is heated to break the H-bonds and separate the DNA into single strands.
- B. DNA primers added to the mixture anneal to the DNA strands.
- C. The heat-stable Taq polymerase adds nucleotides to the primers in the standard 5’à3’ direction synthesizing the target sequence.
- Specificity is a key benefit. If a target segment is identified and a primer is made to it, then only a small amount is necessary to start.
- The primer will only replicate the target segment because this is all they are able to bind to.
- After just a few cycles, a very large amount of the target segment will be identified.
Tandem Repeat DNA
- Short non-coding DNA sequences that repeat in a tandem pattern of one or more nucleotides is called a tandem repeat.
- The STRs can be cut with restriction enzymes.
- Tandem repeats are useful in paternity testing, forensic science, and other forms of DNA profiling.
- Polymorphisms in chromosomal DNA can arise from the presence of these repeats.
- The numbers of repeats varies from person to person and is unique for each individual.
- It serves as a molecular marker that can be used to provide a lot of useful information.
- Using PCR to amplify the DNA at particular sites yields fragments of DNA of varying lengths depending on how many repeated segments are contained within the fragment.
- Variation in the number of these tandem repeats leads to differences between individuals.
Restriction Fragment Analysis
- Treating the DNA with restriction enzymes and then running the samples through a gel enable researchers to produce banding patterns characteristic of the starting molecule and the restriction enzyme(s) used to treat the DNA.
- This analysis can be used to examine bacterial chromosomes for certain genes, settle paternity suits, and to provide evidence to crime scene investigators.
- Analysis of these repeats provides a powerful tool for scientists to analyze differences in the nucleotide sequences of DNA molecules.
Restriction Fragment Length Polymorphisms (RFLPs)
- When non-coding regions of DNA were treated with restriction enzymes and banded, scientists discovered differences in non-coding regions on homologous chromosomes.
- These serve as genetic markers of non-coding DNA that appears near a particular locus in a genome.
- There are many RFLP variants within a population.
- RFLP data is often used in crime investigations because the likelihood that two individuals will have the same banding patterns are vanishingly minuscule at best.
- These are variations within the restriction sites of individuals within a population.
- Variation between 2 individuals occurs in about 1 out of every 1000bp.
- Our genome is about 3.2 billion bp.
- That means 2 individuals will vary by more than 3 million bp.
- These changes disrupt the restriction site and create the unique genetic fingerprint.
Key Differences Between RLFP and STR Analysis
- RFLPs look at differences in nucleotide sequences, while STRs look at repeat segments within the DNA.
- RFLP analysis relies on restriction enzymes.
- STR analysis relies on PCR to amplify and identify the variations within the number of repeat sequences.
Gel Electrophoresis
- Agarose gel separates DNA fragments by size and charge.
- The fibers in the gel separate the fragments; smaller fragments migrate farther than the larger fragments.
- The negatively charged DNA fragments migrate toward the positive pole of the electrophoresis box.
- Treating the DNA with various restriction enzymes and manipulating it in a variety of ways can give a lot of information.
Southern Blot Analysis
- Scientists use a Southern Blot to Analyze RFLPS or Regular DNA.
- Gel electrophoresis and the analysis of restriction fragments is used in crime scene investigations and paternity suits all the time.
- Scientists can use restriction fragment information to determine if a person has a particular disease.
- Scientist can also use RFLP information to determine the likelihood of inheriting a certain genetic disease.
DNA Sequencing
- Sequencing became the biggest drawback when the HGP was moving forward. There wasn’t a fast way to sequence the genes. The initial challenge was to develop new technologies to quickly sequence the DNA.
- Frederick Sanger developed the first method to sequence the genes called the dideoxy chain-termination method.
The Dideoxy Chain-Termination Method
- A set of complementary DNA strands are synthesized from an original DNA strand.
- Each strand starts with the same primer and ends with a modified nucleotide that is fluorescently labeled and lacks a hydroxyl group; it is called a dideoxyribonucleotide.
- The dideoxyribonucleotide terminates the growing DNA strand because it lacks a 3’- OH group.
- In the newly synthesized strands, each nucleotide position along the original sequence is represented by strands ending at that point with the complementary ddNTP.
- Each type of ddNTP, tagged with a distinct fluorescent label, identifies the ending nucleotides of the new strands, ultimately revealing the sequence of the DNA.
3 Main Steps of the Dideoxy Method
- 1. Millions of fragments of DNA to be sequenced are denatured into single strands and incubated in a test tube with a primer that will base pair with the known 3’ end of the template stand, DNA polymerase, A,T,C,&G and the 4 ddNTP’s which are tagged with a specific fluorescent molecule.
- 2. Synthesis of the new strand starts at the 3’ end of the primer and continues until a ddNTP is added at random. This prevents further elongation. Eventually, a labeled set of strands of various lengths is generated
- 3. The labeled strands in the mixture are separated by passage through a polyacrylamide gel in a capillary tube; the shorter the strands move through faster than the larger ones.
- A fluorescent detector can sense the color of each tag as the strands come through. Strands that differ in as little as 1 nucleotide can be distinguished.
- The color of the fluorescent tag on each strand indicates the identity of each nucleotide at its end, and the results can be printed out on a spectrogram and the sequence, which is complementary to the template strand, can then be read from the bottom to top.
Uses of Whole Genome Sequencing
- Whole genome sequencing has a wide variety of uses. The increasing speed of this process and the reduced cost are leading to more uses.
- Diagnosis of disease
- Understanding the genetic basis of some diseases (cancer)
- Epidemiology
- Studying evolution
Transposable Elements
- Transposable elements are known as “jumping genes.”
- They have the ability cut themselves out of a genome and move themselves to another portion of the genome.
DNA as Hereditary Material
- So how do we know that DNA is the hereditary material?
- From the standpoint of the nature of science, the answer to this question is quite interesting.
Determining the Chemical Composition of DNA
- After Morgan determined that genes were on chromosomes, scientists began wondering what they were made of.
- Early on, scientists thought they were made of proteins because they were large and structurally sound.
- Nucleic acids were deemed too simple to carry out many of the complex tasks chromosomes required.
- As scientists continued their experiments with viruses and bacteria, many results were observed that gave support to the notion that DNA was the hereditary material.
- A classic experiment demonstrated the genetic role of DNA.
Frederick Griffith's Experiment
- Studied the bacterium that caused pneumonia—S. pneumonia.
- Worked with two strains: pathogenic (S) and non-pathogenic (R).
- (S) smooth cells produce mucous capsules that protect the bacteria from an organism’s immune system—pathogenic.
- (R) rough cells have no mucous capsule and are attacked by an organism’s immune system—non-pathogenic.
- Griffith mixed heat-killed pathogenic (S) bacteria with living non-pathogenic (R) bacteria, the non-pathogenic (R) bacteria began producing the mucous capsule and became pathogenic (S).
- The new bacteria that arose from the bacteria were somehow transformed into pathogenic S. pneumonia.
- Griffith called this process transformation.
- This experiment did not identify DNA as the transforming factor, but it set the stage for other experiments.
Avery, McCarty, and McCleod
- They worked for about 10 years trying to identify the transforming factor.
- After isolating and purifying numerous macromolecules from the heat killed pathogenic bacteria, they could only get DNA to work.
- The prevailing beliefs about proteins vs. DNA continued to generate skepticism.
The Hershey-Chase Experiment
- In 1952, Alfred Hershey and Martha Chase performed experiments with viruses showing that DNA is genetic material.
- Viruses (aka phages) are DNA or RNA wrapped in a protein.
- E. coli is a bacteria that is often used in experiments.
- Hershey and Chase used the T2 phage because it was generally accepted to be DNA wrapped in protein.
- E. coli was used because it was easily obtainable and was readily attacked by T2.
- They sought to determine whether or it was DNA or protein that was the hereditary material.
- Their experiment demonstrated which part of the T2 entered the E. coli.
- They grew T2 in the presence of radioactive sulfur—proteins contain sulfur, DNA does not.
- Next, they grew the T2 in a separate batch of radioactive phosphorous. The DNA of T2 contains phosphorous—the proteins do not.
- The scientists now had 2 batches of T2, one labeled with radioactive sulfur, and one labeled with radioactive phosphorous.
- These 2 batches were separately incubated with non-radioactive samples of E. coli and analyzed shortly after infection.
- Shortly after infection, the E. coli samples were spun in a blender to knock off loose parts of T2.
- The mixtures were then spun in high-speed centrifuges for a long time to separate out various parts of the mixture.
- At the bottom of the tube was a pellet of E. coli.
- The pellet was examined for radioactivity and radioactive phosphorous was found.
- The supernatant was analyzed and a lot of radioactive sulfur was found, but no radioactive phosphorous.
- This indicates that the DNA got into the E. coli and was in the pellet
- The protein did not get into the bacteria and was left in the supernatant.
- Furthermore, when the bacteria in the pellet were plated on culture medium, they produced more T2 containing radioactive phosphorous.
- They concluded:
- That the virus injects DNA into the E. coli and it is the genetic material that programs the cells to produce new T2 phages.
- The protein stays outside.
- This experiment provided firm evidence that DNA was the hereditary material and not protein.
Mutations
- Mutations result from a change in the DNA sequence.
- They can be caused by errors in DNA replication and/or repair.
- They can also be caused by environmental factors such as chemicals, radiation or pathogens.
- Mutagens cause mutations.
- Mutations are completely random, but some base pairs are more prone to mutating than others.
- Cytosine, for example is prone to mutation when it becomes methylcytosine.
- 5-methylcytosine is the methylated form of cytosine. This can affect expression patterns of associated genes.
- 5-methylcytosine can also lose its amino group and become thymine. This is another form of the cytosine mutation.
- Some mutations are caused by spontaneous chemical reactions.
- For instance, removing the amino group and forming a keto group below converts C to U.
- Some mutations occur when chemicals bind to the DNA.
- For instance, benzopyrene binds to G and when the DNA polymerase reaches the modified G, it can’t read it and randomly inserts one of the four bases resulting in a mutation.
- Mutations occur in the genome and can be silent, can disrupt the function of proteins, and can create new variations of alleles.
- Substitutions, insertions, and deletions are common genetic mutations.
Substitutions
- Substitutions may or may not result in a change in the protein.
- The redundancy of the genetic code may result in a silent mutation.
- This particular mutation changes the amino acid and ultimately the protein.
- This is an example of an SNP.
- This substitution mutation results in a missense mutation.
- This example results in a nonsense mutation and an immediate stop.
- Substitution mutations can produce single nucleotide polymorphisms (SNPs).
- SNPs can also refer to changes in the genetic sequence of non-coding DNA. It is often used in crime scene analysis and other forms of forensic analysis.
Insertions and Deletions
- Insertions and deletions disrupt the reading frame resulting in a missense or nonsense mutation. Some involve a single nucleotide.
- Insertions and deletions can also involve any number of base pairs.
- Insertions and deletions shift the reading frame resulting in a missense or nonsense mutation.
Chromosomal Mutations
- These mutations affect large sections of a chromosome. Includes:
- Inversion
- Deletion
- Duplication
- Insertion
- Translocation
Germline vs. Somatic Mutations
- Germline mutations can result in the formation of new alleles, or birth defects. Germline mutations can be inherited.
- Somatic mutations can result in cancer.
Gene Knockout
- Gene knockout is a technique used to determine the function of a particular gene.
- Knowing the steps is not important, but understanding how the information can be used is.
- There is a library of knockout organisms that can be used as models in research.
CRISPR
- CRISPR is an acronym for Clustered Regularly Interspaced Short Palindromic Repeats.
- It has a wide variety of uses and the potential for misuse.
- Palindromic segments are part of the CRISPR sequence.
- The PAM sequence is required for the Cas9 to cut the DNA within the protospacer sequence. It will make the cut 3 base pairs away from the PAM.
- CrRNA + tracRNA (trans-activating CRISPR RNA) create a scaffold region and a 20-nucleotide guiding region.
- A Single Guide RNA (sgRNA) also contains a 20-nucleotide guiding region and a scaffold region.
- Cas9 differentiates between bacterial DNA (does NOT cut) and viral DNA (cut).
- Ethical issues should be considered regarding CRISPR.