DNA Replication, Telomeres, Sequencing, PCR, and Forensics — Study Notes 9/2
Telomeres, Telomerase, and the End Replication Problem
Ends of chromosomes are called telomeres. Telomeres are repetitive, G-rich DNA sequences at the ends of linear chromosomes.
End replication problem: DNA polymerases can only extend DNA in the 5' to 3' direction and require a free 3' hydroxyl group to prime synthesis. On the lagging strand, Okazaki fragments are extended 5' to 3' toward the end, but when you reach the end of the chromosome there is no downstream primer to fill the ultimate gap, so the very ends cannot be fully replicated.
Consequence: With each round of replication, linear chromosomes shorten unless a mechanism prevents it.
Telomerase resolves the end-replication problem in germ cells, stem cells, and many cancer cells by adding repeats to the ends of chromosomes so that complete replication can occur.
Telomere repeats in humans: (tandem repeats). Other eukaryotes have similar repeats with species-specific variants.
Telomerase structure: an enzyme that carries its own RNA template (RNA component) and a protein component. It acts as a reverse transcriptase to extend the 3' end using its RNA template.
Mechanism overview:
Telomerase recognizes the 3' end of the telomere (tip of the lagging-strand template).
It elongates the 5' to 3' direction by adding telomere repeats using its RNA template as the primer/template.
The lagging strand is then completed by DNA polymerase alpha (which carries a DNA primase as a subunit).
Nucleases then degrade the small overlap, leaving a short telomere after processing; the essential genetic information is preserved because it is mainly repeats.
Functional significance:
Telomeres protect coding DNA from erosion and prevent chromosome end-to-end fusions.
Telomere length shortens with cell divisions in somatic cells, contributing to aging and senescence if telomere shortening becomes critical.
Most adult somatic cells have low or no telomerase activity and thus limited proliferative capacity.
Cancer cells often reactivate telomerase to maintain telomeres, supporting uncontrolled division.
Telomerase composition and unique features:
Telomerase includes a reverse transcriptase protein component and an RNA template; the enzyme uses its RNA to synthesize DNA de novo at chromosome ends.
The RNA template is not a primer in the conventional sense; rather, it provides the template sequence that directs repeat addition.
Visual summary (conceptual):
At the end of replication, the lagging-strand template ends in a gap due to lack of a downstream primer.
Telomerase binds, extends the 3' end with repeats (via its RNA template), creating a new priming site for DNA polymerase to complete synthesis.
Nucleases trim the overhang; the net effect is preservation of genetic information via telomere repeats.
Practical implications and examples:
In humans, telomere shortening is associated with aging of somatic tissues; telomerase reactivation is a hallmark of many cancers.
Experimental videos exist to illustrate telomere biology and telomerase action, which can help visualization beyond static slides.
DNA Replication: Leading vs Lagging Strands and Key Enzymes
Leading strand: synthesized continuously in the 5' to 3' direction toward the replication fork.
Lagging strand: synthesized discontinuously as Okazaki fragments in the 5' to 3' direction away from the fork.
Required enzymes and steps:
DNA polymerase: synthesizes DNA in the 5' to 3' direction; provides proofreading.
RNase H: removes RNA primers used to start Okazaki fragments.
DNA ligase: ligates Okazaki fragments into a continuous strand.
Telomerase (as above) deals with chromosome ends.
Note on RNA primers and three-prime hydroxyl groups:
RNA primers provide 3' OH groups for DNA synthesis on the lagging strand.
At chromosome ends, there is no available 3' OH to prime DNA synthesis after primer removal, necessitating telomere maintenance.
DNA Polymerase Proofreading and Fidelity
DNA polymerase has proofreading ability (self-editing) via 3' to 5' exonuclease activity to remove mispaired nucleotides.
Fidelity mechanism:
If an incorrect nucleotide is incorporated, base-pairing is unstable, the polymerase stalls, and the wrong nucleotide is excised.
Correct nucleotide is then inserted, continuing replication with higher fidelity.
Why polymerization is in the 5' to 3' direction (integral to proofreading):
Energy for polymerization comes from nucleoside triphosphates (NTPs) with two extra phosphates released as pyrophosphate (PPi).
Hypothetical reversal (3' to 5' synthesis) would remove energy in a way that would prevent efficient proofreading, reducing fidelity.
Energy and chemistry notes:
Insertion step: incoming nucleotide with three phosphate groups participates in forming a phosphodiester bond, releasing PPi.
If proofreading occurs after a reverse-direction extension, the lack of a terminal pyrophosphate barrier would compromise the proofreading energy balance.
Mathematical representation of the polymerization step:
The energy stored in the triphosphate (three phosphates) drives bond formation and provides the basis for error checking.
Gel Electrophoresis and an In-Class Problem on DNA Replication
Gel electrophoresis principle: DNA fragments migrate in an electric field according to size; larger fragments migrate slower (toward the top) and smaller fragments migrate faster (toward the bottom).
In-class scenario (temperature-sensitive DNA ligase in E. coli):
Cells grown at non-permissive temperature show failure to ligate Okazaki fragments on the lagging strand.
Resulting fragments are visible as shorter bands (~200 base pairs) on the gel, corresponding to unligated Okazaki fragments.
Concept check: Okazaki fragments are formed about every ~200 base pairs on the lagging strand and require ligation to form a continuous strand.
DNA Sequencing: From Sanger to Next-Generation Methods
DNA sequencing basics:
Sequencing determines the order of nucleotides in DNA.
Gel-based visualization allows reading the sequence by the length of separated fragments.
Sanger (dideoxy) sequencing (Fred Sanger, Nobel Prize):
Key idea: use dideoxynucleotides (ddNTPs) that lack a 3' OH group, causing chain termination when incorporated.
Reaction setup: four separate tubes, each containing normal deoxynucleotides (dNTPs) and one type of ddNTP (ddATP, ddTTP, ddCTP, ddGTP).
Termination occurs at each occurrence of the corresponding base, producing fragments of varying lengths that end at every occurrence of that base.
Readout: pieces are separated by size on a gel, and the sequence is read from bottom to top. Modern practice uses fluorescent labels for each ddNTP, allowing a single lane to represent all four bases.
Historical note: Sanger sequencing costs decreased dramatically since its inception; the first human genome project required enormous expense, whereas today sequencing is inexpensive and high-throughput.
Next-generation sequencing (NGS, Illumina and others):
Concept: parallel sequencing of many DNA fragments on a chip or flow cell, enabling whole-genome or targeted sequencing rapidly.
Illumina approach (example): reversible terminator chemistry on a flow cell where fluorescently labeled, chain-terminating nucleotides are incorporated and detected; the terminator is then removed to allow the next cycle.
Outcome: massively parallel sequencing with shorter reads but high depth; cost per genome has dropped dramatically since the early 2000s.
Nanopore sequencing (not covered in depth here): a different next-gen technology that reads long DNA molecules by measuring ionic current as DNA passes through a pore; distinct from Illumina-style reversible terminators.
Practical takeaway:
Sanger sequencing remains the gold standard for small-scale, high-accuracy sequencing tasks.
Illumina and other NGS platforms enable fast, large-scale sequencing for whole genomes and large panels of targets.
PCR (Polymerase Chain Reaction): Principles, Thermodynamics, and Applications
Purpose: amplify a specific DNA region to obtain enough material for analysis, cloning, or forensic testing.
Core steps (roughly 35 cycles in typical PCR):
Denaturation: heat to about to separate DNA strands.
Annealing: cool to around to allow primers to hybridize to complementary sequences.
Extension: polymerase (typically Taq polymerase, a thermostable enzyme) extends from primers to synthesize new DNA strands.
Thermostable polymerase (Taq):
Is derived from thermophilic organisms; remains active at high temperatures, enabling repeated heating and cooling cycles.
Relevance and uses:
Forensics: amplify trace DNA from crime scenes to obtain measurable signals on gels or sequencing assays.
Cloning and genetic analysis: amplify genes or regions of interest for downstream experiments.
Primer design for PCR and specificity:
Primers are typically ~20 bases long and must anneal specifically to target regions.
Annealing temperature is tuned to ensure specificity; incorrect annealing can produce spurious products, visible as extra bands on gels.
Short Tandem Repeats (STRs) and Forensic DNA Profiling
STRs are short DNA motifs repeated in tandem (e.g., TCAT, etc.).
Forensic use:
Primers flank STR regions; PCR amplifies the repeats; fragment length reflects the number of repeats.
Variation in repeat number between individuals makes STRs highly discriminative.
Example: amelogenin gene (AMEL) is used to determine sex because it has female (XX) and male (XY) variants with different sizes.
AMEL primers flank a region with different band sizes on X and Y chromosomes when amplified.
Gel readout can show two bands for males (one X-derived, one Y-derived) and one band for females (two X copies).
Forensic STR panel and probability of a random match:
The FBI and other agencies use multiple core STR markers (historically 13 core markers) spread across different chromosomes to maximize discrimination.
The combined probability of a random match can be extremely low, often cited as around for a multi-locus profile.
Practical technique notes:
Accurate STR analysis requires careful optimization of annealing temperatures to avoid non-specific amplification.
Some labs supplement STR analysis with sequencing for higher resolution and specificity.
The Amelogenin Gene and Sex Determination in Forensics
Amelogenin (AMEL) gene has copies on both X and Y chromosomes but with slight size differences between the two, enabling sex determination from DNA.
PCR strategy:
Primers flank the region; PCR amplifies the AMEL region on X and Y; gel or sequencing reveals two bands (XY males) or a single band (XX females).
Forensic implications:
AMEL is commonly used as a quick initial check to determine sex from a DNA sample.
However, AMEL alone cannot uniquely identify individuals; it provides one piece of the DOB (date of birth) of the evidence in context with STR profiling.
Forensic Power and Ethical Considerations
Multi-locus STR profiling offers very high discriminative power:
With multiple markers, the probability of a random match is exceedingly low, enabling strong identification.
Innocence Project and legal/ethical context:
DNA evidence has been used to exonerate wrongfully convicted individuals after re-examining evidence.
DNA testing raises ethical considerations about privacy, data storage, and the potential for misuse or misinterpretation.
Related notes:
Related individuals share similar STR patterns, but overall profiles are highly unique except for identical twins.
Connections to Foundational Principles and Real-World Relevance
Foundational concepts:
DNA replication fidelity depends on correct base pairing and proofreading.
The structure of DNA (leading vs lagging strand, Okazaki fragments) dictates the need for specialized enzymes to maintain genome integrity.
Real-world relevance:
Telomere biology connects to aging and cancer biology, with telomerase activity as a potential therapeutic target in cancer and aging research.
DNA sequencing technologies (Sanger and NGS) revolutionize biology and medicine, enabling personalized genomics, disease diagnosis, and forensic science.
PCR and STR analysis underpin modern forensic genetics and criminal justice, but raise discussions about privacy, consent, and the potential for false positives/negatives if not carefully controlled.
Quick Reference: Key Terms and Core Concepts
Telomere: protective, repetitive ends of linear chromosomes; human repeat =
Telomerase: reverse transcriptase enzyme with RNA template; extends telomeres.
End replication problem: inability to fully replicate chromosome ends without telomere maintenance.
Okazaki fragments: short DNA segments synthesized on the lagging strand; require ligation.
RNase H: removes RNA primers after DNA synthesis.
DNA ligase: seals nicks between Okazaki fragments.
5' to 3' polymerization: direction of DNA synthesis; supports proofreading.
ddNTPs: dideoxynucleotides used in Sanger sequencing to terminate DNA chain elongation.
Sanger sequencing: chain-termination sequencing method; historically foundational for DNA sequencing.
Next-generation sequencing (NGS): high-throughput sequencing technologies enabling whole genomes and large-scale analyses.
PCR: exponential amplification of a target DNA region using thermostable DNA polymerase.
STR (short tandem repeat): variable-number repeats used in forensic profiling.
Amelogenin (AMEL): gene used for sex determination in forensic DNA tests.
Innocence Project: legal initiative using DNA for exonerations; highlights ethical issues in forensic science.
{\text{Note: This set of notes mirrors the topics covered in today’s lecture: telomeres and telomerase; DNA replication; proofreading; gel electrophoresis; sequencing; PCR; STRs; and forensic applications.}