DNA Sequencing
Learning Objectives
Describe in detail the process of Dideoxy sequencing, also known as Sanger sequencing
Similarity to PCR but with specific differences
The detail of the four stages of the reaction itself
Size separation of the reaction products by gel electrophoresis
Base-calling by software
Describe uses of Dideoxy sequencing in healthcare and research
Dideoxy Sequencing (Sanger Sequencing): Detailed Explanation
Sanger sequencing, developed by Fred Sanger in 1977, remains a gold standard for DNA sequencing due to its high accuracy and reliability. It is based on the principle of dideoxy chain termination, where DNA synthesis is interrupted by the incorporation of dideoxynucleotide triphosphates (ddNTPs), which lack the 3' hydroxyl group required for elongation.
Similarity to PCR and Key Differences
Similarities:
Both require:
Template DNA to be sequenced.
DNA primers to initiate DNA synthesis.
DNA polymerase to catalyze strand elongation.
dNTPs as building blocks of the new strand.
Involve cycling through temperatures for denaturation, annealing, and extension.
Differences:
Single Primer: Sanger sequencing uses one primer, resulting in linear (not exponential) amplification.
Dideoxynucleotides (ddNTPs): These are chain-terminating nucleotides, unique to Sanger sequencing.
Product Population: Generates a mixed population of DNA fragments, each terminated at different lengths.
Output: Sanger sequencing provides sequence information, not amplified DNA for other applications.
Four Stages of the Dideoxy Sequencing Reaction
1. Reaction Setup:
Components:
Template DNA: Pure, high concentration.
Single DNA primer: Complementary to a known sequence flanking the target.
DNA polymerase.
dNTPs (deoxy nucleotide triphosphates).
ddNTPs (dideoxy nucleotide triphosphates) tagged with unique fluorophores for each base (A, T, G, C).
Mg²⁺ ions and buffer.
Reaction principle: DNA polymerase elongates the strand using dNTPs. Incorporation of a ddNTP halts elongation, creating fragments of varying lengths.
2. Strand Elongation and Termination:
DNA synthesis begins at the primer and proceeds in the 5' to 3' direction.
ddNTPs are randomly incorporated, terminating synthesis at specific bases.
Example:
Sequence: ATGC
Fragments: AT, ATG, ATGC (terminated at T, G, and C due to ddNTPs).
3. Size Separation by Gel Electrophoresis:
The reaction products (DNA fragments) are separated by size using capillary electrophoresis.
DNA is negatively charged and moves through a gel matrix toward the positive electrode.
Smaller fragments migrate faster.
4. Base Calling by Detection of Fluorescence:
A laser excites the fluorophores attached to ddNTPs as fragments pass the detector.
Each ddNTP emits a unique fluorescent signal corresponding to A, T, G, or C.
The sequence is determined by reading the order of fluorescent signals.
Capillary Electrophoresis and Base Calling
Electrophoresis: Separates fragments by size in a capillary filled with gel-like polymer.
Detection: A laser detects fluorescent signals, producing an electropherogram—a graph of fluorescence intensity vs. time.
Sequence Assembly:
Peaks correspond to bases (A, T, G, C).
Software reads the sequence directly from the electropherogram.
Applications of Dideoxy Sequencing
In Healthcare:
Genetic Testing:
Confirms mutations associated with diseases (e.g., cystic fibrosis, sickle cell anemia).
Identifies silent, missense, nonsense, truncating, and splice-site mutations.
HIV Resistance Profiling:
Detects mutations in HIV genomes conferring resistance to antiretrovirals.
Prenatal Diagnostics:
Confirms suspected genetic abnormalities in prenatal samples.
In Research:
Gene Characterization:
Sequencing cloned genes or PCR products to confirm structure and function.
Walking a Gene:
Sequential sequencing using overlapping primers to map unknown DNA regions.
Mutation Studies:
Confirmation of causative variants from genome-wide association studies (GWAS).
Pathogen Identification:
Sequencing bacterial, viral, or fungal genomes for strain typing.
Advantages and Limitations
Advantages:
High accuracy (error rate <0.05%).
Long read lengths (up to 900 bases).
Suitable for small-scale projects and confirmatory tests.
Limitations:
Expensive and time-consuming for large-scale sequencing.
Limited throughput compared to Next-Generation Sequencing (NGS).
Sanger sequencing remains an indispensable tool in molecular biology, particularly for projects requiring high accuracy or as a confirmatory method for NGS findings. Let me know if you'd like further detail on any specific aspect!