DNA Sequencing 3
Third Generation DNA Sequencing
Introduction
Course: MMG2040 2026
Key Focus: Third Generation Sequencing (TGS)
Comparison of Next Generation Sequencing (NGS) Platforms
Features of Different Platforms:
Ion Torrent:
Detection: Measures pH change (H⁺)
Speed: Faster than other platforms
Accuracy: Lower accuracy in repetitive sequences
Equipment: Utilizes a semiconductor chip
Illumina:
Detection: Based on fluorescence
Speed: Slightly slower than Ion Torrent
Accuracy: Very high accuracy
Equipment: Employs an optical imaging system
Third Generation Sequencing (TGS) Overview
Characteristics of TGS:
Single-Molecule Sequencing: Can sequence individual molecules directly without amplification.
No PCR Amplification Required: Avoids biases introduced by the amplification process.
Real-Time Sequencing: Capable of providing results as the sequencing occurs.
Long-Read Capabilities: Can generate reads from 10 kb to over 1 Mb.
Single Strand Sequencing: Only one strand of DNA is needed, eliminating the demand for multiple copies.
Technologies:
Nanopore Detectors: Utilizes nanopore technology for sequencing.
SMRT Sequencing (Single Molecule Real-Time): A key technology used in TGS.
Reasons for Moving Beyond NGS
Limitations of NGS:
Short Reads: Make genome assembly a challenge.
Difficulty Resolving Repeats: Short reads struggle to resolve repetitive regions effectively.
Limited Structural Variant Detection: Short reads are unable to identify larger structural variants.
PCR Bias: Amplification processes introduce biases that can affect data integrity.
Nanopore Sequencing Overview
Process:
DNA or RNA passes through a nanopore protein, which measures changes in ionic current.
Changes in the ionic current are translated into a nucleotide sequence.
Development: Developed by Oxford Nanopore Technologies.
Nanopore Detectors in TGS
Process:
Single-stranded DNA (ssDNA) passes through a nanopore approximately 1 nm in diameter.
Each base of the DNA sequence generates a unique signal change in the electrical current.
Error-Prone: The process is considered to be error-prone.
1X Coverage: Generally provides one coverage depth, which is acceptable for quick analyses or re-sequencing.
How Nanopore Sequencing Works
Mechanism:
A motor protein unwinds the DNA double helix.
The single strand of DNA passes through the nanopore.
Each nucleotide base alters the electrical current, and machine learning algorithms decode these signals into nucleotide sequences.
Advantages and Disadvantages of Nanopore Sequencing
Advantages:
Capable of ultra-long reads (greater than 1 Mb).
Portable devices available (e.g., MinION).
Permits real-time sequencing and the capability for direct RNA sequencing.
Lower capital costs compared to some other sequencing methods.
Disadvantages:
Higher error rates, averaging around 5% to 10%.
Presence of signal noise can complicate data interpretation.
Complexity of data analysis can be a challenge to many users.
Lower throughput compared to Illumina platforms.
PacBio Sequencing Overview
Technology: Single Molecule Real-Time (SMRT) sequencing.
Components:
Utilizes DNA polymerase combined with zero-mode waveguides (ZMWs).
Employs fluorescently labeled nucleotides for detection.
Detailed Features of PacBio Sequencing
Process:
Single Molecule Real-Time (SMRT): Also known as PacBio.
Nanocontainers: ZMWs facilitate the single DNA template fitting, with each container having a diameter of 20 nm.
Detection of Incorporation: Fluorescent deoxynucleotide triphosphates (dNTPs) report the incorporation of nucleotides, which results in a distinctive light flash as the pyrophosphate group is discarded.
Typical Read Length: About 20,000 base pairs (bps). Works effectively for sequencing repetitive sequences.
How PacBio Sequencing Works
Mechanism:
DNA polymerase is anchored in the ZMW and incorporates nucleotides while emitting light.
Each base emits a distinct fluorescence signal that is detected in real-time, enabling continuous monitoring of DNA mutations or variations as they occur.
Advantages and Disadvantages of PacBio Sequencing
Advantages:
Long Read Lengths: Typical reads range from 10 kb to 100 kb.
High Consensus Accuracy: Known as HiFi reads, offering improved accuracy.
No Amplification Bias: Eliminates biases since no PCR is utilized.
Detection of Epigenetic Modifications: Capable of identifying epigenetic changes in DNA.
Disadvantages:
Generally higher costs compared to nanopore sequencing.
Lower throughput compared to Illumina sequencing methodologies.
Requires specialized equipment for operation.
Comparison of PacBio and Other NGS Platforms
Features Comparison:
Read Length: PacBio (very long) vs. Illumina/Ion Torrent (short).
Amplification: PacBio (none) vs. Illumina/Ion Torrent (Yes).
Detection Method: PacBio uses fluorescence (real-time) while Illumina and Ion Torrent use fluorescence (stepwise) and pH, respectively.
Strengths:
Genome Assembly: PacBio excels in assembling complex genomes due to longer reads.
Accuracy: Illumina offers high throughput while PacBio emphasizes accuracy and read length.
Applications of Third Generation Sequencing
Typical applications include:
De Novo Genome Assembly: Constructing genomes from scratch without reference sequences.
Structural Variant Detection: Identifying structural variations within genomes such as insertions, deletions, and other alterations.
Metagenomics: Studying genetic material recovered directly from environmental samples.
Transcript Isoform Analysis: Analyzing different versions of RNA transcripts from a single gene.
Epigenetics: Studying the heritable changes in gene expression that do not involve changes to the underlying DNA sequence.
Comparison of Sanger, NGS, and Third Generation Sequencing
Sanger Sequencing: Known for low throughput but high accuracy with short reads.
Next Generation Sequencing (NGS): Offers high throughput but produces short reads (e.g., Illumina).
Third Generation Sequencing (TGS): Delivers long reads, real-time capabilities, and the ability to work with single molecules directly.
Key Differences Among Sequencing Methods
Read Length:
Shortest: Sanger < NGS < TGS (third Generation Sequencing).
Accuracy:
Highest: Sanger, with improvements observed in TGS methods.
Throughput:
Highest: NGS platforms.
Cost per Base:
Lowest: NGS platforms, typically more affordable than both PacBio and nanopore technologies.
When to Use Each Sequencing Method
Usage Guidelines:
Sanger Sequencing: Best suited for validation of small regions or specific applications requiring very high accuracy.
NGS: Ideal for large-scale sequencing projects, including RNA sequencing (RNA-seq).
Third Generation Sequencing: Highly effective for genome assembly tasks and structural variant analyses due to its long-read capabilities.
Summary of Key Points on Third Generation Sequencing
Evolving technology enabling long-read sequencing capacity.
Nanopore Sequencing: Notable for its portability, flexibility, and ability for real-time results.
PacBio Sequencing: Recognized for its high accuracy and ability to produce long reads.
Serves as a complementary technology to existing NGS platforms, enhancing the completeness and accuracy of genomic projects.
Learning Objectives
After reviewing the content, students should be able to:
Describe fundamental steps in various sequencing technologies: Illumina, Ion Torrent, Nanopore, and SMRT sequencing.
Understand the principles governing third generation sequencing methodologies.
Describe specific technologies, including Nanopore and PacBio, and how they differ from Sanger/NGS methods.
Evaluate applications, advantages, and limitations associated with each sequencing platform.