BIO121: Detailed Study Notes on Sequencing and Evolutionary Biology Concepts

Sanger Sequencing and Shotgun Sequencing

  • Sanger sequencing (also known as chain termination sequencing or dideoxy sequencing)

    • Involves using four separate tubes, one for each nucleotide (A, T, C, G).

  • Shotgun sequencing

    • A method where a large DNA molecule is fragmented into smaller pieces, which are then sequenced.

    • Each fragment is sequenced, and the pieces are reassembled into a complete sequence.

Illumina Sequencing

  • Also known as sequencing by synthesis (SBS).

  • Utilizes a flow cell instead of multiple tubes for sequencing.

    • A flow cell contains "lawns" of probes that can bind single-stranded pieces of DNA.

    • Allows high-throughput sequencing of millions of DNA molecules simultaneously.

  • Bridge amplification affects the process:

    • Instead of sequencing a molecule once, it sequences it thousands of times.

    • Ensures error rates are significantly reduced (almost zero).

    • Repeated cycles of amplification build up a dense cluster of identical DNA sequences on the flow cell, allowing for better accuracy.

Bridge Amplification Process

  • In a three-dimensional space:

    • Probes designed to bind to DNA are present.

    • A piece of denatured double-stranded DNA binds to the probes, then replicates to form a bridge.

    • This bridge folds, allowing synthesis of a new strand of DNA with more rounds of binding to probes.

    • Continuous synthesis results in a massive number of single DNA molecules attached to probes on the flow cell.

  • Result:

    • DNA molecules are amplified much faster and more effectively than in prior methods (e.g., Sanger).

Sequencing by Synthesis Technique

  • After DNA amplification,

    • The sequencing process involves adding labeled nucleotides (A, T, C, G) with different fluorescent colors.

    • As nucleotides are incorporated into the growing DNA strand, a flash of light indicates which nucleotide was added.

    • Can capture thousands of these flashes concurrently for massive parallel sequencing.

  • Sequencing multiple fragments allow for vastly more comprehensive information than traditional methods.

Advancements of Sequencing Techniques

  • From the original Sanger sequencing:

    • Human genome project took 13 years to sequence the first human genome.

  • With advances like Illumina sequencing,

    • The same task can now be completed in two to three days, drastically reducing time and cost.

  • Further advancements led to nanopore sequencing.

    • Requires no PCR (polymerase chain reaction) and can sequence DNA directly.

    • Processes faster with an estimated sequencing time of about 6 hours for a human genome.

Nanopore Sequencing

  • Processes: Direct read of DNA without amplification.

  • Method:

    • Keep the original strand intact, which allows for immediate sequencing.

    • The error rate is higher than other methods, making it still a developing technology.

  • Size and Accessibility: modern sequencing machines are considerably smaller and more accessible than earlier models.

Bioinformatics and Data Processing

  • Essential for stitching together sequences from millions of fragments to reconstruct a complete genome.

  • The nature of DNA fragmentation:

    • DNA is not chopped uniformly.

    • In typical sequencing applications, hundreds of thousands of copies of DNA fragments are sequenced which include overlaps.

Evolutionary Biology: Homology and Analogy

  • Homology: Similar traits due to shared ancestry. E.g., humans and other primates.

  • Analogy: Similar traits due to convergent evolution; organism adapting to similar environments independently.

Use of BLAST in Phylogenetics

  • An exercise using BLAST involves finding genetic sequences to conduct phylogenetic analyses.

    • Generating a phylogenetic tree can illustrate genetic relationships.

  • Importance of homologous characters in determining phylogeny, such as the presence of collagen in multiple species.

Cladistics and Clade Definition

  • Clade: A group of species that includes an ancestral species and all its descendants.

  • Types of groups in cladistics:

    • Monophyletic: Consists of an ancestor and all its descendants.

    • Paraphyletic: Includes an ancestor and only some of its descendants; excludes unsuccessful descendants (e.g., Neanderthals).

    • Polyphyletic: Includes species from different ancestors without their most recent common ancestor.

Shared Ancestral and Derived Characteristics

  • Shared Ancestral Characteristics: Traits that originated from an ancestor.

  • Shared Derived Characteristics: Traits unique to a clade, resulting from evolutionary changes that address specific environmental challenges.

Phylogenetic Trees and Time Scales

  • Phylogenetic trees represent relationships but do not inherently indicate time scales.

  • Length of branches represents genetic changes and can be adjusted according to fossil data.

Molecular Clocks and Evolution Rates

  • Used to estimate evolutionary time based on constant rates of mutation in some genes.

  • Estimation techniques include:

    • Calibration using fossil data to link genetic variations and historical events.

  • Limitations:

    • Evolution can occur at irregular rates due to various environmental factors and stresses.

Conclusion and Future Directions

  • Continuous advancements in sequencing will lead to quicker, more affordable genetic analysis.

  • Understanding molecular evolution helps provide insights into how genetic traits arise and adapt through time.