Week 4 - Recombinant DNA Techniques & DNA Sequencing

Restriction Endonucleases

  • Enzymes purified from bacteria that cut DNA at specific sites; act as molecular scissors to defend against foreign DNA.
  • Type II restriction endonucleases: cleavage site is within or close to the recognition site; most recognition sites are $4$–$8$ nucleotides long and palindromic; require Mg}^{2+}$ for activity; can cut to blunt ends or generate sticky ends.
  • Recognition sites are sequence-specific and often palindromic; EcoRI example creates sticky ends from GAATTC / CTTAAG with 5′ overhangs; cleavage pattern yields complementary, directional ends.
  • Host bacteria protect their own DNA by methylating recognition sites; EcoRI will not cleave methylated DNA (restriction–modification system).
  • Common enzymes include EcoRI, HindIII, PstI, HpaI, HhaI; named after the source organism.

DNA Fragmentation & Cloning Basics

  • Fragment genomic DNA into manageable pieces using restriction enzymes; generates reproducible fragments for analysis.
  • Fragments are ligated into plasmid vectors cut with the same enzymes by DNA ligase to form recombinant DNA.
  • Competent bacteria are transformed with recombinant plasmids; selective antibiotics enable growth of transformed cells.
  • Directional cloning: use different restriction enzymes at each end to control orientation of the insert in the vector.
  • Blue–white screening (LacZ disruption): insert disrupts β-galactosidase; white colonies indicate presence of insert; blue colonies indicate no insert.

Plasmid Vectors & Expression Cassettes

  • Components: promoter, ribosome-binding site (RBS), transcription terminator, multiple cloning site (MCS)/polylinker, selectable antibiotic resistance marker, and optional purification tag.
  • Inducible expression: regulatory elements (repressor/operator) allow controlled expression.
  • Directional cloning via MCS ensures correct orientation for transcription.

Recombinant Protein Expression

  • Expression hosts: Escherichia coli, yeast, insect, or mammalian cells; choice affects yield, folding, and activity.
  • Considerations: promoter strength, ribosome binding, folding chaperones, post-translational modifications, and potential need for purification tags.
  • Applications: production of vaccines/therapeutics (e.g., insulin, HBV/HPV vaccines, antibodies), enzymes, biomaterials, and gene therapy vectors.

Gene Manipulation & Mutagenesis

  • Site-directed mutagenesis: PCR with modified primers to introduce mutations; template DNA is degraded with methylation-dependent endonuclease (e.g., DpnI).
  • Gene replacement/knockout/addition: replace a normal gene, inactivate a gene, or insert a mutant gene to study function.
  • Conditional mutations: regulatory DNA sequences enable selective control of gene expression; tissue-specific activation possible.
  • Genome editing tools: CRISPR-Cas9, TALENs, Zinc Finger Nucleases (ZFNs) for targeted genome modification.

DNA Sequencing: Sanger Method (Chain-Termination)

  • Principle: DNA synthesis terminates at ddNTPs lacking 3′-OH; four reactions with ddATP, ddTTP, ddCTP, ddGTP in separate tubes produce fragments terminating at each nucleotide.
  • Separation by gel electrophoresis; labeled fragments detected to infer sequence complementary to the template strand.
  • Automation: fluorescently labeled ddNTPs in a single-tube reaction; four colors detected by a laser; readout is the sequencing trace.
  • Limitations: short read lengths; large genomes require fragmentation and overlap assembly (shotgun sequencing).

DNA Sequencing: From Sanger to Genome Projects

  • Shotgun sequencing: long DNA broken into random fragments; sequence reads are assembled by overlaps to reconstruct the genome.
  • Next-Generation Sequencing (NGS): sequencing by synthesis on beads/slides; massively parallel sequencing; depth/reads define coverage and accuracy; up to billions of clusters.
  • Read depth (coverage) is the average number of reads covering a position; higher depth increases confidence.

Human Genome Project & Genomics (Big Picture)

  • Genome annotation relies on sequence homology (comparative genomics) and genetics/biochemistry to assign function to genes.
  • Gene families: many genes are related; some essential genes are highly conserved across species.
  • Noncoding DNA: introns, regulatory elements, and repetitive sequences; important for regulation and genome organization.
  • Costs of sequencing have fallen dramatically over time: (3imes108)(3 imes 10^{8}) USD (early estimates) in 2003 to ≲(1imes103)(1 imes 10^{3}) USD per genome by 2020; Archon X Prize highlighted rapid genome sequencing.
  • Model organisms (yeast, nematode, fruit fly, zebrafish, mouse) have revealed fundamental cellular principles transferable to humans.

Genomics: Applications & Implications

  • Diagnostics: identify disease-causing genes and genetic variants.
  • Personalized medicine: sequencing informs targeted therapies and risk assessment.
  • Biotechnology: transgenic organisms, production of vaccines, enzymes, and biofuels; gene therapy vectors.
  • Population genetics & evolution: compare genomes across populations to trace history and variation.

Genome Organization & Functional Genomics

  • Genome organization includes coding and noncoding regions; many regulatory elements control gene expression.
  • Protein-coding genes vs. noncoding DNA; conservation helps predict gene function.
  • Tools and databases support annotation (e.g., model organism resources, genome browsers).

Model Organisms: Why They Matter

  • Model organisms reveal core cellular processes that are conserved across species.
  • Examples: Yeast, Worm (C. elegans), Flies (Drosophila), Zebrafish, Chickens, Mouse.
  • Studies in models underpin understanding of human biology and disease.

Quick Concepts to Recall

  • Restriction enzymes generate reproducible fragments; blunt vs sticky ends affect ligation strategy.
  • Cloning couples restriction digestion with ligation and transformation, followed by selection and screening.
  • Expression vectors require proper regulatory elements for controlled protein production.
  • Site-directed mutagenesis and genome editing enable precise genetic changes.
  • Sanger sequencing laid the groundwork; NGS enables large-scale genome projects with high depth and speed.
  • The genome projects inform diagnostics, therapeutics, and our understanding of biology across species.