HMG - Week 8: Recombinant DNA Technology: Cloning and Expression Vectors LO

Overview of Cloning Vectors

  • Definition: A cloning vector is a DNA molecule used to transport foreign DNA into a host cell. Once inside the host cell, the vector allows the foreign DNA to be replicated and copied multiple times.

  • Primary Functions:

    • Hold DNA fragments: Providing a physical structure to carry genetic material.

    • Allow DNA replication: Utilizing the host's machinery to duplicate the inserted DNA.

    • Increase copy numbers: Amplifying a specific DNA fragment to produce large quantities.

    • Storage and manipulation: Enabling scientists to store genes in a stable form and perform genetic engineering task.

  • Characteristics of a High-Quality Cloning Vector:

    • Origin of Replication (ori): A specific DNA sequence where replication begins; essential for the vector to replicate independently within the host cell.

    • Selectable Marker Gene: Often an antibiotic resistance gene that allows for the identification of host cells that have successfully taken up the vector.

    • Multiple Cloning Site (MCS): A region containing several unique restriction sites where foreign DNA can be inserted without disrupting essential vector functions.

    • Small Physical Size: Facilitates easier manipulation during laboratory procedures and reduces the likelihood of DNA breakage.

    • Stability: Must remain stable during replication within the host cells to ensure the genetic information is not lost over successive generations.

Types of Cloning Vectors and Capacity

  • Plasmids:

    • Structure: Small, circular, double-stranded DNA molecules.

    • Behavior: They replicate independently of the bacterial chromosome.

    • Usage: Most commonly utilized in Eschericia coli (E. coli).

    • Insert Capacity: Typically carry DNA inserts up to approximately 10kb10\,\text{kb}.

    • Advantages: Easy to isolate, easy to manipulate in vitro, and often possess a high copy number.

  • Bacteriophage Vectors:

    • Definition: Viruses that specifically infect bacterial cells.

    • Behavior: They replicate inside bacteria and typically lead to the lysis of the host cell to release new viral particles.

    • Advantages: More efficient DNA delivery compared to plasmids and a larger insert capacity for longer DNA fragments.

  • Yeast Artificial Chromosomes (YACs):

    • Structure: Artificial chromosomes maintained within yeast cells.

    • Capacity: Can accommodate very large DNA fragments, ranging from hundreds of kilobases to several megabases.

    • Advantages: Essential tool for genome mapping projects.

  • Key Selection Criteria for Exams:

    • The choice of vector depends primarily on the size of the DNA being cloned.

    • Plasmid: Small inserts.

    • Bacteriophage: Medium inserts.

    • YAC: Very large inserts.

Expression Vectors and Protein Production

  • Definition: Specialized cloning vectors designed not only for DNA replication but for the actual production (expression) of the protein encoded by the inserted gene.

  • Primary Purposes:

    • Facilitation of gene expression.

    • Large-scale protein production.

    • Protein purification processes.

    • Manufacture of recombinant proteins for medicine and industry.

  • Industrial/Medical Examples:

    • Human insulin production.

    • Human growth hormone production.

    • Vaccine protein manufacturing.

  • Critical Features:

    • Promoter: A DNA sequence located upstream of the inserted gene that controls transcription. It must be recognized by the host cell's RNA polymerase.

  • Inducible Promoters:

    • Function: These can be switched "ON" or "OFF" using specific chemical triggers.

    • Advantages: They prevent the production of proteins that might be toxic to the host cell and allow for controlled timing of protein expression.

    • The lac Promoter Example:

      • Normal State: The promoter is "OFF."

      • Induction: When Isopropyl β\beta-D-1-thiogalactopyranoside (IPTG) is added, it turns manual transcription "ON."

  • Rationale for Inducible Regulation:

    • Without a switch, excess protein production can kill host cells.

    • Unregulated protein synthesis may interfere with normal cell metabolism.

    • High-level expression can cause the host cells to grow poorly, reducing overall yield.

Plasmids and the pUC19 Model

  • General Definition: Small circular double-stranded DNA molecules found naturally in bacteria, separate from the genomic chromosome, and capable of self-replication.

  • The pUC19 Plasmid Features:

    • Origin of Replication (ori): The starting point for duplication.

    • ampR Gene: Provides resistance to the antibiotic ampicillin; serves as a selectable marker to ensure only cells with the plasmid survive antibiotic treatment.

    • lacZ Gene: Encodes the enzyme β\beta-galactosidase.

    • Multiple Cloning Site (MCS): Positioned within the lacZ gene, containing several unique restriction sites for foreign DNA insertion.

  • Blue-White Screening Principle:

    • This technique identifies which bacteria have taken up a recombinant plasmid (plasmid with an insert) versus an empty plasmid.

    • Mechanism: Foreign DNA is inserted into the MCS, which is located inside the lacZ gene.

    • Scenario A: No Insert Present:

      • The lacZ gene remains functional.

      • β\beta-galactosidase is produced.

      • The substrate X-gal is broken down by the enzyme.

      • Result: Blue colonies develop.

    • Scenario B: Insert Present (Recombinant):

      • The lacZ gene is disrupted (insertional inactivation).

      • No β\beta-galactosidase is produced.

      • X-gal remains intact.

      • Result: White colonies develop.

  • Interpretation Table:

    • Blue Color: Empty vector (No insert).

    • White Color: Recombinant plasmid (Contains insert).

  • Exam Tip: In a laboratory setting, white colonies are the desired outcome because they contain the recombinant DNA.

Restriction Enzymes and DNA Ligation

  • Definition: Also known as restriction endonucleases, these are "molecular scissors" that cut DNA at specific recognition sequences.

  • Natural Role: Part of the bacterial defense system to protect against foreign or viral DNA.

  • Recognition Sites:

    • Usually between 44 and 88 base pairs (bp) in length.

    • Sequences are palindromic (they read the same 535' \rightarrow 3' on both complementary strands).

  • Example (EcoRI):

    • Sequence: 5'-GAATTC-3' and 3'-CTTAAG-5'.

  • Types of DNA Ends:

    • Sticky Ends: Produced by enzymes like EcoRI, BamHI, and PstI. These have single-stranded overhangs that can hydrogen bond with complementary sequences, making them easier to ligate.

      • EcoRI produces AATT overhangs.

      • BamHI produces 55' overhanging sticky ends.

      • PstI produces 33' overhanging sticky ends.

    • Blunt Ends: Produced by enzymes like SmaI. These have no overhangs and are significantly harder to ligate.

  • DNA Ligation:

    • DNA Ligase: An enzyme that seals the phosphodiester bonds of the DNA backbone.

    • Process:

      1. Restriction enzyme cuts the DNA.

      2. Complementary sticky ends anneal through base pairing.

      3. DNA ligase seals the sugar-phosphate backbone, resulting in a recombinant DNA molecule.

Restriction Site Probability Calculations

  • Formula: Probability=(14)nProbability = (\frac{1}{4})^n, where nn is the number of bases in the recognition sequence.

  • Example (EcoRI):

    • Recognition site has 66 bases.

    • Probability=(14)6=14096Probability = (\frac{1}{4})^6 = \frac{1}{4096}.

    • This means an EcoRI site occurs approximately once every 4,096bp4,096\,\text{bp}.

  • Frequency Relationship Table:

    • 4 bp site: 11 in 256bp256\,\text{bp}.

    • 6 bp site: 11 in 4,096bp4,096\,\text{bp}.

    • 8 bp site: 11 in 65,536bp65,536\,\text{bp}.

  • Exam Tip: Longer recognition sequences occur less frequently in a genome.

Genomic and cDNA Libraries

  • Genomic Library:

    • Definition: A collection of cloned DNA fragments representing the entire genome of an organism.

    • Construction Steps: Extract genomic DNA \rightarrow Digest with restriction enzymes \rightarrow Clone fragments into vectors \rightarrow Transform host cells.

    • Contents: Includes exons, introns, promoters, regulatory sequences, and intergenic DNA (everything in the genome).

    • Uses: Whole genome sequencing, gene mapping, and identification of disease genes.

  • cDNA Library:

    • Definition: Contains DNA copies of mature mRNA molecules expressed in a specific cell or tissue.

    • Construction Steps: Isolate mRNA \rightarrow Use Reverse Transcriptase to synthesize complementary DNA (cDNA) \rightarrow Clone cDNA into vectors.

    • Contents: Contains exons only. It lacks introns, promoters, and intergenic DNA because the source material (mRNA) has already been spliced.

  • Comparison Table:

    • Feature: Source | Genomic Library: Genomic DNA | cDNA Library: mRNA

    • Feature: Exons | Genomic Library: Yes | cDNA Library: Yes

    • Feature: Introns | Genomic Library: Yes | cDNA Library: No

    • Feature: Promoters | Genomic Library: Yes | cDNA Library: No

    • Feature: Represents whole genome | Genomic Library: Yes | cDNA Library: No

    • Feature: Represents expressed genes | Genomic Library: No | cDNA Library: Yes

  • Critical Exam Question: Why is cDNA used for expressing human proteins in bacteria?

    • Answer: Bacteria do not possess the cellular machinery to remove introns. Therefore, intron-free cDNA must be used to ensure the correct protein is synthesized.

Analytical Techniques: Southern Blotting and Yeast Two-Hybrid

  • Southern Blotting:

    • Purpose: To detect a specific DNA sequence within a complex mixture of DNA.

    • Steps:

      1. DNA extraction.

      2. Digestion via restriction enzymes.

      3. Gel electrophoresis to separate fragments by size.

      4. Denaturation of DNA.

      5. Transfer of DNA to a membrane.

      6. Addition of a labeled probe.

      7. Probe hybridization to the target sequence.

      8. Detection of the label.

    • Probe Definition: A labeled, single-stranded DNA fragment that is complementary to the target sequence.

    • Mnemonic: Southern = DNA.

  • Yeast Two-Hybrid System:

    • Purpose: To detect and study protein-protein interactions.

    • Principle: Transcription factors often have two distinct domains: a DNA-binding domain (DBD) and an activation domain (AD). A reporter gene is only activated when these two domains are brought into close proximity.

    • Experimental Design:

      • Bait Protein: The protein of interest attached to the DNA-binding domain (DBD).

      • Prey Protein: The potential interacting partner attached to the activation domain (AD).

    • Outcomes:

      • Interaction: If Bait and Prey interact, DBD and AD are brought together, the reporter gene turns "ON," and the interaction is detected.

      • No Interaction: The reporter gene remains "OFF."

    • Applications: Pathway analysis, drug target identification, and functional genomics.