CH14 – Site-Specific Recombination & Transposition
Site-Specific Recombination (SSR)
Definition & Essential Features
- Recombination reaction that occurs only at a short, pre-defined DNA sequence (recombination site).
- Typical site length: \approx 20-200\,\text{bp}.
- Two indispensable components:
- DNA site(s): e.g. LOX P, FRT (Flp Recombination Target), HIX in Salmonella.
- Site-specific recombinase (endonuclease/ligase in one): e.g. Cre (for LOX P), Flp (for FRT), Hin (for HIX).
- Net genetic outcomes dictated by relative orientation of the sites on the same DNA molecule:
- Inverted repeats → inversion of intervening segment.
- Direct repeats → excision/deletion of the intervening segment (can operate in reverse to mediate insertion).
- Sites on separate molecules → integration (insertion) events.
Catalytic Chemistry
- Two recombinase families distinguished by active-site residue:
- Tyrosine recombinases (Cre, Flp)
- Serine recombinases (Hin, some phage integrases)
- Key mechanistic points:
- Four recombinase monomers load: two per site (one per strand).
- Active-site Tyr or Ser performs nucleophilic attack on phosphate → covalent 3′- or 5′-phospho-protein intermediate.
- Strand exchange in two steps (first pair, then the other) → transient Holliday junction-like intermediate.
- No ATP required; energy of cleaved phosphodiester bond conserved in protein–DNA intermediate.
Yeast 2-µm Plasmid Copy-Number Control (Flp/FRT)
- Two-micron plasmid harbors two FRT sites in opposite orientation flanking the origin of replication.
- When copy number drops, Flp recombinase is expressed.
- During bidirectional replication the two FRTs are replicated; Flp inverts the FRT-bounded segment mid-replication, reorienting one fork → both forks chase each other around the circle.
- Result: rolling replication that makes multiple plasmid copies after a single initiation event.
Phage P1 Lysogenic Maintenance (Cre/LOX P)
- Injected as linear DNA containing two LOX P sites.
- Cre recombinase circularizes the genome by recombining the sites.
- Unlike most lysogenic phage, circular P1 remains episomal (extra-chromosomal), genes repressed.
- Low copy number sensed → nick initiates rolling-circle replication, producing concatemer with LOX P units; Cre resolves into monomeric circles for packaging.
Biotechnology Applications of SSR
- Place recognition sites flanking a cassette → inducible excision, inversion, or insertion by timed expression of recombinase.
- Remove transcription terminator to activate a silent gene.
- Site-specific gene knock-in via recombination between chromosomal site and plasmid site.
- Allele exchange: two crossovers replace wild-type allele with mutant while placing wild-type on the excised plasmid.
Phase Variation in Salmonella enterica (Hin/HIX)
- Salmonella alternates between two flagellin genes (FljB vs FliC) to evade host immunity.
- Hin invertase binds two HIX inverted repeats within a supercoiled loop and flips a promoter-containing segment.
- Orientation 1: promoter drives fljB + fljA (repressor) → FljA suppresses fliC.
- Orientation 2: promoter flipped → fljB & fljA OFF, repression lifted → FliC expressed.
- Characteristics of Hin system:
- Uses Ser active site.
- Requires accessory DNA-bending protein FIS.
- Only acts when both HIX sites are on the same supercoiled DNA domain.
Transposition
Minimal Transposable Element (TE)
- At least one gene: transposase (a recombinase).
- Flanked by terminal inverted repeats (IRs); sequence varies per TE, specifically recognized by its own transposase.
Three Mechanistic Classes
- Conservative ("cut-and-paste")
- TE excised from donor site and inserted into target; copy number unchanged.
- Replicative
- TE copied during transposition; donor retains original, target receives duplicate.
- Retrotransposition (RNA-mediated)
- TE transcribed → RNA.
- Reverse-transcribed → cDNA → integration; always increases copy number.
Conservative / Cut-and-Paste Details
- Transposase forms synaptic complex on IRs.
- Double-strand breaks at both ends via hydrolysis → 3′-OH ends form hairpins.
- Transposase makes staggered cuts in target DNA; TE inserted, gaps filled by host polymerase.
- Staggering produces target site duplications (TSDs) (direct repeats) flanking inserted TE.
Replicative Pathway (“copy-and-paste”)
- Single-strand nicks at TE ends; TE joined to target creating Shapiro intermediate (donor & recipient fused).
- DNA replication across branches duplicates TE.
- Resolution of cointegrate (two TEs head-to-head) by homologous or site-specific recombination separates donor and target, each now bearing a copy.
Retrotransposition
- Two subclasses:
- LTR retrotransposons (Ty, Gypsy)
- LTRs harbor promoters.
- RNA transcribed by host \text{RNAP II}.
- Reverse transcriptase (RT) plus integrase (IN) encoded within.
- cDNA synthesis primed by a host tRNA ("extra-chromosomally primed").
- IN inserts new copy with TSDs.
- Non-LTR retrotransposons (LINEs, SINEs)
- Lack LTRs; often poly-A tail.
- ORF encodes endonuclease/RT fusion.
- Mechanism: target-primed reverse transcription (TPRT).
• Endonuclease nicks target → 3′-OH primes reverse transcription of TE RNA directly at chromosomal site.
• Second-strand cleavage & synthesis completes insertion.
Transposable Elements in Bacteria
| Class | Hallmarks | Example |
|---|
| Insertion Sequence (IS) | Only transposase + IRs | IS1, IS3 |
| Composite Transposon | Two IS elements flanking cargo genes | Tn5 (kanamycin, bleomycin, streptomycin resistance) |
| Complex Transposon / Mu-like | Many genes incl. phage-like functions; moves by replicative transposition | Tn3, phage Mu |
- Mu acts both as a lytic/lysogenic phage and as a replicative transposon; insertion mutagenesis gives its name ("Mutator").
Transposons in Eukaryotes
- Roughly \approx 46\% of human genome = transposon-derived; \approx 90\% of those are retrotransposons.
- DNA TEs similar to bacterial ISs: e.g. Mariner family—encodes only transposase + IRs.
- Retrotransposon examples:
- Ty (yeast) & Gypsy (Drosophila): LTR, extra-chromosomal priming.
- LINEs (L1) & SINEs (Alu): non-LTR, TPRT.
Ty Retrotransposon Life Cycle (Saccharomyces)
- Transcription from LTR promoter; RNA poly-adenylated.
- Export to cytoplasm; translation of Gag, protease, RT, integrase.
- Gag packages RNA & enzymes into virus-like particle (VLP).
- Reverse transcription inside VLP → dsDNA.
- VLP transports cDNA + integrase back to nucleus → integration with TSDs.
Retrotransposons vs. Retroviruses
- Retroviruses resemble LTR retrotransposons but carry an additional env gene coding for envelope/capsid proteins enabling extracellular infection.
- Shared gene architecture: gag–pol–(env) framed by LTRs.
- Replication: RNA genome reverse-transcribed, integrated, transcribed, packaged, and released by cell lysis or budding.
Evolutionary Interplay Between TEs & Hosts
Protein Homologies
- Many transposases & retroviral integrases share D–D–E catalytic triad → common evolutionary origin.
\text{Asp}–\text{Asp}–\text{Glu} residues coordinate Mg^{2+} for DNA cleavage/strand transfer.
Life-cycle within a Host Genome
- Horizontal entry (rare) introduces active, highly mobile TE.
- Initial bursts of transposition may be deleterious (gene disruption).
- Host and TE co-evolve: mutations accumulate in IRs or coding region → mobility attenuates or silences TE.
Co-option of TE Machinery — Immune System Example
- V(D)J recombination that generates antibody diversity parallels DNA transposition.
- Recombination Signal Sequences (RSS) resemble IRs.
- RAG-1/RAG-2 recombinases are evolutionarily related to transposases.
- Antibody light-chain locus architecture:
- \sim 1{-}300 Variable (V) segments, 1{-}5 Joining (J) segments, one Constant (C).
- RAG-mediated recombination picks one V and one J → transcription/translation yields unique variable domain.
- Approximate potential diversity from V–J combinations: 300\times5 \approx 1.5\times10^{3} per light-chain; combinatorial pairing with heavy chains + junctional variability boosts total to \approx 10^{7} distinct antibodies.
- Hypothesis: an ancient transposon invasion supplied the RAG genes and RSS architecture, later domesticated for adaptive immunity.
Summary of Key Numerical Facts
- Recombination site length: 20-200\,\text{bp}.
- Human genome: \approx 46\% transposon-derived; 90\% of those are retrotransposons.
- Yeast 2-µm plasmid: amplification via Flp/FRT.
- Antibody repertoire: \approx 10^{7} specificities from <10^{3} germ-line segments.
Practical / Ethical / Evolutionary Implications
- SSR systems (Cre-LOX, Flp-FRT) are powerful genomic engineering tools—enable conditional knock-outs, gene switches, targeted insertions with minimal off-target effects when sites are unique.
- TE activity can drive genome evolution but can also cause disease (insertional mutagenesis, genome instability).
- Domestication of TE proteins (e.g. RAG recombinase, telomerase RT domain) illustrates how genomes repurpose mobile-element machinery.