Molecular Biology and Biochemistry - Transcription
The Central Dogma
- DNA contains information in a sequence of bases.
- Proteins are made from a sequence of amino acids.
- RNA acts as an intermediate 'messenger' by making a copy of the DNA.
- Only certain bits of the DNA are copied, and multiple copies are made.
- Proteins perform the work in a cell.
- The central dogma states that once information gets into a protein, it can’t get out again.
- Perpetuation of genetic information from generation to generation.
- Control of the phenotype through gene expression.
- Replication: DNA-dependent DNA polymerase
- Transcription: DNA-dependent RNA polymerase, resulting in mRNA
- Translation: Complex process involving ribosomes, tRNAs, and other molecules, resulting in a polypeptide
- Reverse transcription: RNA-dependent DNA polymerase (reverse transcriptase)
Gene Expression
- Prokaryotic transcription occurs without a nucleus and involves simpler regulation.
- Regulation is crucial in gene expression because all cells contain the same information, but only specific genes are transcribed and translated in particular cells.
- Muscle cells express more actin and myosin.
- Kidney cells express more ion transporters.
- Neuron cells express more ion channels and neurotransmitters.
- All cells express common genes for metabolic pathways, cell receptors, and membrane assembly.
Regulation of Gene Expression
- Regulation occurs at various stages:
- DNA: Replication, Degradation
- RNA: Transcription (Initiation, Elongation, Processing, Export), Degradation
- Protein: Translation (Initiation, Elongation, Termination), Targeting
Prokaryotic vs Eukaryotic Transcription
- Prokaryotic Transcription
- No nucleus
- The primary transcript is equivalent to the mature mRNA.
- Codons on the mRNA are translated into an amino acid sequence by ribosomes.
- Eukaryotic Transcription
- With a nucleus
- The primary transcript is pre-mature (pre-mRNA).
- The pre-mRNA is modified at both ends, and introns are removed to produce the mRNA.
- After processing, the mature mRNA is exported to the cytoplasm for translation by ribosomes.
Timing of Transcription and Translation
- Prokaryotes: Transcription and translation are coupled; proteins are synthesized directly from the primary transcript as it is made.
- Eukaryotes: Transcription and translation are separated; transcription occurs in the nucleus, and translation occurs in the cytoplasm on ribosomes.
Cellular Location of Transcription and Translation
- Prokaryotes: Transcription and translation occur in the cytoplasm.
- Eukaryotes: Transcription occurs in the nucleus, and translation occurs in the cytoplasm.
Types of RNA and Their Function
- mRNA: Encodes the sequence of amino acids.
- tRNA: Transfer RNA; charged with amino acids, serves as a link between mRNA and the growing amino acids chain during translation.
- rRNA: Ribosomal RNA; components of ribosomes, important for translation.
- microRNA: Short RNA involved in gene regulation.
- scRNA: Small cytoplasmic RNA; important for protein secretion in bacteria and protein targeting to the ER in eukaryotes.
- RNA in RNaseP: Ribozymes of this enzyme.
- snRNA: Small nuclear RNA; important in splicing.
- Telomerase RNA: Template for reverse transcription.
- snoRNA: Small nucleolar RNA; RNA processing in the nucleus.
- Viral RNA: Genome of the virus.
General Features of RNA Polymerization
- Similar to DNA polymerization except:
- Substrates are ribonucleoside triphosphates: ATP, GTP, CTP, UTP (dNTP in DNA replication).
- Only one strand of DNA serves as a template (compared with both DNA strands in DNA replication).
- RNA polymerization can be initiated de novo (no primer required).
- The RNA molecule will be:
- Complementary to the DNA template (antisense) strand.
- Identical (except that base uridine replaces thymidine) to the DNA non-template (sense) strand.
- RNA synthesis is catalyzed by RNA polymerases and proceeds in the 5’ → 3’ direction.
Template and Coding Strands
- The template or antisense (-) strand is complementary to the sequence of the RNA transcript.
- The coding or sense (+) strand of DNA has the same sequence as the transcript, with T in place of U.
- DNA synthesis is antiparallel and occurs in the 5’ to 3’ direction.
RNA Polymerases
- Catalyze the nucleophilic attack of the 3’-hydroxyl group of the last nucleotide in the chain on the α phosphoryl group of the incoming nucleoside triphosphate.
- Elongation occurs when a ribonucleoside triphosphate base-pairs with a nucleotide on the DNA template.
- The 3′-hydroxyl group of the last nucleotide in the chain attacks the α-phosphoryl group of the incoming nucleoside triphosphate, reforming a phosphodiester linkage and releasing PPi (pyrophosphate).
RNA Chains
- Formed de novo and grow in the 5′-to-3′ direction.
- The 5′ end of the transcript is usually pppG or pppA.
Transcription in Prokaryotes
- Polymerization catalyzed by RNA polymerase, which can initiate synthesis, uses rNTPs, requires a template, and unwinds and rewinds DNA.
- Three stages:
- Initiation (Recognition and binding)
- Elongation
- Termination and release
Regulatory Sequences on DNA
- Regulatory sequences: Sites for the binding of regulatory proteins; these influence the rate of transcription. They can be found in a variety of locations.
- Promoter: Site for RNA polymerase binding; signals the beginning of transcription.
- Terminator: Signals the end of transcription.
Stages of Transcription
- Initiation: The promoter functions as a recognition site for transcription factors, which enable RNA polymerase to bind to the promoter. Following binding, the DNA is denatured into an open complex.
- Elongation/synthesis of the RNA transcript: RNA polymerase slides along the DNA in an open complex to synthesize RNA.
- Termination: A terminator is reached that causes RNA polymerase and the RNA transcript to dissociate from the DNA.
- Binding of RNA polymerase holoenzyme to a promoter region in DNA.
- Localized unwinding of the two strands of DNA by RNA polymerase to provide a single-stranded template.
- Formation of phosphodiester bonds between the first few ribonucleotides in the nascent RNA chain.
- RNA synthesis occurs in a complex called the transcription bubble, where approximately 17 bases of the DNA are unwound.
- For another nucleotide to become incorporated into the transcript, translocation of RNA polymerase must occur. The transcription bubble must move to allow the next nucleotide to base-pair with the DNA template.
- As the bubble moves, the RNA product exits the enzyme, and the transcribed DNA rejoins its partner.
Starting Transcription
- Specific sequences signify the beginning of a gene; the promoter region (compared with the origin in DNA replication).
- The first nucleotide to be transcribed is denoted as +1. The nucleotide immediately preceding it is denoted as –1.
- Located on the sense strand/coding strand/non-template strand.
- Promoters function to provide a stable binding site for RNA polymerase and transcription factors.
- Core promoter elements for E. coli include the -10 box (Pribnow box) and the -35 box.
- RNA polymerase binds to different promoters with different strengths; binding strength relates to the level of gene expression.
- Contains a consensus sequence; not all promoters have this exact sequence, but the nearer they are to it, the more strongly RNA polymerase binds to it.
- The promoter can be on either DNA strand; whichever one it is on, the opposite (template) strand is transcribed.
- Both strands of DNA can act as the template in different sections.
- The promoter region specifies the site and direction of mRNA synthesis.
DNA Strands as Templates
- Both DNA strands can be templates but synthesize different RNA transcripts and peptides.
- The promoter region specifies the starting site and direction of transcription.
- The transcript initiation site is +1.
- Pribnow box (TATAAT) located at –10.
- -35 box.
- Consensus sequences (Non-template strand).
- Strong promoters are more similar to the consensus sequence; weak promoters have more variations.
- Changes in position and sequence of -10 element and changes in the distance between the -10 and -35 sequences affect efficacy.
E. Coli RNA Polymerase
- Multisubunit:
- Core enzyme (5 subunits): α2ββ’ω
- Holoenzyme = Core (α,α,β,β’,ω) + sigma (σ)
- Functions of the subunits:
- α: Holds enzymes together.
- β: Ribonucleoside triphosphate binding site.
- β’: DNA template binding region.
- ω: Maintains the conformation and recruits β’.
- σ: DNA recognition, unwinds DNA, initiation of transcription.
- Transcription starts at a promoter sequence and ends at a terminator sequence.
Sigma (σ) Factor
- Essential for recognition of promoter.
- RNA polymerase only binds to specific sequences (promoters) with tight affinity when the sigma factor joins it to form the RNA polymerase holoenzyme, stimulating transcription.
- Combines with core → holoenzyme, adopting an “open hand” conformation and positions the enzyme over the promoter.
- Does NOT stimulate elongation (it blocks the movement of RNA polymerase).
- Falls off after 4-9 nucleotides are incorporated, then the “hand” closes.
Initiation of Transcription Steps
- Formation of closed promoter (binary) complex.
- Formation of open promoter (binary) complex.
- Ternary complex (RNA, DNA, and enzyme), abortive initiation - the polymerase creates short mRNA transcripts that are released before the polymerase detaches itself from the promoter.
- Promoter clearance (elongation ternary complex):
- First ribonucleotide (rnt) becomes unpaired.
- Polymerase loses sigma.
- Elongation starts - ribonucleotides added to 3’ end.
- Occurs after 4 - 10 nucleotides are added.
- The first ribonucleotide becomes unpaired from the antisense (template) strand, and DNA strands re-anneal.
- Polymerase loses sigma, which is recycled, resulting in a “closed hand” that surrounds DNA.
- NusA binds to core polymerase to stabilize the binding.
- The transcription bubble moves along the DNA as DNA is unwound and then rewound, while the RNA product is extruded from the complex.
- RNA pol/NusA complex stays on until termination. Rate=20-50 nt/second.
- NO PROOFREADING - Unlike DNA polymerase, RNA polymerase has no 3’ to 5’ exonuclease activity, resulting in 1 mistake in 10,000 nucleotides.
Transcription Bubble
- RNA polymerase unwinds and rewinds DNA.
- Template strand is used for synthesis, and the coding strand is displaced.
- Nascent RNA is synthesized with an RNA-DNA hybrid helix.
Termination
- Occurs at specific sites on the template strand called Terminators.
- Two types of termination:
- Rho (ρ)-independent/ intrinsic terminator
- Rho (ρ) dependent terminators
Rho-Independent Termination
- Contains inverted complementary sequences that form a hairpin when transcribed, which slows transcription.
- A second repeat sequence is polyA (polyU on RNA), which is weak, and the transcript separates from the DNA template.
- Transcription terminates when inverted repeats form a hairpin followed by a string of uracils.
Characteristics of Rho-Independent Termination
- Transcription is terminated due to a specific sequence in the terminator DNA.
- The polymerase reaches a termination sequence of guanines and cytosines, followed by a sequence of repeating adenines.
- A loop is formed in the guanine-cytosine region, and as guanine forms three hydrogen bonds with cytosine, it takes longer for the RNA Polymerase to join these.
- This puts a strain on the adenine region and causes the strand to break off, releasing the polymerase.
- RNA polymerase passes over inverted repeats, hairpins begin to form in the transcript, and the poly-U:poly-A stretch melts, causing RNA polymerase and transcript to fall off.
Rho-Independent Termination Mechanism
- Stem-loop structure causes RNA polymerase to pause.
- U-rich sequence is not able to hold the RNA-DNA hybrid together.
- Termination occurs.
Rho-Dependent Termination
- Rho factor is an ATP-dependent helicase.
- Rho binds to a particular sequence on the RNA (called the rho utilization site, rut) and uses the energy of ATP hydrolysis to chase down the polymerase in the transcription bubble.
- Contact with rho causes the transcription bubble to dissociate, catalyzing unwinding of the RNA: DNA hybrid.
- Rate: 50~90 nucleotides/sec.
Characteristics of Rho-Dependent Termination
- Rho factor protein binds to regions with no secondary structure.
- The RNA sequence upstream from termination doesn't form secondary structure.
- Rho factor binds to RNA and moves toward the 3' end.
- At a hairpin, transcription slows, and rho factor can