SCH2226 Human Molecular Genetics - Gene Expression: Transcription

Topic = Transcription

Overview of Gene Expression and the Central Dogma

  • The Central Dogma: Proposed by Francis Crick in 1956, this concept describes the flow of genetic information from: DNARNAProtein\text{DNA} \rightarrow \text{RNA} \rightarrow \text{Protein}.

  • Definition of Transcription: The synthesis of an RNA molecule using a DNA template.

  • Key Characteristics:     

  • Only one of the DNA strands is transcribed.     

  • The process is catalyzed by the enzyme RNA polymerase.

  • Stages of Transcription: Transcription is divided into three distinct steps in both prokaryotes and eukaryotes:     

  • 1. Initiation: The process of getting started where RNA polymerase binds to the DNA.     

  • 2. Elongation: RNA polymerase adds nucleotides to the growing chain.     

  • 3. Termination: Transcription stops, and the RNA polymerase dissociates from the DNA.

RNA Molecules and Structures

  • Four Main Types of RNA:     

  • Messenger RNA (mRNA\text{mRNA}): Carries genetic information for protein synthesis.     

  • Transfer RNA (tRNA\text{tRNA}): Contains an anticodon and carries amino acids (attached via an ester bond to the ACC\text{ACC} sequence at the 3'-end).     

  • Ribosomal RNA (rRNA\text{rRNA}): Components of the ribosome.     

  • Small Nuclear RNA (snRNA\text{snRNA}): Essential for eukaryotic pre-mRNA processing.

  • Structural Elements:     

  • RNA can exhibit intramolecular base-pairing.     

  • tRNA molecules interact with mRNA codons via their specific anticodons.

Transcription in Prokaryotes (E.coliE. coli as Model)

  • Directionality: The RNA chain is synthesized in the 535' \rightarrow 3' direction.

  • Template: Only one strand of the DNA is used as a template to determine the base sequence.

  • Denaturation: The DNA double helix is opened/denatured by RNA polymerase in prokaryotes.

  • Initiation and the Sigma (σ\sigma) Factor:     

  • Initiation requires a complex consisting of RNA polymerase and the σ\sigma factor protein binding to the promoter.     

  • The σ\sigma factor guides the RNA polymerase to recognize specific promoter sequences.     

  • Once transcription begins (after 8 to 9 nucleotides are polymerized), the σ\sigma factor dissociates from the core enzyme, and the core enzyme completes the synthesis.

  • Regions of a Prokaryotic Gene:     

  • Promoter: A sequence that attracts RNA polymerase to begin transcription at a specific site.     

  • RNA-coding sequence: The sequence that is actually transcribed.     

  • Terminator: Specifies where transcription will stop.

  • Promoter Details and Consensus Sequences:     

  • 35-35 Region: The consensus sequence is 5-TTGACA-35'\text{-TTGACA-}3'. The holoenzyme binds loosely here first.     

  • 10-10 Region: Also known as the Pribnow box; the consensus sequence is 5-TATAAT-35'\text{-TATAAT-}3' RNA polymerase binds tightly here, causing a local untwisting of approximately 17bp17\,bp.     

  • +1+1 Site: The specific position where transcription initiation begins.

  • Gene Regulation:     

  • Transcription levels depend on the ability of the σ\sigma factor to recognize promoter sequences.     

  • Promoters deviating from the consensus sequence show different transcription levels.     

  • σ70\sigma^{70}: The most abundant σ\sigma factor in E.coliE. coli, used for most genes. Other factors are produced in response to changing environmental conditions.

Comparison: Replication vs. Transcription

Feature

Replication

Transcription

Template

Both strands used

Single strand used

Scope

Whole genome

Small portion of genome

Primer

Yes

No

Enzyme

DNA polymerase

RNA polymerase

Product

Double-stranded DNA (dsDNAdsDNA)

Single-stranded RNA (ssRNAssRNA)

Base pair

ATA-T, GCG-C

AUA-U, TAT-A, GCG-C

Proof reading

Yes

No

Transcription in Eukaryotes

  • Key Differences from Prokaryotes:     

  • Occurs in the nucleus (prokaryotic occurs in the cytoplasm).     

  • Involves three different RNA polymerases.     

  • Requires many transcription factor proteins.     

  • Includes enhancers and silencers to regulate the rate of initiation.

  • Eukaryotic RNA Polymerases:     

  • RNA Polymerase I: Located in the nucleolus; transcribes the three major rRNAs (28S28S, 18S18S, and 5.8S5.8S).    

  • RNA Polymerase II: Located in the nucleus; transcribes messenger RNA (mRNA\text{mRNA}) and some snRNA\text{snRNA}.    

  • RNA Polymerase III: Located in the nucleus; transcribes transfer RNA (tRNA\text{tRNA}), 5StRNA5S\,tRNA, and some snRNA\text{snRNA}.

  • Promoter Elements:     

  • Core Promoter Elements: Located near the start site; specify where transcription begins.        

  • TATA box (Hogness box): Located at 30-30 to 25-25. The full consensus sequence is 5-TATAAAA-35'\text{-TATAAAA-}3'.     

  • Proximal Promoter Elements: Located between 50-50 and 200-200; required for high levels of transcription (e.g., CAAT box at 70-70 to 80-80).

  • The Transcription Complex:     

  • TATA Binding Protein (TBP): A 34kDa34\,kDa protein that binds to the TATA box.     

  • TFIID: A complex formed by TBP and several TBP-associated factors (TAFs). Binding of TFIID is the first step in forming the "committed complex."    

  • The entire complex spans DNA from position 30-30 to +30+30 relative to the initiation site.

  • Enhancers and Silencers:     

  • Cis-acting: Enhancers located on the same strand of DNA as the promoter.     

  • Trans-acting: Enhancers acting on promoter elements on a different DNA strand.     

  • Activators: Proteins that bring enhancers close to the promoter to achieve maximal transcription.

Post-Transcriptional Processing in Eukaryotes

  • 5' Capping:     

  • Structure: A 7-methylguanosine7\text{-methylguanosine} cap is added to the 55' end of the mRNA via a 5-to-55'\text{-to-}5' triphosphate bridge.     

  • Occurs in the nucleus before transport to the cytoplasm.     

  • Functions: Protects mRNA from exonucleases and is required for efficient translation initiation.

  • 3' Polyadenylation (Poly-A Tail):     

  • Mechanism: mRNA is cleaved 10-3510\text{-}35 nucleotides downstream from the AAUAAAAAUAAA recognition site (approximately 20bp20\,bp from the site).     

  • Enzyme: Poly(A) polymerase adds up to 200A200\,A residues.     

  • Function: Protects mRNA from 55' exonucleases [per transcript note].

  • Intron Splicing:     

  • Introns: Intervening sequences that do not code for proteins; must be removed in the nucleus.

  • Exons: The protein-coding sequences.     

  • Spliceosome: The machinery for splicing, consisting of five snRNA\text{snRNA} molecules (U1U1, U2U2, U4U4, U5U5, and U6U6) and over 6060 proteins.     

  • snRNP: Small ribonucleoprotein particles, often called "snurps," which facilitate splicing by positioning RNA segments and binding to consensus sequences at exon-intron junctions.     

  • Splicing Consensus Sequences:         - 5 Exon junction5'\text{ Exon junction}: 5-AGGUAAGU-35'\text{-AGGUAAGU-}3'         - 3 Exon junction3'\text{ Exon junction}: 5-CAGG-35'\text{-CAGG-}3'

  • Final Structure of Mature mRNA:     

  • Includes the 5 CAP5'\text{ CAP}, 5-UTR5'\text{-UTR} (untranslated region), the Open Reading Frame (ORF) (starting with AUGAUG and ending with stop codons UAAUAA, UAGUAG, or UGAUGA), the 3-UTR3'\text{-UTR}, and the poly(A)poly(A) tail bound by the cap-binding complex and PAB (poly-A binding protein).