Transcription and Post-Transcriptional Modification
Transcription
- DNA contains the code for proteins, but protein synthesis happens in the cytoplasm.
- DNA can't leave the nucleus because it would be degraded. RNA is used to transmit genetic information.
- Transcription: mRNA creation from a DNA template.
- mRNA carries information directly from DNA.
- tRNA and rRNA are other types of RNA that play roles in protein translation.
Mechanism of Transcription
- Transcription copies only one of the two DNA strands.
- Initiation:
- Enzymes like helicase and topoisomerase unwind DNA, preventing supercoils.
- This allows access to the gene of interest.
- Transcription synthesizes a single mRNA strand from the DNA template (antisense strand).
- The mRNA strand is antiparallel and complementary to the DNA template strand.
- RNA is synthesized by DNA-dependent RNA polymerase.
- RNA polymerase finds genes by searching for promoter regions.
- Eukaryotes:
- RNA polymerase II transcribes mRNA.
- The TATA box (high concentration of thymine and adenine) is the binding site in the promoter region.
- Transcription factors help RNA polymerase locate and bind to the promoter.
- RNA polymerase, unlike DNA polymerase, doesn't need a primer.
- Eukaryotes have three RNA polymerases, but only one transcribes mRNA.
- RNA polymerase I: Located in the nucleolus, synthesizes rRNA.
- RNA polymerase II: Located in the nucleus, synthesizes hnRNA (preprocessed mRNA) and some snRNA.
- RNA polymerase III: Located in the nucleus, synthesizes tRNA and some rRNA.
- RNA polymerase moves along the template strand in the 3' to 5' direction.
- This allows mRNA construction in the 5' to 3' direction.
- RNA polymerase doesn't proofread, so the transcript isn't edited.
- The coding (sense) strand isn't used as a template but is identical to the mRNA (except T is replaced by U).
- Numbering system:
- The first base transcribed to RNA is +1.
- Bases upstream (5' end) are negative numbers (-1, -2, -3, etc.).
- Bases downstream (3' end) are positive numbers (+2, +3, +4, etc.).
- No nucleotide is numbered zero.
- The TATA box is usually around -25.
- Transcription continues until the RNA polymerase reaches a termination sequence.
- The initial transcript is hnRNA (heterogeneous nuclear RNA).
- mRNA is derived from hnRNA via post-transcriptional modification.
Post-Transcriptional Processing
- hnRNA must undergo processing to interact with ribosomes and survive in the cytoplasm.
- Maturation includes:
- Splicing (introns and exons).
- 5' cap.
- 3' poly-A tail.
Splicing Introns and Exons
- Non-coding sequences (introns) are removed, and coding sequences (exons) are linked.
- Splicing is done by the spliceosome.
- snRNA molecules couple with proteins (snRNPs).
- The snRNP/snRNA complex identifies the 5' and 3' splice sites of introns.
- Introns are excised in a lariat shape and degraded.
- The function of introns is not fully understood.
- Hypotheses:
- Regulation of gene expression.
- Maintaining genome size.
- Allowing rapid protein evolution.
- Modular function: standard sequences swapped in/out.
Five Cap
- A 7-methylguanylate triphosphate cap is added to the 5' end of hnRNA during transcription.
- The cap is recognized by the ribosome as a binding site.
- It also protects mRNA from degradation.
Three Poly A Tail
- A poly(A) tail (adenine bases) is added to the 3' end.
- It protects against rapid degradation.
- The longer the tail, the longer the mRNA survives.
- It also assists with export from the nucleus.
- After processing, only exons remain, and the cap and tail are added, creating mature mRNA.
- Untranslated regions (UTRs) remain at the 5' and 3' ends.
- Ribosome initiates translation at the start codon AUG and ends at a stop codon (UAA, UGA, or UAG).
Alternative Splicing
- hnRNA can be spliced in different ways to produce protein variants.
- This allows more proteins from a limited number of genes.
- Humans make around 100,000 proteins from about 20,000-25,000 genes.
- Alternative splicing also regulates gene expression.