Translation of mRNA

The Genetic Basis for Protein Synthesis and Historical Discoveries

  • Fundamental Principles of Gene Expression:     - Proteins are considered the active participants in cellular structure and function.     - Genes that encode polypeptides are specifically termed protein-coding genes.     - These genes are transcribed into messenger RNA (mRNA) molecules.     - The primary function of genetic material is to code for cellular protein production.     - Cellular proteins must be produced in the correct cell, at the proper time, and in suitable quantities.

  • Initial Discoveries by Archibald Garrod (Early 1900s):     - Garrod was the first to propose a relationship between genes and the production of proteins.     - He studied patients with metabolic defects, most notably alkaptonuria.     - Alkaptonuria Characteristic symptoms: Black urine and a bluish-black discoloration of the skin and cartilage.     - Metabolic Pathway Failure: Garrod proposed the disease resulted from a missing enzyme known as homogentisic acid oxidase.     - He identified that the disease followed a recessive inheritance pattern and described it as an "inborn error of metabolism."     - Metabolic pathways (specifically phenylalanine metabolism) can be disrupted by mutations, leading to missing or defective enzymes which cause various human diseases.

  • Experiments by Beadle and Tatum (Early 1940s):     - George Beadle and Edward Tatum investigated the relationship between genes, enzymes, and traits using the genetic model Neurospora crassa (common bread mold).     - Reseach Question: "Is it one gene–one enzyme or one gene–many enzymes?"     - Experimental Procedure: They analyzed simple nutritional requirements by isolating mutant strains unable to grow on minimal media lacking specific nutrients.     - Case Study: Methionine Biosynthesis:         - Researchers isolated strains unable to grow on minimal plates lacking the amino acid methionine.         - Wild-type (WT) and four mutant strains were streaked on minimal plates and plates supplemented with O-acetylhomoserine, cystathionine, homocysteine, or methionine.         - Strain 1: Missing Enzyme 1 (cannot grow without subsequent intermediates).         - Strain 2: Missing Enzyme 2.         - Strain 3: Missing Enzyme 3.         - Strain 4: Missing Enzyme 4.     - Conclusion: A single gene controls the synthesis of a single enzyme, leading to the one gene–one enzyme hypothesis.

  • Modifications to the One Gene-One Enzyme Theory:     1. Enzymes represent only one specific category of proteins.     2. Some proteins consist of two or more different polypeptides. Note: "Polypeptide" denotes structure, while "protein" denotes function.     3. Many genes code for functional RNA molecules rather than polypeptides (e.g., tRNA, rRNA).     4. A single gene can code for multiple polypeptides through the process of alternative splicing.

The Genetic Code and Translation Fundamentals

  • The Language of Translation:     - Translation is the interpretation of the nucleotide language of mRNA into the amino acid language of proteins.     - This process relies on the genetic code, where information is stored in groups of three nucleotides called codons.

  • Features of the Genetic Code:     - The code consists of a total of 6464 codons.     - Start Codon: AUG (specifies methionine). This defines the reading frame for subsequent codons.     - Termination (Stop) Codons: UAA, UAG, and UGA.     - Degeneracy: More than one codon can specify the same amino acid. For example, GGU, GGC, GGA, and GGG all code for glycine. These are termed synonymous codons.     - Universality: The code is nearly universal across all life forms, with few rare exceptions.

  • Exceptions to the Universal Genetic Code:     - Selenocysteine (Sec) and Pyrrolysine (Pyl): Often called the 21st21^{st} and 22nd22^{nd} amino acids.         - Found in specialty enzymes.         - Coded by UGA and UAG respectively, provided specific downstream mRNA sequences are present.     - Mitochondrial and Protozoan Variations:         - AUA: Universal (Isoleucine); Exception (Methionine in yeast and vertebrate mitochondria).         - UGA: Universal (Stop); Exception (Tryptophan in vertebrate mitochondria).         - CUU, CUC, CUA, CUG: Universal (Leucine); Exception (Threonine in yeast mitochondria).         - AGA, AGG: Universal (Arginine); Exception (Stop codon in ciliated protozoa and yeast/vertebrate mitochondria).         - UAA, UAG: Universal (Stop); Exception (Glutamine in ciliated protozoa).

  • The Reading Frame:     - The reading frame is defined by the start codon typically at the 55' end.     - Example: 5’- AUG CCC GGA GGC ACC GUC CAA U- 3’ translates to Met - Pro - Gly - Gly - Thr - Val - Gln.     - Deletion of a single nucleotide (e.g., a CC at the 6th6^{th} position) shifts the entire frame, changing all subsequent amino acids (e.g., Met - Pro - Glu - Ala - Pro - Ser - Asn).

Protein Structure and Function

  • Directionality of Synthesis:     - Polypeptide synthesis mirrors the 55' to 33' orientation of mRNA.     - A peptide bond forms between the carboxyl group of the last amino acid and the amino group of the incoming amino acid.     - N-terminus (amino-terminus): The first amino acid with an exposed amino group.     - C-terminus (carboxyl-terminus): The last amino acid with an exposed carboxyl (COOHCOOH) group.

  • Amino Acid Composition:     - There are 2020 standard amino acids.     - Each features a unique side chain (R group) with specific chemical properties.     - Nonpolar, Aliphatic & Aromatic: Hydrophobic; often buried in the protein interior (e.g., phenylalanin, leucine).     - Polar (Neutral, Acidic, Basic): Hydrophilic; likely found on the protein surface.

  • Four Levels of Protein Structure:     1. Primary structure: The linear amino acid sequence.     2. Secondary structure: Regular, repeating shapes stabilized by hydrogen bonds between backbone atoms. Includes the α\alpha helix and β\beta sheet.     3. Tertiary structure: The final 3D3D conformation of a single polypeptide, determined by hydrophobic/ionic interactions, hydrogen bonds, and van der Waals forces.     4. Quaternary structure: Formed when two or more polypeptides (subunits) associate.

  • Cellular Functions of Selected Proteins:     - Cell Shape/Organization: Tubulin forms microtubules.     - Transport: Sodium channels (ion transport); Hemoglobin (oxygen transport).     - Movement: Myosin (muscle contraction).     - Cell Signaling: Insulin (hormone) and the Insulin receptor.     - Cell Surface Recognition: Integrins.     - Enzymes (Metabolism & Synthesis):         - Hexokinase (glycolysis).         - β\beta-Galactosidase (lactose cleavage).         - Glycogen synthetase (glycogen synthesis from glucose).         - RNA polymerase (RNA synthesis).         - DNA polymerase (DNA synthesis).

Structure and Function of Transfer RNA (tRNA)

  • The Adaptor Hypothesis (Francis Crick):     - tRNA plays a direct role in recognizing mRNA codons.     - tRNA functions: 1. Recognize a 3base3-base codon; 2. Carry a specific amino acid.

  • tRNA-mRNA Recognition:     - The anticodon in tRNA binds to a complementary codon in mRNA in an anti-parallel orientation.     - Binding follows the AU/GCAU/GC rule.     - Nomenclature: tRNAPhetRNA^{Phe} carries phenylalanine.

  • Structural Features of tRNA:     - Secondary Structure: Cloverleaf pattern with three stem-loop structures and an acceptor stem.     - 3' Acceptor Stem: Features a single-strand region where a CCA sequence is added by an enzyme for amino acid attachment.     - Modified Nucleotides: tRNA contains bases such as Inosine (I), methylinosine (mI), ribothymidine (T), pseudouridine (P), dihydrouridine (UH2), and dimethylguanosine (m2G).

  • The Wobble Hypothesis (1966):     - Degeneracy typically occurs at the 3rd3^{rd} position of the codon.     - The first two positions pair strictly (AU/GCAU/GC), but the 3rd3^{rd} position can "wobble" or move, allowing mismatches.     - Isoacceptor tRNAs: Different tRNAs that recognize different codons for the same amino acid.

  • Charging (Aminoacylation):     - Aminoacyl-tRNA synthetases: Enzymes responsible for attaching amino acids to tRNAs.     - There are 2020 different synthetases, one for each amino acid.     - Reaction: Amino acid + ATP + tRNA → Charged tRNA (aminoacyl-tRNA) + AMP + Pyrophosphate.     - This is considered the "second genetic code" because accuracy is vital (error rate < 11 in 10,00010,000).

Ribosome Structure and Assembly

  • General Ribosome Features:     - Massive macromolecular complexes of rRNA and proteins.     - Bacterial: One type found in the cytoplasm (70S70S).     - Eukaryotic: Two types — cytoplasmic (80S80S) and organellar (mitochondria/chloroplasts).     - Assembly in eukaryotes occurs in the nucleolus.

  • Ribosomal Composition:     - Bacterial (70S70S):         - Small Subunit (30S30S): 2121 proteins, 16S16S rRNA.         - Large Subunit (50S50S): 3434 proteins, 5S5S and 23S23S rRNA.     - Eukaryotic (80S80S):         - Small Subunit (40S40S): 3333 proteins, 18S18S rRNA.         - Large Subunit (60S60S): 4949 proteins, 5S5S, 5.8S5.8S, and 28S28S rRNA.

  • Functional Sites for Translation:     1. Aminoacyl site (A): Site for incoming charged tRNA.     2. Peptidyl site (P): Site for the tRNA holding the growing polypeptide chain.     3. Exit site (E): Site for uncharged tRNA release.

  • Polyribosome (Polysome): An mRNA transcript bound by multiple ribosomes simultaneously.

The Stages of Translation

  • Stage 1: Initiation:     - Bacterial Initiation:         - Requires three Initiation Factors (IF1, IF2, IF3).         - Shine-Dalgarno sequence: A ribosomal-binding site in mRNA that is complementary to the 16S16S rRNA.         - Initiator tRNA: Specifically tRNAfMettRNA^{fMet}, carrying N-formylmethionine.         - Start codon: Usually AUG, sometimes GUG or UUG.     - Eukaryotic Initiation:         - Requires eIF factors.         - 7-methylguanosine cap at the 55' end is recognized by eIF4, which recruits the ribosome; there is no Shine-Dalgarno sequence.         - Initiator tRNA: tRNAMettRNA^{Met} (carries non-modified methionine).         - Kozak’s Rules for optimal initiation: 1. Must be AUG; 2. Guanine at the +4+4 position; 3. Purine (preferably Adenine) at the 3-3 position.

  • Stage 2: Elongation:     - Amino acids are added sequentially.     - Rates: Bacteria (1515 to 2020 amino acids/sec); Eukaryotes (22 to 66 amino acids/sec).     - Decoding function: The 16S16S rRNA detects mismatches in the A site and prevents elongation until the correct tRNA is bound.     - Peptidyl transfer: The polypeptide is moved from the P site tRNA to the amino acid on the A site tRNA. The 23S23S rRNA acts as the ribozyme (peptidyl transferase).     - Translocation: The ribosome moves to the next codon, shifting tRNAs to the E and P sites.

  • Stage 3: Termination:     - Occurs when a stop codon (UAG, UAA, UGA) is reached. These are recognized by Release Factors (RFs), which mimic tRNA structure.     - Bacterial RFs: RF1 (UAA, UAG); RF2 (UAA, UGA); RF3 (required for process release).     - Eukaryotic RFs: eRF1 (all three stop codons); eRF3 (required for the termination process).

  • Bacterial Coupling:     - Because bacteria lack a nucleus, transcription and translation are coupled (translation begins before transcription is finished).     - In eukaryotes, these processes are physically separated: transcription in the nucleus and translation in the cytosol.