Notes on Amino Acids, Proteins, and Protein Synthesis

Codons and Start Codons

  • Proteins are made by reading DNA; the reading frame for genes starts at a start codon.
  • Start codon: AUG in mRNA (A-U-G). It signals where the ribosome begins translating the mRNA into a protein.
  • Codons are triplets: each codon consists of 3 nucleotides and specifies one amino acid or a stop signal.
  • There are 64 possible codons (4 nucleotides × 4 × 4 = 64) but only 20 standard amino acids, so multiple codons can code for the same amino acid (degeneracy).
  • The bell question mentioned: three nitrogen bases per codon; AUG as the start codon for every gene.
  • In the context of protein synthesis, reading happens in a 5' to 3' direction, using ribosomes and transfer RNAs (tRNAs) to add amino acids in the order specified by the mRNA.
  • Important connections:
    • DNA provides the template in the nucleus.
    • mRNA carries the code to the ribosome.
    • tRNA brings amino acids to the ribosome according to codons.

Amino Acid Structure

  • There are 20 standard amino acids that make up proteins on Earth.
  • Basic amino acid structure (the same for all amino acids except the side chain):
    • Central carbon (the α-carbon)
    • An amino group (–NH₂)
    • A carboxyl group (–COOH)
    • A hydrogen atom (–H)
    • A side chain (R group) that varies between amino acids
  • The side chain (R group) determines the identities and properties of each amino acid:
    • Can be polar or nonpolar
    • Can be uncharged or carry a positive/negative charge
    • Can be hydrophilic (water-loving) or hydrophobic (water-fearing)
  • Properties of R groups drive how amino acids interact and how proteins fold into their three-dimensional shapes.
  • Dietary sources of amino acids:
    • Humans obtain amino acids from food (meat, dairy, plants). Some plants, like certain trees (e.g., moringa), are noted anecdotally to contain multiple amino acids.
    • The statement from the transcript about the moringa tree is an example used to illustrate dietary amino acids; note that nutritional content can vary and should be evaluated with current data.
  • General note on structure:
    • All amino acids share the same backbone (N–Cα–C) with the distinctive R group attached to the α-carbon.

Peptide Bonds and Condensation Reactions

  • Amino acids link together via peptide bonds to form proteins.
  • A peptide bond forms between the carboxyl group of one amino acid and the amino group of the next amino acid:
    • This is a condensation (dehydration synthesis) reaction, releasing a molecule of water (H₂O).
  • Results:
    • Two amino acids: dipeptide
    • Three or more amino acids: polypeptide
  • Chemical representation (simplified):
    ext{AminoAcid}1 + ext{AminoAcid}2
    ightarrow ext{Dipeptide} + H_2O
  • In protein synthesis, many peptide bonds form sequentially to build long polypeptide chains.

Protein Synthesis and the Central Dogma

  • The process of making proteins from genetic information involves transcription and translation:
    • DNA is transcribed to messenger RNA (mRNA).
    • mRNA is translated by ribosomes into a polypeptide (protein).
  • The central dogma of biology is summarized as:
    ext{DNA}
    ightarrow ext{RNA}
    ightarrow ext{Polypeptide (Protein)}
  • Key players in translation (briefly):
    • Ribosomes read codons (triplets of nucleotides) on mRNA.
    • tRNA brings the corresponding amino acid to the growing polypeptide chain.
    • The ribosome catalyzes peptide bond formation between adjacent amino acids.
  • An example codon discussed: triplet codons are read in groups of three bases to determine which amino acid to add next.

The 20 Amino Acids and Side-Chain Properties

  • The 20 amino acids are universal across life on Earth; the genetic code is effectively universal.
  • The only varying feature among amino acids is the R group (side chain), which determines:
    • Polarity
    • Charge (positive, negative, or neutral)
    • Hydrophilicity vs hydrophobicity
  • Consequence: the arrangement of amino acids and their R groups dictates how a protein folds and functions.
  • Key consequences of misfolding:
    • If an amino acid in the sequence is wrong or in the wrong position, the protein may not fold correctly and lose function.
    • A classic example discussed is sickle cell anemia, caused by a single base change leading to a different amino acid in the protein.
  • Sickle cell anemia example (as described in the transcript):
    • A single-base mutation changes a codon from GAG to GUG in the mRNA (translating to a Glu to Val substitution in the protein).
    • The substitution alters the protein’s folding, causing red blood cells to become sickle-shaped and functionally abnormal.
  • Universality and biodiversity:
    • The same 20 amino acids are used to build proteins in all organisms, but the order and combination of amino acids create vast biodiversity.

Protein Structure: From Primary to Quaternary

  • Primary structure:
    • The linear sequence of amino acids joined by peptide bonds.
    • The order of amino acids determines how the protein will fold.
  • Secondary structure:
    • Localized folding patterns stabilized by hydrogen bonds along the backbone.
    • Common motifs include α-helix and β-pleated sheet.
  • Tertiary structure:
    • Three-dimensional folding of a single polypeptide chain into a compact globular or fibrous shape.
    • Driven by interactions among side chains: hydrophobic interactions, hydrogen bonds, ionic bonds, and sometimes disulfide bridges.
  • Quaternary structure:
    • Interaction and assembly of multiple polypeptide subunits into a functional protein complex.

Fibrous vs Globular Proteins

  • Fibrous proteins:
    • Structure: long, thread-like shapes; often form extended fibers.
    • Functions: structural roles in tissues and connective structures (e.g., collagen, keratin, elastin).
    • Characteristics: typically insoluble in water; repetitive amino acid sequences contribute to strength and durability.
    • Examples discussed: collagen (most abundant protein in the human body), keratin, elastin; collagen provides structural support in skin, bones, tendons, and connective tissue.
  • Globular proteins:
    • Structure: roughly spherical (globular) shapes; functionally diverse.
    • Functions: enzymes, transport proteins, regulatory proteins, signaling, etc.
    • Characteristics: generally water-soluble; sensitive to changes in pH and temperature due to their roles in metabolism.
    • Enzymes as a major class of globular proteins: catalyze biochemical reactions; examples include helicases, amylases, ligases, proteases, etc.
  • Enzymes are typically globular and are central to metabolism; many end with -ase and catalyze specific reactions.

Specific Proteins and Examples

  • Collagen (fibrous):
    • The most abundant protein in the human body.
    • Provides structural support in skin, bones, teeth, and connective tissues.
  • Enzymes (globular):
    • Globular proteins that regulate metabolism and catalyze biochemical reactions.
    • Common enzyme names end in -ase (e.g., amylase, protease, ligase, helicase).
  • Rubisco (Ribulose-1,5-bisphosphate carboxylase/oxygenase):
    • A crucial globular enzyme in photosynthesis responsible for fixing carbon dioxide from the atmosphere.
    • Transcript describes Rubisco as converting atmospheric CO₂ into carbon for plants and later for the organism; note: the canonical reaction is to fix CO₂ into organic molecules during photosynthesis.
    • Correct chemical representation of the primary role:
      ext{CO}_2 + ext{RuBP}
      ightarrow 2 imes 3 ext{-PGA}
      where RuBP is ribulose-1,5-bisphosphate and 3-PGA is 3-phosphoglycerate.
    • Rubin first described as one of the most abundant enzymes on Earth, located in leaf cells and photosynthetic organisms.

The Proteome vs. Genome

  • Genome:
    • The complete set of genetic material present in an organism or cell.
  • Proteome:
    • The entire set of proteins that can be expressed by the genome, including variants produced due to alternative splicing, post-translational modifications, and cellular conditions.
  • Proteome is the collection of proteins that code for and execute biological functions in an organism, contributing to individuality (e.g., skin, eye color, metabolism).
  • Conceptual note:
    • Your proteome is determined by your genome, but expression levels and post-translational modifications add layers of regulation that differentiate individuals.

Practical and Contextual Notes

  • The moringa tree discussion in the transcript is presented as a dietary example of amino acids; while moringa does contain nutrients, verify current nutrient data when needed for coursework or exams.
  • The transcript uses a narrative style with some simplifications and occasional inaccuracies (e.g., specific details about rubisco products). Use standard textbooks for precise biochemical pathways and enzyme names when studying for exams, and treat the rubric in class as the primary guide.

Quick Glossary and Key Points to Remember

  • Codon: an mRNA triplet that encodes one amino acid or a stop signal; example codon for start is AUG.
  • Start codon: AUG
  • Peptide bond: an amide bond formed between amino acids via a condensation reaction, releasing H₂O.
  • Primary structure: the linear sequence of amino acids in a protein.
  • Secondary structure: local folding patterns such as α-helices and β-pleated sheets.
  • Tertiary structure: three-dimensional folding of a single polypeptide.
  • Quaternary structure: assembly of multiple polypeptide subunits.
  • Fibrous proteins: structural, often insoluble, e.g., collagen, keratin.
  • Globular proteins: enzymatic or metabolic roles, typically soluble, e.g., many enzymes.
  • Central dogma: DNA -> RNA -> Polypeptide (protein).
  • Rubisco: a key globular enzyme in photosynthesis that fixes atmospheric CO₂ into organic molecules.
  • Sickle cell anemia: a disease caused by a single base change altering an amino acid in a hemoglobin protein, affecting folding and RBC shape.
  • Proteome: all proteins encoded by the genome; genome vs proteome concept.
  • 20 amino acids: universal building blocks of proteins in life on Earth; diversity arises from order and combinations of these amino acids.