Comprehensive Notes on Proteins: Structure, Function, Motifs, and Prions

Structural Levels of Proteins

  • DNA vs Proteins analogy

    • DNA = software, Proteins = hardware
    • Nucleotides vs amino acids as building blocks
    • Amino acids: 20 common building blocks
    • R groups vary in size, shape, charge, polarity, and water solubility
    • 20 common amino acids and their symbols
  • Amino Acids: Building Blocks

    • General structure of amino acids
    • 20 common amino acids with diverse side chains (R groups)
  • Peptide Bond

    • Formed by dehydration synthesis
    • Allows rotation around the attached atoms, enabling various shapes of the polypeptide
  • Polypeptide Chain

    • Unbranched chain of amino acids (aa)
    • A protein may consist of one or more polypeptide chains

Levels of Protein Structure

  • Primary (1°) Structure

    • Definition: sequence and number of amino acids
    • Covalent bonds in a protein
    • Protein amino acid sequence determines higher-level structures
    • Example reference: Sickle Cell Anemia highlights the consequence of a single amino acid change
  • Secondary (2°) Structure

    • Localized organization of parts of a polypeptide chain
    • Stabilized by hydrogen bonds (H-bonds)
    • Between two peptide bonds
    • Between a peptide bond and a side chain
    • Between two side chains
    • Examples of H-bond interactions
    • a. H-bond between two peptide bonds:
      • backbone NH --- CO group interactions stabilize structure
    • b. H-bond between side chains (e.g., Glu and Ser):
      • example interactions depicted between Glu side chain and Ser side chain
    • c. H-bond between a peptide bond and a side chain
    • Four kinds of 2° structure
    • 1. α-helix: spiral; each turn = 3.6 ext{ aa}; commonly observed; often represented as barrels or rods; stabilized by hydrogen bonds between NH and CO groups of residues; disulfide linkages can play a role in some contexts (S–S)
      • Example: α-keratin in hair
    • 2. β-pleated sheet: planar segments arranged in a sheet; hydrogen bonds form between neighboring chains within the sheet
      • Represented by arrows; contributes to strength and stability in structural proteins (e.g., silk fibroin)
    • 3. β-turn (β-bend): -shaped reversal of the peptide chain; enables compact globular proteins; often small residues (glycine) or proline with built-in bend
      • Common directions include U-shaped turns with glycine and proline
    • 4. Collagen helix: important constituent of connective tissue matrix; stabilized by the association of helices to form a right-handed collagen triple helix
      • Collagen features a characteristic triple-helix assembly
    • Secondary structure in some proteins (e.g., ribonuclease A) illustrated as folded sheets and helices
  • Tertiary (3°) Structure

    • Definition: the overall conformation of a single polypeptide chain
    • Stabilized by hydrophobic interactions and other non-covalent forces
    • Highest level of organization for a single polypeptide
    • General shapes observed:
    • 1) Fibrous proteins: elongated, single-dimension organization (e.g., keratin, collagen)
      • Functions: external protection (hair, feathers, skin, nails), structural support (tendons, cartilage, bone), deeper layers of skin
    • 2) Globular proteins: tightly folded into compact 3D structures
      • More complex than fibrous proteins; includes enzymes, globins
      • Distribution of amino acids
      • Exterior: residues exposed to solvent
      • Interior: hydrophobic residues buried away from solvent
      • Both interior and exterior residues include Pro, Ser, Thr, Cys, Gly, etc.
    • Stabilizing forces for tertiary structure (summary)
    • Hydrogen bonding between R-groups
    • Ionic interactions between oppositely charged R groups
    • Hydrophobic interactions (R groups avoiding water)
    • Covalent cross-linkages (disulfide bridges)
    • Examples of stabilizing interactions depicted:
    • H-bonding, ionic bonds, hydrophobic interactions, disulfide bridges, and other covalent/ionic interactions
  • Quaternary (4°) Structure

    • Definition: number and relationships of sub-units in a protein
    • Example: Functional bacterial CAP (catabolite activator protein) forms a dimer
    • Multimeric assemblies where multiple polypeptide chains associate to form a functional unit

General Principles of Protein Folding

  • 2° structure is determined by short-range sequences of R groups
  • 3° structure is conferred by longer-range aspects of the amino acid sequence
  • Bend direction and angle are determined by the precise location of residues like Pro, Ser, Thr
  • Under biological conditions, most polypeptide chains fold into a single, highly stable conformation

Whole-Protein Architecture: Protein Classes

  • Fibrous proteins
    • Polypeptide chains arranged around a single dimension
    • Examples: Keratin, Collagen
    • Functions: external protection, support and shape, structural components of hair, skin, nails, tendons, cartilage, bone
  • Globular proteins
    • Polypeptide chains folded into a compact 3D structure
    • Examples: Enzymes, globins
    • Amino acid distribution: exterior vs interior residues

Forces Stabilizing Tertiary Structure (Detailed)

  • 1) Hydrogen bonding between R-groups in adjacent loops or regions
  • 2) Ionic interactions between oppositely charged R groups; or between R groups and water or ions
  • 3) Hydrophobic interactions driving burial of nonpolar residues inside
  • 4) Covalent cross-linkages (Disulfide bridges) between cysteine residues
    • Example schematic shows disulfide linkages stabilizing the backbone

Multimeric Proteins: Example of Hemagglutinin

  • Four levels of structure observed in complex multimeric proteins like hemagglutinin
  • Each subunit (HA1 and HA2) contributes to the overall assembly

Biological Functions of Proteins

  • Categories across different functional roles:
    • Structural and mechanical roles: keratin, collagen, elastin
    • Enzymatic roles: enzymes (e.g., proteases, kinases), ribonucleases
    • Transport and storage: lipoproteins, ferritin, myoglobin, hemoglobin
    • Defense and signaling: antibodies, transcription factors, hormones
    • Regulatory and structural roles: actin, tubulin, myosin, various regulatory proteins
    • Gluten and storage proteins: gliadin, ovalbumin, casein
    • Proteoglycans, fibrinogen, snake venom proteins
    • Proteins with multiple conformations and functions: allosteric and regulatory roles

Enzymes, Cofactors, and Active Sites

  • Apoenzyme + Cofactor form a holoenzyme

    • Protein portion: apoenzyme
    • Non-protein portion: cofactor
    • Prosthetic group (e.g., heme in peroxidase, FAD in SDH) or Coenzyme (e.g., FMN, NAD in decarboxylases)
    • Metal ions (e.g., Mg^{2+} in hexokinase) as cofactors
  • Active Site of an Enzyme

    • Substrate binds at the active site
    • Formed by amino acid side chains with two principal roles
    • Contact residue: attracts and orients the substrate
    • Catalytic residue: participates in transient bond formation with substrate; drives catalysis
  • Substrate Specificity and Catalysis

    • Enzymes are highly specific to substrates and reaction types
    • Catalytic changes occur during the reaction cycle
  • Allosteric Enzymes

    • Enzymes exist in alternative conformations with multiple binding sites
    • Active site binds substrate; effector site binds regulatory molecules
    • Binding of effector molecule induces conformational changes that modulate activity
  • Allosteric Regulation and Feedback Inhibition

    • Feedback inhibition: a downstream product inhibits an enzyme early in the pathway
    • Regulation of biosynthetic pathways by effector binding
    • Reversible modulation of enzymatic activity through allosteric effectors
  • Protein-Protein, Protein-RNA, Protein-DNA, and Protein-Drug Interactions

    • Molecular interactions drive recognition and binding between biomolecules
    • Critical for signaling, regulation, gene expression, and pharmacology

Motifs of DNA-binding Proteins (Secondary Structures in DNA-binding domains)

  • Motif: a recognizable 2° structure topology that forms a functional DNA-binding domain
  • Common DNA-binding motifs regulating transcription:

1) Zinc Finger

  • Finger-like projection of ~30 aa
  • Two main types depending on how zinc is coordinated:
    • Cys2-His2 (C2H2) finger
    • Cys4 (C4) finger
  • 2° structure: compact with conserved basic residues; binds major groove of DNA
  • Zinc finger structure features: two β-sheets and one α-helix
  • Binds DNA in the major groove; zinc stabilizes fold

2) Helix-Turn-Helix (HTH)

  • Contains two α-helices and a turn; binds specific sequences in the major groove
  • Stabilized configuration; recognition helix makes base contacts
  • Structure: two α-helices and one β-turn (sometimes described as a recognition helix)

3) Leucine Zipper (bZIP)

  • Leucine every seventh residue on one face forms a dimerization interface
  • Dimerization domain and DNA-binding domain
  • Amphipathic dimerization domain; DNA-binding domain is basic and rich in lysine/arginine residues
  • Function: forms homo- or heterodimers to regulate transcription

4) Helix-Loop-Helix (HLH)

  • Monomer carries a dimerization domain and a DNA-binding domain
  • Dimerization domain enables dimer formation; DNA-binding domain engages DNA

5) Copper Fist (Cobalt Finger? Note: historically described as Copper Fist)

  • Fist-like motif formed around Cu ions which interact with cysteine residues on the protein
  • Knuckles contain basic amino acids that interact with DNA

Prions

  • Prions: small proteinaceous infectious particles (no nucleic acid)

  • Cause neurodegenerative diseases in mammals

  • Term: Prion Diseases (spongiform encephalopathies)

  • Nucleic-acid-free spongiform encephalopathy diseases

  • Examples of prion diseases across species (historical and current context):

    • Sheep: Scrapie (first discovered by Stanley Prusiner, 1892)
    • Transmissible Mink Encephalopathy (TME) in mink
    • Chronic Wasting Disease (CWD) in mule deer, elk
    • Bovine Spongiform Encephalopathy (BSE) in cows
    • Creutzfeldt–Jakob disease (CJD) in humans
    • Gerstmann–Straussler–Scheinker syndrome (GSS)
    • Fatal Familial Insomnia (FFI)
    • Kuru
    • Alpers syndrome in infants
  • Symptoms of Prion Diseases (stages)

    • Early stages: loss of muscle control, personality changes, impaired memory/judgment/thinking, impaired vision, insomnia, depression, dementia
    • Later stages: involuntary muscle jerks, blindness, paralysis, wasting, coma, death (often following pneumonia)
    • Visible end results at autopsy: non-inflammatory lesions, vacuoles, amyloid protein deposits, astrogliosis
  • Kuru

    • First known prion disease (1950s)
    • Geographically isolated tribes in the highlands of New Guinea
    • Transmission believed to be via ingestion of brain tissue during religious rituals
  • Molecular Biology of Prions

    • Prion protein (PrP) exists in different conformations
    • PrP^C (cellular PrP): normal protein; encoded by a single exon of a single-copy gene on chromosome 20; predominantly on neuron surfaces; non-immunogenic and protease-sensitive
    • PrP^Sc (scrapie-associated isoform): modified form that is relatively protease-resistant and accumulates in diseased individuals
    • The infectious process involves conversion of PrP^C to PrP^Sc; the exact mechanism is unknown; may involve conformational change or chemical modification
  • Structural Difference Between PrP^C and PrP^Sc

    • PrP^C: predominantly α-helix structure
    • PrP^Sc: predominantly β-pleated sheet structure
    • This conformational shift underlies prion infectivity and aggregation tendencies
  • Normal cellular PrP (PrP^C) and its chromosomal locus

    • PrP^C encoded on chromosome 20; normal cellular form associated with synaptic function
    • PrP^Sc accumulates in synaptic vesicles and has a different conformational profile
  • Conceptual Summary

    • Prions are protein-only infectious agents that propagate by converting normal PrP to the misfolded PrP^Sc form
    • Conformational changes alter properties and lead to disease progression