Proteins: Structure, Function, and Enzymes (Comprehensive AP Biology Notes)
Proteins: Building Blocks, Structure, and Function
- Proteins are long chains of amino acids (the 20 standard amino acids) that form the basic polymers of life. Each amino acid contains:
- an amino group (-NH₂)
- a carboxyl group (-COOH)
- a variable side chain (R group)
- Amino acids link to neighbors via peptide bonds formed by a dehydration (condensation) reaction:
extAminoacid<em>n+extAminoacid</em>n+1ightarrowextDipeptide+H2O
- A growing chain of amino acids is a polypeptide; when folded and assembled with other polypeptides and non-peptide groups, it becomes a protein.
- Proteins perform diverse cellular roles: enzymes, structural components, regulators of transport across membranes, disease protection, and coordination of signaling pathways.
- Protein structure is organized into four levels: primary, secondary, tertiary, and quaternary.
Primary structure
- Primary structure is the unique sequence of amino acids in a polypeptide.
- A single amino acid change can alter structure and function.
- Example: Sickle cell anemia arises from a single amino acid substitution in the hemoglobin molecule (which has ~574 amino acids in total across the relevant subunits).
- The gene encoding a protein determines its primary sequence; mutations in the coding region can change which amino acids are added during translation.
- After synthesis (translation), many proteins undergo post-translational modifications (cleavage, phosphorylation, addition of prosthetic groups) that can be essential for function.
Secondary structure
- Secondary structure arises from local folding of the polypeptide backbone stabilized by hydrogen bonds.
- The two most common forms are:
- In an α-helix, hydrogen bonds form between the carbonyl oxygen of one amino acid and the amide hydrogen of the amino acid four residues away (i to i+4).
- Each turn of an α-helix consists of approximately nextperturn=3.6amino acids.
- The R groups project outward from the helix; in β-pleated sheets, β-strands align parallel or antiparallel, with hydrogen bonding between backbone atoms.
- These structures contribute to the stability and shape of many globular and fibrous proteins.
Tertiary structure
- The tertiary structure is the overall three-dimensional shape of a single polypeptide.
- Tertiary structure results from interactions among R groups (side chains):
- Hydrophobic (nonpolar) residues tend to cluster in the interior (hydrophobic interactions).
- Hydrogen bonds and ionic (electrostatic) interactions between charged R groups.
- Disulfide linkages (covalent bonds) between cysteine residues, formed in the presence of oxygen.
- The presence and arrangement of these interactions determine the final 3D shape.
- Most enzymes have a tertiary structure.
- If a protein loses its three-dimensional shape, it can lose function (denaturation).
Quaternary structure
- Some proteins are composed of multiple polypeptide subunits; the interactions among these subunits form the quaternary structure.
- Subunit interactions stabilize the overall protein configuration.
- Examples:
- Insulin consists of multiple subunits linked by disulfide bonds and can involve post-translational processing to become functional.
- Silk fibers arise from β-pleated sheet organization across multiple chains.
- Not all proteins have quaternary structure; it depends on whether the protein functions as a single chain or as an assembly of subunits.
Key proteins and concepts highlighted in the course material
- Table 3.1: Types and Functions of Proteins
- Digestive Enzymes: Amylase, lipase, pepsin, trypsin – catalyze breakdown of nutrients into monomers.
- Transport: Hemoglobin, albumin – carry substances in blood/lymph.
- Structural: Actin, tubulin, keratin – form cytoskeleton and structural elements.
- Hormones: Insulin, thyroxine – regulate physiology and metabolism.
- Defense: Immunoglobulins – protect against pathogens.
- Contractile: Actin, myosin – drive muscle contraction.
- Storage: Legume storage proteins, egg white albumin – provide nourishment.
- Globular vs fibrous proteins: color/shape diversity; hemoglobin is globular; collagen is fibrous.
- Protein shape is critical to function; denaturation can disrupt function.
- Examples linking structure to function:
- Hemoglobin: quaternary structure with subunits; mutations can affect oxygen transport.
- Collagen: fibrous protein with extensive hydrogen bonding contributing to tensile strength.
The amino acids that make up proteins
- Amino acids share a common core structure: a central (α) carbon attached to
- an amino group (-NH₂)
- a carboxyl group (-COOH)
- a hydrogen atom
- an R group (side chain) that varies between amino acids.
- The nine essential amino acids are those not synthesized by humans and must be obtained from the diet. Examples include isoleucine and leucine; cysteine is listed as essential in the provided material (note: the exact essential set can vary by species and context).
- The R group determines the chemical nature of the amino acid:
- Nonpolar/hydrophobic: glycine (Gly), valine (Val), methionine (Met), alanine (Ala), etc.
- Polar uncharged: serine (Ser), threonine (Thr), cysteine (Cys), etc.
- Positively charged (basic): lysine (Lys), arginine (Arg).
- Proline (Pro) has a ring-like structure where its R group bonds back to the amino group, an exception that constrains backbone flexibility.
- Amino acids are represented by one-letter or three-letter codes (e.g., Val = V, Valine = Val).
- The chemical nature of the side chain (R group) determines amino acid behavior in proteins (solubility, interactions, location within the molecule, etc.).
- Proline is unique because its R group forms a ring with the amino group, affecting protein folding.
- Amino acids are joined by peptide bonds through a dehydration synthesis reaction:
extAminoacid<em>n+extAminoacid</em>n+1<br/>ightarrowextDipeptide+H2O - The resulting bond is a peptide bond; the growing chain is a polypeptide.
- Each polypeptide has a free amino end (N-terminus) and a free carboxyl end (C-terminus).
- The terms polypeptide and protein are sometimes used interchangeably, but strictly a polypeptide is a polymer of amino acids; a protein may include one or more polypeptides, often with additional non-peptide components and a defined functional conformation.
- The genetic code determines the sequence of amino acids in a protein; hence gene mutations can alter protein structure and function.
- After synthesis, many proteins undergo post-translational modifications that are essential for functionality (e.g., cleavage, phosphorylation, addition of prosthetic groups).
Translation and protein synthesis implications
- Protein synthesis is central to life; proteins serve as energy sources, building blocks for tissue, hormones, enzymes, and transporters.
- The process of protein synthesis is tied to understanding energy use, storage, and regulatory mechanisms in biology.
- Practical example: the enzyme catalase catalyzes the decomposition of hydrogen peroxide into water and oxygen, protecting cells from oxidative damage.
extCatalase:2H<em>2O</em>2<br/>ightarrow2H<em>2O+O</em>2 - Enzymes are highly specific for their substrates, increasing reaction rates without being consumed. They have optimal temperature and pH ranges for activity.
Enzymes: catalysts of biochemical reactions
- Enzymes are typically complex or conjugated proteins.
- Each enzyme is specific for its substrate (the molecule it acts upon).
- Enzymes can catalyze breakdown (catalysis of hydrolysis), synthesis (anabolic reactions), or rearrangement of substrates.
- Enzyme categories:
- Catabolic enzymes: break down substrates into smaller units.
- Anabolic enzymes: build more complex molecules from simpler substrates.
- Catalytic enzymes: influence the rate of reactions (i.e., all enzymes are catalysts).
- Enzymes do not alter the equilibrium of a reaction; they speed up attainment of equilibrium.
- Example: salivary amylase hydrolyzes amylose (a starch component) into smaller sugars during digestion.
Hormones, transport, and other functional roles of proteins
- Hormones are signaling molecules (often small proteins or steroids) that regulate physiological processes, including growth, development, metabolism, and reproduction.
- Insulin is an example of a peptide hormone that helps regulate blood glucose levels.
- Transport proteins like hemoglobin carry substances through the bloodstream; albumin helps maintain osmotic balance and transports various substances.
Protein types and functional categories (Table 3.1 overview)
- Digestive enzymes: Amylase, lipase, pepsin, trypsin – catabolize nutrients into monomeric units.
- Transport proteins: Hemoglobin, albumin – carry substances in blood/lymph.
- Structural proteins: Actin, tubulin, keratin – form cytoskeletal structures.
- Hormones: Insulin, thyroxine – coordinate activity across body systems.
- Defense proteins: Immunoglobulins – protect against foreign pathogens.
- Contractile proteins: Actin, myosin – drive muscle contraction.
- Storage proteins: Legume storage proteins, egg white albumin – provide nourishment.
Protein structure in depth: folding, denaturation, and chaperones
- Protein folding is guided by interactions among R groups, hydrogen bonds, ionic interactions, and hydrophobic effects.
- Denaturation: loss of three-dimensional shape due to changes in temperature, pH, or exposure to chemicals, which can lead to loss of function.
- Denaturation can be reversible if the primary sequence is preserved and the denaturing agent is removed; it can be irreversible (e.g., cooking an egg, where albumin is irreversibly denatured).
- The stomach’s acidic environment can denature proteins during digestion, yet digestive enzymes remain active under these conditions.
- Chaperones (and chaperonins) assist in protein folding by preventing aggregation and facilitating correct folding; they dissociate after folding is complete.
Evolutionary and comparative biology topic: cytochrome c
- Cytochrome c is a component of the electron transport chain in mitochondria and contains a heme prosthetic group. It participates in electron transfer with the iron atom cycling between reduced and oxidized states.
- Protein sequencing shows substantial amino acid sequence homology across species, reflecting evolutionary kinship.
- Human cytochrome c contains 104 amino acids; among sequenced cytochromes:
- 37 amino acids are identical across all samples examined.
- Human vs chimpanzee: identical sequences.
- Human vs rhesus monkey: differs by 1 amino acid.
- Human vs yeast: differs at the 44th position.
- Implications:
- Small differences accumulate over evolutionary time; conserved regions imply important functional/structural roles.
- These comparisons support a common ancestor and reveal relative relatedness among species.
- Exam-style question (based on the data in the material): which species is more closely related to humans? The data indicate chimpanzees are more closely related to humans than rhesus monkeys, given the identical cytochrome c sequence between humans and chimpanzees vs a single amino acid difference with rhesus monkeys.
Specific example: hemoglobin and sickle cell anemia
- Hemoglobin is a tetramer comprised of two alpha and two beta chains, collectively around 600 amino acids in total.
- Each beta chain is about 147 residues long; the entire hemoglobin molecule is encoded by sequences totaling roughly 600 amino acids.
- A single amino acid substitution in hemoglobin can have dramatic physiological consequences:
- In sickle cell anemia, the glutamic acid at position 7 of the beta chain is replaced by valine (Glu → Val) in the β chain.
- This small change causes hemoglobin molecules to polymerize under low-oxygen conditions, forming long fibers that distort red blood cells into a crescent shape, leading to vessel blockages and health problems (breathlessness, dizziness, headaches, abdominal pain).
- The corresponding genetic change is a point mutation: a single base change in the coding region (one base change in the gene can alter one amino acid).
- The mutation rate described is about 1 in 1800 bases.
- The β-chain length and the overall Hb structure illustrate how a tiny change at the molecular level can have large organism-level effects.
- Visual example: normal versus sickled cells show sickled erythrocytes forming under stress, which can be observed in stained blood smears.
Protein synthesis, folding, and functional maturation workflow
- After translation, many proteins undergo post-translational modifications necessary for full functionality (e.g., cleavage, phosphorylation).
- The four levels of protein structure (primary, secondary, tertiary, quaternary) describe progressively complex folding and assembly processes that determine function.
- The ability of proteins to fold correctly is enhanced by chaperones that can prevent misfolding and aggregation during the folding process.
- Proper folding is essential for enzyme active sites to be correctly formed and for substrates to bind with specificity.
Practical and conceptual implications
- Structure-function relationship: A protein’s function is inextricably linked to its 3D shape and chemical properties of its amino acids.
- Mutations in amino acid sequence can alter structure and thus function, with potential health consequences or evolutionary implications.
- Protein misfolding and aggregation are central to many diseases and are a focus of biomedical research (e.g., prion diseases, amyloidoses).
- Understanding protein structure helps in areas such as drug design, biotechnology, and understanding metabolic regulation.
- In AP Biology, these topics connect to Big Idea 4: Biological systems interact and possess complex properties; learning objectives emphasize linking sequence to properties, refining models, and using models to predict functional changes.
Connections to learning objectives and practices
- Essential Knowledge 4.A.1: Subcomponents and sequence determine properties of biological molecules.
- Learning Objective 4.1: Explain how sequence and subcomponents of a biological polymer determine properties.
- Learning Objective 4.2: Refine representations/models to explain how subcomponents determine properties.
- Learning Objective 4.3: Use models to predict and justify how changes in subcomponents affect function.
- Science Practices emphasized: Scale analysis, model refinement, evidence-based claims, and predictive reasoning about natural phenomena.
Key numerical and symbolic references (summary)
- Sickle cell hemoglobin composition and size:
- Hemoglobin roughly 600 amino acids total; each Hb molecule contains two α chains and two β chains.
- The β chain length cited as 147 amino acids.
- Mutation example and rate:
- Point mutation: single base change causing Glu → Val substitution at a specific position in the β chain.
- Mutation rate referenced: approximately 1/1800 bases.
- Structural and bonding details:
- α-helix: ~3.6 amino acids per turn: nextperturn=3.6
- Hydrogen bonding in secondary structures involves backbone atoms (carbonyl O and amide H).
- Disulfide bonds form between cysteine residues: 2R-SH→R-S-S-R+2H++2e−
- Enzymatic reaction example:
- Catalase reaction: 2H<em>2O</em>2→2H<em>2O+O</em>2
Visual and terminology recap
- Primary vs secondary vs tertiary vs quaternary distinctions—structure determines function.
- Globular vs fibrous classifications reflect shape and typical function (e.g., hemoglobin as globular; collagen as fibrous).
- N-terminus vs C-terminus terminology describes the ends of a polypeptide chain.
- Essential amino acids: network of dietary requirements; humans rely on dietary sources for several essential AAs (examples mentioned include isoleucine, leucine, cysteine).
- Post-translational modifications expand functional diversity beyond the primary amino acid sequence.
- Chaperones aid in proper folding and prevent aggregation during protein maturation.
Quick connections to real-world relevance
- A single gene mutation can dramatically impact health (e.g., sickle cell anemia) by altering protein structure and function.
- Protein sequence comparisons across species (cytochrome c) illuminate evolutionary relationships and common ancestry.
- Enzyme function under different temperatures and pH conditions has practical implications for nutrition, medicine, and industrial biotechnology.