Line length in ancestry diagrams of hemoglobin genes is proportional to DNA sequence divergence, illustrating evolution.
Mutations in DNA can lead to:
Defective proteins causing diseases like cancer (kinases), cystic fibrosis (ion channels), and sickle cell anemia (hemoglobin).
Changes in gene regulatory elements, altering gene expression levels (overexpression, lower expression, or no expression).
Changes in RNA expression, including truncations, missense errors, and splicing errors.
Weak Bonds in Cells
Determine the shape of macromolecules:
The double-stranded helical shape of DNA is maintained by numerous weak hydrogen bonds between complementary base pairs (A-T and G-C).
Enable reversible self-assembly of subunits:
Examples include membrane lipid bilayers and protein polymers like microtubules and actin filaments.
Determine the specificity of molecular interactions:
Crucial for enzyme-substrate specificity and catalysis.
Environmental changes (pH, temperature, ionic strength) affect the strength of weak bonds, leading to denaturation (unfolding or disassembly) of molecules or aggregates.
Multiple weak interactions result in highly specific and tight binding, requiring complementary surfaces.
Types of Noncovalent Interactions
Ionic bonds
Strong attractive forces between positively and negatively charged atoms.
Involve the donation/acceptance of electrons rather than sharing.
Strong in the absence of water but weak in its presence.
Hydrogen bonds
Van der Waals interactions
Weak forces resulting from fluctuations in electron clouds of closely positioned atoms.
Individually weak but significant when two macromolecular surfaces are in close proximity.
Hydrophobic interactions
Water forces non-polar (uncharged) surfaces out of solution to maximize hydrogen bonding among water molecules.
Protein Learning Objectives
Familiarity with amino acids, including their codes and properties.
Understanding the types of interactions that stabilize protein structure and their origins.
Recognizing and understanding the formation of primary (1^\circ), secondary (2^\circ), tertiary (3^\circ), and quaternary (4^\circ) structures.
Explaining the modular nature of proteins and its evolutionary implications.
Understanding how proteins are classified and compared by sequence and/or structure.
Recognizing that proteins have built-in assembly instructions.
Understanding how protein function is defined by its 3-dimensional landscape.
Understanding how and why proteins can be regulated (turned on and off).
Recognizing that proteins often function in networks or multi-molecular complexes.
Appreciating how allosteric effects lead to conformational changes and altered activity.
Shape and Structure of Proteins
The shape of a protein is dictated by its amino acid sequence.
Proteins fold into conformations that minimize energy.
Proteins exhibit a wide variety of complex shapes.
Alpha helices and beta sheets are common folding patterns.
Helices form readily in biological structures.
Beta sheets form rigid core structures in many proteins.
Proteins have multiple levels of organization.
Only a fraction of possible polypeptide chains are useful.
Proteins can be classified into families.
Large protein molecules may consist of multiple polypeptide chains.
Proteins can assemble into filaments, sheets, or spheres.
Some proteins have elongated fibrous shapes.
Extracellular proteins are often stabilized by covalent cross-linkages.
Amino Acids
The 20 amino acids are categorized into four groups based on their side chains.
L isomers are found in proteins.
The alpha carbon is a chiral center in an amino acid.
The 20 amino acids have overlapping properties; small changes can result in big effects.
Degeneracy of the genetic code means that each of the 61 sense codons can mutate in 9 different ways.
134 of the 549 possible changes are synonymous (do not change the amino acid).
The rest are nonsynonymous (change the amino acid).
Primary Structure of Protein
Formed through a condensation reaction joining amino acids.
Linear arrangement of amino acids, written from the N-terminus to the C-terminus.
Amino acid sequence dictates the 3D shape/structure of a protein.
MDLY represents an example sequence.
Peptide bonds have partial double bond character, restricting rotation.
Rotation occurs along the polypeptide backbone.
Ramachandran plots visualize the sterically allowed phi and psi angles.
Ramachandran Plots
Alpha-helix: Phi ~ -57 degrees, Psi ~ -47 degrees
Beta-sheet: Phi ~ -110 to -140 degrees, Psi ~ 110 to 135 degrees
Other Considerations
L isomer is found in proteins.
The alpha carbon is a chiral center in an amino acid.
Alpha helices are almost always right-handed in proteins.
Protein Structure
Secondary structures like alpha helices and beta sheets form the core elements of protein architecture.
Beta sheets form rigid structures often found in the core of proteins.
Protein conformation (shape) is determined by its amino acid sequence.
All types of noncovalent bonds help a protein fold properly.
Multiple weak bonds cooperate to produce a strong bonding arrangement.
Polypeptide chains fold in 3D to maximize weak interactions.
Hydrogen bonds play a major role in holding different regions together.
Proteins can be denatured by chaotropic agents like urea.
Levels of Protein Structure
Primary (1^\circ): Amino acid sequence
Secondary (2^\circ): Alpha helices and beta sheets
Tertiary (3^\circ): Overall folding of a polypeptide chain
Quaternary (4^\circ): Assembled subunits
Similar codons often generate similar amino acids, and mutations in amino acids are usually conservative.
Some amino acids are found more frequently in helices or sheets.
Proteins have various functions: binding, catalysis, switching, and structural roles.
Protein structure is determined by its sequence.
Protein Domains
Protein domains consist of independently folding stretches of amino acids.
A single polypeptide chain can fold into one or more domains.
Linear stretches of approximately 20 amino acids are required to span a membrane, which can help identify integral membrane proteins.
Coiled Coils
Stabilized by the hydrophobic effect and can be amphipathic.
Leucine zippers occur when every 3rd or 4th residue is leucine.
Quaternary Structure
Results from interactions between multiple polypeptide chains.
Protein domains are modular units from which larger proteins are built.
Modularity
Proteins can be made from several domains, like the Src protein with ATP bound.
Different sequences can lead to different architectures, such as cytochrome C, lactic dehydrogenase, and immunoglobulin fold.
Protein Families
Serine proteases share similar active sites, even with different sequences.
Homeodomains are separated by billions of years of evolution.
Sequence databases can be searched using sequence motifs to find related proteins.
Evolutionary Trace of SH2 Domain
Amino acids are colored by proximity to ligand.
Important residues are conserved.
Protein domains are often swapped or shuffled (e.g., EGF domain, calcium-binding domain, kringle).
Many proteins are built from different combinations of domains.
Protein Assemblies
Proteins can assemble into various structures such as phage tobacco mosaic virus and viral capsids (e.g., tomato stunt, polio, SV40, tobacco necrosis).
Collagen is tough and inelastic, while elastin is stretchy, demonstrating that structure reflects function.
Disulfide Bonds
Covalent disulfide bonds stabilize extracellular proteins.
The question of whether insulin can refold properly after reducing disulfides and denaturing the protein is posed.
Antibodies
Have repeated framework domains and special binding domains.
Protein interactions typically require many weak bonds.
Enzyme Kinetics
Keq = 10^{10} = [AB]/[A][B]
A bigger K_m means weaker binding.
Km is the substrate concentration at half the maximal rate, V{max}.
The energy required to reach the transition state limits the reaction rate.
Significance of Km, k{cat}, and k{cat}/Km
Km approximates substrate affinity; lower Km means tighter binding.
k_{cat} is the turnover number for the enzyme.
At low substrate concentrations ([S] << Km), V = (k{cat}/K_m)[E][S].
The ratio k{cat}/Km is equivalent to the rate constant for the reaction between free enzyme and free substrate and measures enzyme effectiveness.
Catalytic Strategies
Includes antibody catalysis and various enzymatic mechanisms involving cofactors like retinal and heme.
Carbamoyl phosphate synthetase is an example of an enzyme employing specific catalytic strategies.
Multi-Enzyme Complexes
Pyruvate dehydrogenase is a multi-enzyme complex.
Protein Networks
Allosteric regulators bind to enzymes and alter their activity by changing the enzyme’s 3D structure.
Regulatory binding sites are separate from the substrate-binding site (active site).
Positive and negative regulation mechanisms exist.
Cooperative binding by multisubunit enzymes enables quicker response to concentration changes.