Proteomics

Native vs. Denatured Protein Molecular Weights:

Understanding the difference between the native and denatured state of a protein is fundamental to determining its quaternary structure and biological assembly.

Native Molecular Weight: This refers to the molecular weight of a protein in its biologically active, folded state.
- Proteins in their native state often exist as oligomers (e.g., dimers, tetramers), where multiple polypeptide subunits are held together by non-covalent interactions or disulphide bridges.
- Determining the native molecular weight allows researchers to identify the oligomeric state of the protein.
Denatured Molecular Weight: * In the denatured state, proteins are unfolded into linear polypeptide chains, and any non-covalent quaternary associations are disrupted.
- Denaturation is typically achieved using detergents like Sodium Dodecyl Sulphate (SDS) and reducing agents such as β-mercaptoethanol to break disulphide bonds.
- Comparing the native molecular weight to the denatured molecular weight reveals whether a protein is a monomer, a homomultimer (identical subunits), or a heteromultimer (different subunits).

Methods for Determining Protein Molecular Weights:

To fully characterise a protein, multiple techniques are employed to measure its mass under different conditions.

Gel Filtration (Size Exclusion Chromatography):
- Principle: This technique separates proteins in their native state based on their hydrodynamic volume.
- The stationary phase consists of porous beads. Large proteins are excluded from the pores and elute first (the void volume), while smaller proteins enter the pores, resulting in a longer path and later elution.
- Calibration: By running a set of standard proteins of known molecular weight, a calibration curve can be generated to estimate the native mass of an unknown sample.
SDS-PAGE (Polyacrylamide Gel Electrophoresis):
- Principle: This method determines the molecular weight of denatured polypeptide subunits.
- SDS coats proteins with a uniform negative charge, ensuring the charge-to-mass ratio is constant. Consequently, separation in the polyacrylamide gel is strictly dependent on the size (length) of the chain.
- Comparison: If a protein appears as a 100 kDa peak in gel filtration but shows a single 25 kDa band on SDS-PAGE, it is likely a homotetramer.

Primary Structure and Internal Cleavage:

Determining the primary sequence (the linear order of amino acids) is essential for identifying the protein and understanding its function.

Enzymatic Proteolysis: Large proteins are often too complex for direct sequencing. Instead, they are cleaved into smaller, manageable peptides using specific proteases.
- Trypsin: A widely used serine protease that highly specifically cleaves the peptide bond on the carboxyl side (C-terminus) of Lysine (K) and Arginine (R) residues.
- This predictable cleavage pattern produces a unique set of peptides known as a "peptide map" or "fingerprint".
Peptide Sequencing: * Once cleaved, these peptides can be sequenced to reconstruct the full primary structure of the protein.
- Historically, Edman degradation was the standard, but modern proteomics relies almost exclusively on mass spectrometry.

Mass Spectrometry and Proteomics:

Mass Spectrometry (MS) has revolutionised biochemistry by providing extremely accurate mass measurements and rapid protein identification.

Ionisation Techniques: Proteins and peptides must be converted into gas-phase ions.
- Two primary methods are used:
  - ESI (Electrospray Ionisation):
    - The sample is ionised from a liquid stream, allowing for direct coupling with high-performance liquid chromatography (LC-MS).
  - MALDI (Matrix-Assisted Laser Desorption/Ionisation):
    - The sample is co-crystallised with a matrix and ionised by a laser pulse. It is typically coupled with a TOF (Time of Flight) analyser.
Peptide Mass Fingerprinting (PMF) and Database Searching: The experimental masses of trypsin-cleaved peptides are measured and compared against theoretical masses derived from genomic databases.
- Bioinformatics Tools: Programmes such as BLAST are used to interrogate databases like SWISS-PROT to identify the protein based on these matching sequences.
- Successful identification is achieved when multiple peptide sequences from the sample align perfectly with a database entry