Control of Gene Expression in Bacteria and Eukaryotes
Overview of Gene Expression Control in Bacteria
The Operon and its Functional Discovery
Definition of an Operon: An operon is defined as a cluster of structural genes that are transcribed together as a single unit, sharing a common promoter. This organization allows the cell to produce a single molecule containing the coding regions for several related proteins. Usually, these proteins function within the same biochemical pathway.
Historical Context: The operon model was famously proposed by François Jacob and Jacques Monod in . Their discovery was motivated by Jacob’s research on bacteriophage and Monod’s work on -galactosidase in . Along with André Lwoff, they received the Nobel Prize in Physiology or Medicine in .
The Evolutionary Purpose of Operons: Computational biologists Oleg Igoshin and Christian Ray utilized mathematical models to investigate why operons exist in prokaryotes but are rare in eukaryotes. They hypothesized that the operon structure reduces ‐‐biochemical noise‐‐ (random fluctuations in transcription and translation levels). By coordinating the expression of multiple genes, the cell maintains optimal proportions of necessary proteins, which is critical for survival. They speculated that eukaryotic cells do not rely on operons because their larger volume buffers the effects of random molecular fluctuations.
Levels of Gene Regulation
Gene expression is a multi-step process from genotype to phenotype, and regulation can occur at various stages:
Alteration of DNA or Chromatin Structure: Primarily in eukaryotes, modifications such as DNA methylation or changes in chromatin packaging determine DNA accessibility for transcription.
Transcription: This is the primary level of control in bacteria. It is energetically efficient to stop protein synthesis early in the process.
Processing: In eukaryotes, modifications like the cap, poly(A) tail, and intron splicing regulate translation and stability. This is largely absent in prokaryotes.
Stability: The concentration of a protein is influenced by the rate of degradation. Rapid degradation results in lower protein levels.
Translation: Factors such as the availability of amino acids, initiation factors, and enzymes influence the rate of protein synthesis from .
Post-translational Modification: Proteins may be cleaved, trimmed, or chemically modified (e.g., phosphorylation) to become active or degrade.
DNA-Binding Proteins and Motifs
Much of gene regulation is mediated by proteins that physically bind to DNA sequences.
Domains: These are functional regions within regulatory proteins, typically to amino acids in length. Only a small subset of amino acids (including asparagine, glutamine, glycine, lysine, and arginine) make actual contact with the DNA.
Binding Nature: Most DNA-binding proteins bind dynamically to the major groove of the DNA double helix or the sugar-phosphate backbone through hydrogen bonds. This allows other molecules to compete for binding sites.
Characteristic Motifs:
Helix-turn-helix: Common in bacterial regulatory proteins; consists of two alpha helices connected by a turn.
Zinc-finger: Found in eukaryotic proteins; consists of a loop of amino acids with a zinc ion at the base.
Leucine zipper: Features a helix of leucine and a basic arm where two leucines interdigitate; common in eukaryotic transcription factors.
Helix-loop-helix: Found in eukaryotic proteins; two alpha helices separated by an amino acid loop.
Homeodomain: Three alpha helices; found in eukaryotic regulatory proteins.
The Logic of Operon Systems
Basic Structural Components
Structural Genes: Encode proteins involved in metabolism, biosynthesis, or cell structure.
Regulatory Genes: Separate sequences with their own promoters that encode regulatory proteins (activators or repressors). These proteins bind to the operator to control transcription.
Operator: A DNA sequence that overlaps the end of the promoter (and sometimes the end of the first structural gene). It serves as the binding site for the regulator protein.
Constitutive Genes: Genes that are expressed continually and are not regulated; they often encode essential ‐‐housekeeping‐‐ functions.
Four General Types of Transcriptional Control
Negative Inducible: The regulator is an active repressor that binds the operator and blocks transcription. An inducer must bind to the repressor to inactivate it, allowing transcription. This is common for degradative (catabolic) pathways.
Negative Repressible: The regulator is an inactive repressor. To block transcription, a corepressor must bind to the repressor to activate it. This is common for biosynthetic (anabolic) pathways where production should stop if the product is already abundant.
Positive Inducible: Transcription is normally off because the activator is inactive. It must be activated by an inducer to stimulate transcription.
Positive Repressible: The activator is normally bound to DNA, stimulating transcription. It must be inactivated by a molecule to stop transcription.
The lac Operon of Escherichia coli
Lactose Metabolism
(\beta-galactosidase): This enzyme breaks lactose into glucose and galactose. It also converts lactose into allolactose.
(Permease): This protein transports lactose into the bacterial cell.
(Thiogalactoside transacetylase): Its precise role is unclear, but it may detoxify transport-related molecules.
Regulation Mechanism
Coordinate Induction: The simultaneous synthesis of , , and stimulated by an inducer.
Negative Inducible Control: In the absence of lactose, the repressor (a tetramer of four polypeptides) binds to the operator () and blocks RNA polymerase.
Induction by Allolactose: When lactose enters the cell, it is converted to allolactose by a few existing -galactosidase molecules. Allolactose acts as the inducer, binding to the repressor, causing an allosteric change that makes it release the operator.
lac Mutations and Partial Diploids
Jacob and Monod used partial diploids (created by conjugation with an F-plasmid) to determine how mutations affect the system.
Structural-Gene Mutations (, ): Affect individual protein function but not the whole operon's regulation.
Regulator-Gene Mutations (): Constitutive mutations where the repressor is nonfunctional. These are trans-acting, meaning a functional on a plasmid can regulate an operon on the bacterial chromosome.
Super-repressors (): These mutations prevent the inducer from binding to the repressor. They are dominant and keep the operon off even in the presence of lactose.
Operator Mutations (): These are cis-acting. They alter the operator DNA so the repressor cannot bind. They only affect genes on the same DNA molecule.
Promoter Mutations (): Prevent RNA polymerase from binding; they are cis-acting and result in no transcription.
Positive Control and Catabolite Repression
Mechanism: Bacteria prefer glucose over lactose. If both are present, the operon is repressed through catabolite repression.
CAP and cAMP: The Catabolite Activator Protein (CAP) must bind to the promoter to facilitate RNA polymerase binding. However, CAP can only bind if it is complexed with -cyclic adenosine monophosphate ().
Inversion Relationship: The concentration of is inversely proportional to the level of glucose.
High Glucose: Low , no complex, low transcription.
Low Glucose: High , active complex, high transcription (provided lactose is present).
The trp Operon
Classification: Negative repressible operon.
Structure: Consists of five structural genes (, , , , and ) that encode enzymes to convert chorismate into tryptophan.
Regulator (): Encodes an inactive repressor.
Transcription Lock: When tryptophan levels are high, tryptophan behaves as a corepressor, binding and activating the repressor. The active repressor binds the operator and shuts down synthesis.
Control of Gene Expression in Eukaryotes
Human and Chimpanzee Divergence
Genomic Similarity: Humans and chimpanzees share of their DNA ( difference in base pairs, in indels).
Regulatory Evolution: King and Wilson () proposed that phenotypic differences between the species are due to changes in regulatory sequences rather than structural genes.
Transcription Factors: Katja Nowick identified transcription factors (including KRAB-ZFP proteins) that are expressed differently in human vs. chimpanzee brains, influencing complex cognitive traits.
Chromatin Structure Changes
DNase I Hypersensitivity: Active genes are more sensitive to DNase I because their chromatin is relaxed. These sites are often bp upstream of the start site.
Chromatin Remodeling Complexes: Complexes like use hydrolyzed ATP to reposition nucleosomes, exposing DNA for transcription factors.
Histone Modification (The Histone Code):
Methylation: Histone methyltransferases add . is a marker for active promoters.
Acetylation: Acetyltransferase enzymes add groups to lysine residues on histone tails, neutralizing their positive charge and weakening their grip on DNA. This generally stimulates transcription. Deacetylases remove these groups to repress transcription.
Case Study: Arabidopsis Flowering: The gene suppresses flowering. The gene encodes a deacetylase that removes acetyl groups from the chromatin, shutting it off and allowing the plant to flower after cold exposure (vernalization).
DNA Methylation: Methylation of cytosine (forming -methylcytosine) at islands near promoters is associated with long-term gene silencing.
Eukaryotic Transcriptional Regulation
Basal Transcription Apparatus: Comprises RNA polymerase, general transcription factors, and the Mediator complex; binds the core promoter.
Enhancers and Silencers: Regulatory elements that can act at great distances (e.g., ). Enhancers loop modern DNA to contact the promoter. They can be transcribed into .
Insulators: Boundary elements that block the action of an enhancer on a specific promoter if the insulator is placed between them. They help organize Topologically Associated Domains (TADs) using proteins like and cohesin.
Transcriptional Stalling: RNA polymerase may initiate transcription but pause after to nucleotides. Factors like promote stalling, while relieves it via phosphorylation.
Coordinated Regulation: Unlike the operons of bacteria, eukaryotic genes use shared response elements (e.g., , , ) in their promoters to respond to the same stimulus simultaneously.
RNA-Level Regulation and Interference
RNA Processing and Stability
Alternative Splicing: Allows a single pre- to produce different proteins. In sex determination, the protein regulates the splicing of pre-, leading to female-specific development. In males, the absence of leads to a nonfunctional protein and male development.
Degradation: Takes place in bodies. The shortened poly(A) tail causes the removal of the cap, and nucleases degrade the strand.
RNA Interference (RNAi)
Trigger: Double-stranded RNA is cleaved by Dicer into - nucleotide fragments ( or ).
Complexes: These bind proteins to form the RNA-induced silencing complex (RISC) or RNA-induced transcriptional silencing complex (RITS).
Mechanisms:
Cleavage of : Slicer protein cleaves complementary to .
Inhibition of Translation: often block protein synthesis without degrading the .
Transcriptional Silencing: attracts methylating enzymes to the chromatin.
Decay Triggering: Some bind to AU-rich elements in the UTR to trigger Slicer-independent degradation.
RNA Crosstalk: Competing endogenous RNAs () like long noncoding RNAs () or circular RNAs act as ‐‐molecular decoys‐‐ that soak up , preventing them from silencing target .
Translational and Post-translational Control
Initiation Factor Availability: In T lymphocytes, activation by an antigen increases the availability of initiation factors, boosting protein synthesis - to -fold without increasing transcription.
Post-translational Processing: Selective cleavage, trimming, and modifications (acetylation, phosphorylation, ubiquitination) control the final function and stability of proteins.
Questions & Discussion
Physical Proximity in Research: Jacob and Monod collaborated due to working on the same floor. In the modern era, while digital tools exist, physical proximity remains valuable for spontaneous inspiration and brainstorming (though digital collaboration allows global reach).
Constitutive Genes: These are genes expressed at a constant rate regardless of the environment, encoding essential enzymes.
Difference between Structural and Regulatory Genes: Structural genes encode metabolic or structural proteins; regulator genes produce factors (RNA or protein) that control the activity of other genes.
Effect of High Glucose on the lac Operon: High glucose leads to low , preventing from binding to the promoter. Even if lactose is present, transcription is low because RNA polymerase cannot bind efficiently without .
The Difference between Humans and Chimps in Base Pairs: With a human genome of base pairs, a difference ( SNPs + indels) equates to base pair differences.