RA

BIOSCI 202 Genetics: Lectures 11-12 Causes and Effects of Mutations; Lab, Regulation, Transposons (Fill-in-the-Blank Flashcards)

Mechanisms of Mutation

  • Small-scale mutations arise from three main sources:
    • Errors during DNA replication and repair
    • Spontaneous chemical changes of bases that alter pairing
    • Induced chemical changes from external mutagens
  • Diagrammatic idea: mutations become stably incorporated into DNA through replication and repair processes, creating heritable changes.

Replication mutations

  • DNA polymerase fidelity is extremely high, due to proofreading activity; failures during replication can lead to mutations.
  • If proofreading or other fidelity checks fail, replication errors become permanent mutations in the genome.

Repair mutations

  • The cell has multiple DNA damage repair pathways; many repairs occur during replication when DNA is damaged.
  • Repair DNA polymerases often have higher error rates than normal polymerases, so repair processes can be error-prone, paradoxically increasing mutation rates during repair.

Spontaneous base chemical changes

  • Cytosine deamination (spontaneous) is an example:
    • Cytosine can deaminate to uracil, and because uracil pairs with adenine, this causes C⇒T transitions after replication.
  • Methylcytosine deamination (spontaneous):
    • In CpG regions, methylated cytosine deaminates to thymine, leading to C→T transitions over time.
    • Specific enzymes detect and repair G:T mispairs to reduce this mutation type.

Induced Mutations

  • Incorporation of base analogs:
    • Base analogs substitute for natural bases and can cause mispairing; e.g., 2-amino-purine (2-AP) can pair with T or C and act as a mutagen.
  • Establishment of mutations via mispairing after mutagen exposure:
    • 5-bromouracil can pair with A or G; full mutation may require at least two rounds of replication to fix.
  • Alkylating mutagens (EMS, etc.):
    • EMS alkylates bases, causing mispairing (e.g., G pairs with T due to modification).
  • Damage that prevents pairing (DNA adducts):
    • Benzo[a]pyrene from cigarette smoke binds irreversibly to G; replication often results in T rather than G after repair.

Radiation-induced mutation

  • Radiation causes DNA breaks; large-scale mutations (duplications/deletions, inversions, translocations) arise from breaks.
  • Large-scale mutations tend to have larger phenotypic effects, so radiation is a powerful mutagen.

Establishment and fixation of mutations

  • Mutations can be established during replication and replication-associated repair, and then fixed in daughter genomes.

Effects of mutations: overview

  • Mutation effects are context-dependent: mutations in regions critical for function are more likely to affect phenotype; in other regions, effects may be negligible.
  • The more nucleotides a mutation affects, the greater the chance of an effect, but context matters.

Effects of Mutation – 1. Single nucleotide variants (SNVs)

  • SNVs in coding regions can be:
    • Synonymous (silent) if they do not change the encoded amino acid due to redundant genetic code.
    • Nonsynonymous (missense) if they change the amino acid; may be conservative (similar amino acid) or non-conservative (different).
    • Nonsense if a stop codon is created.
  • SNVs arise from redundancy in the genetic code; multiple codons can code for the same amino acid.
  • The chemical properties of amino acids and the 3D structure of proteins are critical for function.

Effects of Mutation – 1. SNVs (continued)

  • The amino acid substitution can be conservative or non-conservative, affecting protein folding, stability, or activity.
  • Nonsense mutations truncate proteins, often severely affecting function.

Effects of Mutation – 2. Indels (insertions/deletions)

  • Indels can cause frameshift mutations if they occur in coding regions, altering the reading frame and downstream amino acids.
  • Indels must occur in coding regions to affect protein sequence; some indels may occur in non-coding regions with no direct protein effect.
  • Question: Is it possible to get indels that do not cause frameshifts? (Yes, if the indel is in multiples of 3 nt, preserving the reading frame.)

Effects of Mutation – 3. Regulatory mutations

  • Mutations in regulatory regions (e.g., promoters) can alter transcription levels.
  • Example: promoter mutations affecting the β-globin gene promoter (CACCC/CACACCC/ATAAA motifs) can reduce transcription.
  • Regulatory mutations influence gene expression levels rather than protein sequence.

Effects of Mutation – summary: context matters

  • Effects are generally context-dependent: a mutation in a functionally important region is more likely to affect phenotype.
  • More nucleotides affected by a mutation increases the chance of an effect, but context remains decisive.

Continuum of mutation effects

  • Mutations span a continuum: neutral, deleterious, and beneficial.
  • Neutral mutations do not affect organismal function; deleterious reduce fitness; beneficial improve function, though beneficial mutations are relatively rare.
  • Bank et al., 2014 note this continuum in Trends in Genetics (2014) 30:540–6.

Conservation as a clue to function

  • DNA conservation across species can indicate functional regions.
  • Highly conserved regions are likely functionally important; mutations there are often deleterious and not observed in populations.
  • Kuhn et al., 2012 discuss conservation in Briefings in Bioinformatics 14:144–61.

Beneficial mutations in populations

  • Beneficial mutations can sweep through populations to fixation; demonstrates that new advantageous variants can arise and spread.
  • A video example illustrates these dynamics; real-world data show resistance mutations can preexist and spread under selection.

Origin of variation

  • A key question: Are beneficial mutations typically induced by environmental stress or do they occur by chance (pre-existing variation)?
  • The fluctuation test (Luria & Delbrück, 1943) tested whether resistance mutations arise in response to phage exposure or pre-exist.

Fluctuation test to decide the origin of mutations

  • Classic experiment used E. coli with resistance to bacteriophage T1.
  • Predictions:
    • If mutations are induced by phage exposure, resistant colonies should appear with similar numbers across all tubes (same number of bacteria, same phage pressure, same mutation rate).
    • If mutations are pre-existing, the number of resistant colonies should vary considerably between tubes; earlier mutations yield more resistant descendants.
  • Results: The data do not fit a Poisson distribution expected if mutations were induced; instead, they show substantial variation consistent with pre-existing (spontaneous) mutations.
  • Conclusion: Most mutations are spontaneous and resistance mutations are pre-existing before exposure.

Modern genome sequencing and standing variation

  • Modern sequencing reveals substantial standing variation; humans average ~20 million base differences from the reference genome, with ~100–200 de novo mutations per individual.
  • Most genetic diseases are caused by pre-existing variants rather than new mutations.

Summary of learning objectives (mutations)

  • Describe mechanisms that can cause mutations.
  • Diagram how mutations can be stably incorporated into DNA.
  • Define synonymous, non-synonymous, and frameshift mutations.
  • Predict effects of different mutations in various genomic contexts with justification.
  • Explain the continuum of mutation effects from deleterious to neutral to beneficial.
  • Explain how mutation location informs functional importance of genome regions.
  • Explain the fluctuation test and how it demonstrates pre-existing mutations in populations.

Laboratory: One-step growth experiment (phage biology)

  • Aim: Monitor the infection cycle of bacterial viruses (phages).
  • Phage structure: Head contains viral DNA; tail for attachment; injects DNA and hijacks host machinery to replicate and lyse the cell.
  • Life cycle steps: Phage attaches to bacterium, injects DNA, host machinery makes phage components, phage particles assemble, host cells lyse releasing phage.
  • Time-dependent data: Latent period, burst size, and plaque-forming units (Pfus) are measured at various time points.
  • Synchronizing infection: Use cyanide (KCN) to temporarily stop bacterial growth, infect non-growing cells, then relieve inhibition to synchronize infection.
  • Workflow: Involves growth tubes, dilutions, and plaque assays to quantify phage at different times.
  • Plaque assay: Mix infected bacteria with indicator bacteria on soft agar and count plaques at different time points.
  • Measurements: Relative phage titre over time, latent period, and burst size derived from data.

Regulation of Gene Expression in Bacteria

  • Gene regulation is critical and can be achieved via multiple mechanisms, including transcriptional control, operons, and global regulation.
  • Transcriptional control includes the Lac operon, the Arabinose operon, negative vs positive control, cis vs trans effects, and the logic of gene regulation.
  • Global regulation includes sigma factors and consensus sequences; control of large gene networks via shared regulatory proteins.

Promoters and transcription initiation

  • Promoters in E. coli typically have two key elements bound by RNA polymerase to initiate transcription:
    -35 region: TTGACA; spacer; -10 region: TATAAT; spacer; start site N17; N7 positions before transcription start site.
  • Strong promoters bind RNA polymerase more effectively; weak promoters bind less well and transcribe more slowly.
  • Proper spacing between -35 and -10 is crucial; optimal spacing allows RNA polymerase to contact both regions.
  • Example promoter strength: recA promoter (strong) vs araBAD promoter (weak).

Genetic switches: regulators of transcription

  • Promoter determines where transcription begins; Operator is the repressor binding site; Activator binding site is for activators.
  • Activators/repressors respond to environmental cues; two key sites in proteins: DNA-binding site and allosteric site.

The Lac operon (negative regulation with catabolite repression)

  • Structure: lacI (repressor) controlling lacZ (β-galactosidase), lacY (permease), lacA (transacetylase).
  • In absence of lactose: repressor binds operator, transcription off.
  • In presence of lactose (allolactose as the actual inducer): repressor binding to operator is alleviated, enabling transcription.
  • Discovery: Jacob & Monod showed that adding lactose induces a large increase in β-galactosidase expression.
  • Mutations:
    • Oc mutations: constitutive expression; repressor cannot bind operator (cis-acting).
    • lacI- mutations: defective repressor cannot bind operator; operon is always on (trans-acting for the operator in the chromosome).
    • I+ and I- interactions show cis vs trans effects; lacI+ can repress in trans if present with a wild-type background.

Catabolite repression of lac operon

  • Expression requires two signals: lactose present (derepresses repressor) and glucose absent (positive regulation via CAP-cAMP).
  • cAMP level inversely related to glucose; CAP-cAMP complex enhances RNA polymerase binding to the promoter, increasing transcription when glucose is scarce.
  • Mechanism: CAP-cAMP bends DNA and assists RNA polymerase binding to -10/-35 sites, elevating transcription.

Regulation of transcription: global control and sigma factors

  • Sigma factors direct RNA polymerase to specific promoter sequences; multiple sigma factors exist to regulate sets of genes under different conditions.
  • E. coli primary sigma factor is sigma 70; alternative sigma factors regulate stress responses, sporulation, etc.
  • Concept: a single sigma factor can govern many genes; switching sigma factor expression reprograms entire gene networks.

Promoter consensus sequences

  • Consensus sequences reflect the most common nucleotides at each position across promoters; used to predict promoter strength.
  • Example: -35 box consensus: TNATATT (some positions variable); -10 box consensus: TATAAT (typical core).
  • A site that closely matches the consensus tends to have stronger transcription initiation.

The Arabinose operon: dual positive and negative control

  • AraC acts as an activator (positive control) and, in the absence of arabinose, functions as a repressor via DNA looping that prevents RNA polymerase access.
  • In the presence of arabinose, AraC changes conformation and promotes transcription, in part by recruiting Mediator and looping DNA to bring transcription machinery to the promoter.
  • Additional positive control via CAP-cAMP; glucose absence increases cAMP and CAP activity.

Three-dimensional genome organization and chromatin (contrast with bacteria)

  • Eukaryotes rely on chromatin structure to regulate transcription.
  • DNA is packaged into chromatin; access to DNA is regulated by nucleosome positioning and histone modifications.
  • Transcriptional default state in bacteria is on; in eukaryotes, it is off due to nucleosome occupancy blocking access.
  • Enhancers and promoters function in 3D space; Mediator complexes and looping bring distant regulatory elements into proximity with promoters.

Chromatin and epigenetic regulation

  • Euchromatin (active) vs heterochromatin (silenced): chromatin remodeling and histone modifications govern accessibility.
  • Histone tails can be post-translationally modified (e.g., methylation, acetylation) to activate or repress transcription.
  • Activating marks: histone acetylation (e.g., H3K9ac, H3K27ac) and certain methylations (e.g., H3K4me3, H3K36me3, H3K79me).
  • Repressive marks: H3K9me, H3K27me, H4K20me, HP1-associated heterochromatin.
  • Chromatin states include active euchromatin, poised euchromatin, facultative heterochromatin, constitutive heterochromatin.
  • Boundary/insulator elements can block spread of heterochromatin.

Position effect variegation (PEV)

  • Introduced genes may be variably expressed depending on chromatin context (heterochromatin vs euchromatin) where the gene resides.
  • Classic Drosophila example: white gene inversion leads to variegated expression due to heterochromatin spreading.
  • Indicates importance of chromatin state in regulating gene expression.

Three-dimensional genome organization in nucleus

  • Genome organized into 3D territories and loops; topologically associating domains (TADs) group genes with similar expression patterns.
  • 3D organization is a key layer of gene regulation beyond linear DNA sequence.

GAL system in yeast: a case study in eukaryotic transcriptional control

  • GAL genes (GAL1, GAL2, GAL7, GAL10) are not in an operon in Saccharomyces cerevisiae.
  • GAL4 is a regulatory transcription factor; GAL4 binds UASG (upstream activating sequences) to promote transcription when galactose is present and glucose is absent.
  • GAL80 acts as a repressor that binds Gal4 in the absence of galactose; when galactose is present, GAL3 binds galactose, sequestering GAL80 away from Gal4 and enabling transcription.
  • Gal4 can recruit Mediator and preinitiation complex to initiate transcription via looping.
  • Yeast is diploid; mutations can be analyzed in heterozygous and homozygous backgrounds.

Gal4 as a tool for synthetic gene control

  • Gal4 can be used as a DNA-binding domain to drive transcription of genes linked to UAS sequences.
  • Swapping Gal4’s DNA-binding domain with other domains allows transcriptional control of different target genes.

Transcriptional regulation and chromatin interaction: GAL activation and chromatin

  • Gal4 activation involves chromatin modifiers that unwind DNA from nucleosomes, enabling RNA polymerase II binding and transcription initiation.
  • This demonstrates that transcriptional activation can occur via changes in chromatin structure, not just promoter sequence.

Regulation of Gene Expression in Eukaryotes

  • Similarities with prokaryotes include promoter-based control and transcription factors; but eukaryotes differ in operon absence, intron processing, and nuclear separation of transcription/translation.
  • Transcriptional regulation in eukaryotes heavily involves promoters, enhancers, silencers, and three-dimensional genome organization.
  • The GAL system is a canonical example illustrating enhancer-mediated control and chromatin-wide regulation.

Epigenetic and post-transcriptional regulation

  • DNA methylation: methylation of CG dinucleotides in promoters typically represses transcription and can be inherited through DNA replication.
  • RNA-based regulation: small RNAs (siRNA, miRNA) guide RISC to target mRNAs, leading to degradation or translational repression.
  • Alternative splicing: many genes have introns; splicing can produce multiple isoforms, contributing to protein diversity and regulation across tissues.

Small RNA regulation of gene expression

  • siRNA/miRNA pathway:
    • Long dsRNA is processed by Dicer into short dsRNA fragments (20–24 nt).
    • RISC binds one strand and pairs with complementary mRNA, resulting in degradation (siRNA) or translational repression (miRNA).
  • Let-7 and miR-34 examples show roles in differentiation and repression of pluripotency factors.

Post-transcriptional control and translation

  • mRNA stability affects protein output; untranslated regions and regulatory elements influence half-life.
  • Post-translational control includes protein stability and modifications that affect activity, localization, or interactions.

Transposons (Transposable Elements, TEs)

  • Barbara McClintock discovered transposons in maize; later work showed transposons are widespread across life.
  • Transposons are DNA sequences that catalyze their own movement within genomes via transposases.
  • They can be classified by mechanism and copy number:
    • Class I: retrotransposons — transpose via an RNA intermediate (transcribed, reverse transcribed, and inserted back into genome).
    • Class II: DNA transposons — move via a DNA intermediate; can be conservative (cut-and-paste) or replicative (copy-and-paste).
  • Autonomous transposons encode transposase and can transpose themselves; non-autonomous rely on transposases from autonomous elements.
  • Inverted repeats at termini are important for transposase recognition; direct repeats are created as target-site duplications upon insertion.
  • Transposons can cause mutations by inserting into genes or regulatory regions; they can also rearrange the genome through recombination between transposon copies.

Transposon types and features

  • In bacteria: insertion sequences (IS elements) are simple transposons; composite transposons contain genes flanked by IS elements (e.g., Tn1, Tn5, Tn10, Tn3).
  • Target site duplications: insertions generate short direct repeats flanking the inserted element due to staggered cuts and resynthesis.
  • Inverted repeats (IRs) are recognition sites for transposases; transposases mediate excision and insertion.
  • Class I vs Class II distinction reflects transposition mechanism and requirement for RNA intermediates (Class I) vs DNA transposition (Class II).

Transposons in the human genome

  • Retrotransposons dominate the human genome:
    • LINEs (Long Interspersed Nuclear Elements) — autonomous; encode reverse transcriptase; >21% of the genome.
    • SINEs (Short Interspersed Nuclear Elements) — non-autonomous; rely on LINE machinery; >13% of the genome; most derive from tRNA genes; Alu is the most abundant SINE.
    • LTRs (Long Terminal Repeat retrotransposons) — related to retroviruses; make up >8% of the genome; some are endogenous retroviruses (ERVs).
  • Alu elements are a major SINE family derived from 7SL RNA gene; over a million copies.
  • ERVs and other retrotransposons contribute to genome evolution and regulation of host genes, including placental genes and early development programs.

Transposon activity and host suppression

  • Transposons are often silenced by the host genome via heterochromatin formation, DNA methylation, and small RNA silencing pathways to limit movement.
  • Transposition rates are typically low in humans (e.g., about one LINE transposition per 95–270 live births).
  • In mammals, stem cells and early development can see transient upregulation of transposon transcription, the reasons for which are still debated.
  • Genome defense mechanisms include DNA methylation, histone modifications, heterochromatin formation, and small RNA pathways that target transposons for silencing.

Transposons as drivers of evolution and genome structure

  • Transposons can move genes between plasmids and genomes, reshaping gene content and regulatory landscapes.
  • They can facilitate antibiotic resistance gene spread in bacteria via mobilization and horizontal transfer.
  • Transposons contribute to regulatory innovation by providing promoters, enhancers, and other regulatory elements.
  • The genome contains many transposon remnants; current activity is limited by host repression, but regulatory co-option can benefit host organisms.

Key terms and concepts (quick glossary)

  • Synonymous (silent) mutation: a nucleotide change that does not alter the amino acid encoded.
  • Nonsynonymous (missense) mutation: a nucleotide change that alters the amino acid; may be conservative or non-conservative.
  • Frameshift mutation: insertion/deletion not in multiples of 3 that shifts the reading frame.
  • Promoter: DNA region where RNA polymerase binds to initiate transcription.
  • Operator: DNA sequence bound by a repressor to regulate transcription.
  • Activator: protein that increases transcription by aiding RNA polymerase binding.
  • Cis-acting element: DNA sequences that influence regulation on the same molecule (e.g., Oc).
  • Trans-acting factor: diffusible proteins (e.g., LacI) that regulate targets other than their own gene.
  • CAP-cAMP: regulatory complex that activates transcription under glucose-poor conditions.
  • Enhancer: distal regulatory DNA sequence that increases transcription via chromatin looping.
  • Silencer: DNA element that represses transcription.
  • TADs: Topologically associating domains; higher-order genome organization influences gene expression.
  • Epigenetics: heritable changes in gene expression not due to changes in DNA sequence (e.g., DNA methylation, histone modifications).
  • siRNA/miRNA: small RNAs guiding RISC to degrade or repress target mRNA.
  • Spliceosome: complex that removes introns from pre-mRNA in eukaryotes, enabling alternative splicing.
  • Bursts, latent period, and Pfus (plaque-forming units): key phage infection metrics in one-step growth experiments.
  • Recombination frequency: ext{RF} = rac{N{ ext{recombinant}}}{N{ ext{total}}}; used to map distances in phage genetics.

Important formulas and quantitative notes

  • Recombination frequency in phage genetics: ext{RF} = rac{N{ ext{recombinant}}}{N{ ext{total (progeny)}}}
  • Burst size (phage biology): approximately the number of phage produced per infected bacterial cell; can be estimated as
    ext{Burst size} \, ext{≈} \, rac{ ext{Pfus produced after lysis}}{ ext{Number of infected cells}}
  • Direct repeats and target site duplications upon transposon insertion:
    • Insertion creates short direct repeats flanking the transposon (typically 3–30 bp): ext{Direct repeat length} \sim 3 ext{–}30\,\text{bp}
  • Transposon insertion mechanics:
    • Target site duplication results from staggered cuts at the insertion site followed by repair synthesis.
    • Distinguish direct repeats from inverted repeats: direct repeats flank the inserted element; inverted repeats are at the termini of the transposon and are recognition sites for transposases.
  • Consensus sequences (for promoter elements):
    • -35 box consensus example: ext{TNATATT} (positions with variability indicated by N)
    • -10 box consensus example: often around ext{TATAAT} or related sequences; spacing is crucial for RNA polymerase binding.

Connections to foundational principles and real-world relevance

  • Mutation mechanisms illuminate why genomes contain a spectrum of variation, from neutral to deleterious to beneficial, shaping evolution and adaptation.
  • The fluctuation test demonstrates that many mutations are pre-existing, informing how populations adapt to pathogens and environmental changes.
  • Transposons illustrate genome dynamism: they can drive evolution by reshaping gene regulation and creating genetic diversity, while host defense mechanisms keep their activity in check.
  • Epigenetic and post-transcriptional regulation add layers of control beyond DNA sequence, enabling cell-type–specific expression patterns and development.
  • Understanding operons, transcription factors, and chromatin dynamics underpins modern biotechnology, synthetic biology, and medical genetics (e.g., gene therapy strategies, CRISPR-based regulation, and epigenetic therapies).

Practical implications and ethical considerations

  • Epigenetic inheritance and DNA methylation raise questions about heritability of expression states beyond the DNA sequence.
  • Transposon activity and genome editing carry implications for genome stability, cancer biology, and long-term evolutionary trajectories.
  • Use of small RNAs as gene-silencing tools offers powerful research and therapeutic potential, accompanied by safety and off-target concerns.
  • The knowledge of bacterial regulation (lac operon) informs synthetic biology designs, metabolic engineering, and antibiotic resistance monitoring.

Quick reference guide (study-ready bullets)

  • Three sources of mutation: replication/repair errors, spontaneous base changes, induced mutagens.
  • Spontaneous base changes: cytosine deamination (C→U → potential C→T), methylcytosine deamination (mC→T, especially in CpG sites).
  • Induced mutagens: base analogs (e.g., 2-AP), alkylating agents (EMS), carcinogens (benzo[a]pyrene) requiring repair (often G→T substitutions).
  • Radiation induces large-scale changes; point mutations vs large-scale events differ in phenotypic impact.
  • SNVs: synonymous vs nonsynonymous (conservative vs non-conservative) vs nonsense; frameshifts from indels.
  • Regulatory mutations alter transcription levels; examples include promoter variants in the β-globin gene.
  • Continuum of mutation effects: neutral, deleterious, beneficial; most beneficial mutations are rare.
  • Conservation as a functional proxy; highly conserved regions likely functionally important.
  • Fluctuation test: pre-existing mutations vs induced mutations; supports spontaneous variation.
  • Lac operon: negative regulation by LacI; catabolite repression by CAP-cAMP; glucose presence lowers cAMP, reducing CAP activity.
  • Arabinose operon: dual positive and negative control via AraC, AraI/AraO, and 3D genome interactions; galactose condition changes regulate transcription.
  • Eukaryotic regulation: promoters, enhancers, and the 3D genome architecture (TADs); chromatin states govern accessibility; histone modifications and DNA methylation modulate transcription.
  • GAL system as a case study of enhancer-based regulation and chromatin remodeling in yeast.
  • Transposons: IS elements, composite transposons, autonomous vs non-autonomous; class I vs class II; effects include insertions, rearrangements, and gene mobility.
  • Human genome TE landscape: LINEs, SINEs (Alu), and LTRs/ERVs; transposons shape genome structure and regulation while being tightly repressed by host defenses.
  • Recombination in phage genetics uses a frequency-based approach to map distances; complementation tests help determine gene relationships.