DNA Bases, Chargaff's Rules, and Related Concepts Notes

Recognizing DNA bases

  • Base identity and recognition in images/pictures:
    • Cytosine:
    • Pyrimidine (one ring) and contains an amine group (–NH₂).
    • Recognized by the amine group.
    • Cytosine can undergo deamination to form uracil, which is a major form of DNA damage in cells.
      • Deamination reaction: extC<br/>ightarrowextUext(deamination)ext{C} <br /> ightarrow ext{U} ext{ (deamination)}
    • Uracil vs thymine:
    • Uracil lacks the methyl group that thymine has.
    • Thymine has a methyl group (CH₃) on the pyrimidine ring; this difference helps distinguish RNA (which uses uracil) from DNA (which uses thymine).
  • Purines vs pyrimidines (basic structural rule):
    • Purines: Adenine (A) and Guanine (G) → two-ring structures.
    • Pyrimidines: Cytosine (C), Thymine (T), and Uracil (U) → one-ring structures.
    • In images: Guanine often shows an oxygen-containing group; adenine does not have that distinctive feature.
  • Double-checking key recognitions:
    • Thymine: methyl group (–CH₃) present.
    • Cytosine: amine group (–NH₂) present, one-ring pyrimidine.
    • Guanine vs Adenine: Guanine is a purine with an additional ring and an oxygen-containing feature; Adenine is a purine without that particular oxygen feature.
    • Uracil vs thymine: Uracil (RNA) has no methyl group; thymine (DNA) has a methyl group.
  • Practical takeaway for quick tests (DNA jeopardy style):
    • Cytosine = one ring + amine group;
    • Uracil = cytosine deaminated, no methyl group;
    • Guanine = purine with additional ring and characteristic functional groups;
    • Adenine = purine without the thymine-like methyl group.

Chargaff's rules (concepts and implications)

  • Base pair equivalences in double-stranded DNA:
    • A pairs with T; G pairs with C.
    • This means the two strands are complementary and base-paired, maintaining consistent amounts on each strand.
  • Content balance in duplex DNA:
    • Total purines = total pyrimidines, each contributing ~50% of the DNA composition:
    • Purines: [A]+[G]50%[A] + [G] \approx 50\%
    • Pyrimidines: [C]+[T]50%[C] + [T] \approx 50\%
  • Individual base pair balances:
    • [A][T][A] \approx [T] and [G][C][G] \approx [C] in double-stranded DNA.
  • AT vs GC ratios and variability:
    • The ratio of AT base pairs to GC base pairs is not necessarily 1 across species or sequences.
    • Chargaff’s rule guarantees A ≈ T and G ≈ C, and overall purines ≈ pyrimidines, but the relative number of AT vs GC base pairs can vary widely depending on GC content of the genome.
  • Numerical illustration (example from lecture):
    • Suppose A = 10%, T = 10% → AT = 20%
    • Since A pairs with T, T = 10%; this forces C and G to make up the remaining 80% with C = G = 40% each.
    • This demonstrates how A=T and G=C hold, while AT vs GC can differ from 1:1 in terms of base-pair counts.
  • Exam-style note on Chargaff's rule:
    • When asked which statement explains Chargaff’s rule, the correct general idea is that purines complement pyrimidines in base pairing (A with T, G with C) and the total purine content equals total pyrimidine content (roughly 50/50).
    • The question may involve constructing or canceling percentages to show A=T and G=C, or AT vs GC not necessarily equaling 1.
  • Quick takeaway:
    • In dsDNA: [A][T], [G][C], [A]+[G][C]+[T]50%.[A] \approx [T], \ [G] \approx [C], \ [A]+[G] \approx [C]+[T] \approx 50\%.
    • The AT/GC ratio is genome- and sequence-dependent and is not constrained to be 1.

Chargaff's rule applications in exam-style questions

  • Practice reasoning about a given option set:
    • A) (Rule check) For dsDNA, adding up all purines versus all pyrimidines should be equal. If the statement aligns with A≡T and G≡C plus purine/pyrimidine parity, it’s true.
    • B) (Rule check) Mixing purines with pyrimidines on each side can still be reconciled by pairing; the equal-sum property holds when considering purines vs pyrimidines across the entire molecule.
    • D) (Correct choice in lecture) In double-stranded DNA, purines base-pair with pyrimidines such that the total purines equal the total pyrimidines; A pairs with T and G pairs with C, making the two sides balanced. This is the statement the instructor indicated as the right answer in that context.
  • Conceptual takeaway for solving similar questions:
    • Always check: Are purines balanced with pyrimidines? Is A ≈ T and G ≈ C? Do the two halves (or two sides) of the duplex balance when you sum nucleotides?
    • If a question asks about AT vs GC ratios, remember that those can vary even though A≈T and G≈C hold overall.

Viruses and nucleic acid types (DNA vs RNA, ss vs ds)

  • German measles (rubella) virus properties:
    • The genome can be DNA or RNA; it can be single-stranded (ss) or double-stranded (ds) depending on the virus.
  • Diagnostic clues from base composition:
    • Presence of uracil (U) and absence of thymine (T) indicate RNA genomes, since RNA uses U instead of T.
    • If uracil is present and thymine is absent, the genetic material is RNA.
  • Applying Chargaff’s rule to viruses:
    • For RNA viruses that are single-stranded, the base-pairing rule A≈T, G≈C does not apply to the genome as a whole in the same way as dsDNA does; the ratio is not constrained to be 1.
    • Therefore, the RNA virus cannot be dsDNA and cannot have a 1:1 AT:GC balance in the same sense as dsDNA.

Melting temperature of double-stranded DNA (Tm)

  • Definition:
    • The melting temperature (Tm) is the temperature at which 50% of dsDNA becomes single-stranded.
  • What determines Tm:
    • Hydrogen bonds between base pairs: GC pairs have 3 H-bonds; AT pairs have 2 H-bonds. More hydrogen bonds generally raise Tm.
    • Length of the DNA molecule: Longer sequences typically have higher Tm due to more total bonds, but there are nuances.
    • GC content: Higher GC content increases Tm because GC pairs contribute more H-bonds per base pair.
  • The two plus four rule of thumb (for short sequences):
    • Formula: Tm=2×(A+T)+4×(G+C)[C]Tm = 2 \times (A+T) + 4 \times (G+C) \quad [^{\circ}\mathrm{C}]
    • Note: The counts (A+T) and (G+C) refer to the number of bases in one strand (the two strands share hydrogen bonds, so one strand is enough for counting).
    • Rationale: A–T pairs contribute 2 H-bonds each; G–C pairs contribute 3 H-bonds each, but the rule uses 4 for GC to approximate the stabilizing effect.
  • Practical example from the lecture:
    • Given a sequence where the count of A+T is 9 and G+C is 6:
    • Tm=2×9+4×6=18+24=42CTm = 2 \times 9 + 4 \times 6 = 18 + 24 = 42^{\circ}\mathrm{C}
    • Units are in degrees Celsius; calculators on the exam can be used.
  • Special notes for primer design:
    • The two plus four rule is a heuristic used by many primer-design algorithms, especially in PCR primer design.
    • Real-world Tm calculations for longer sequences may require more thermodynamic models beyond this simple rule.

Major and minor grooves of DNA

  • Structural concept:
    • When looking at a base pair from one strand’s perspective, the major groove is the larger spacing created by the glycosidic bonds across the backbones; the minor groove is the smaller gap.
  • Why grooves matter:
    • The major groove contains more information about base sequences, enabling many proteins (e.g., transcription factors) to recognize specific DNA sequences by reading base-pair steps.
    • Not every protein uses the major groove; some, like TATA-binding protein, recognizes the minor groove and bends DNA there, because the minor groove is more bendable.
  • Quick geometry note:
    • The major groove is formed by the distance between glycosidic bonds on opposite sides of the base pair; the minor groove is the smaller distance around the backbone.

Practice questions: turn, ends, and base identification

  • DNA helix turn and base-pair count:
    • A full 360° turn of B-form DNA contains approximately 10 base pairs.
    • Consequently, there are about 20 nucleotides per turn (two per base pair).
  • End identification in a doubled strand (5' and 3' ends):
    • In a given anti-parallel pair of strands, the 5' end is identified by the terminal phosphate group attached to the 5' carbon of the sugar.
    • The 3' end is identified by the terminal hydroxyl group on the 3' carbon of the sugar.
    • In the example discussed, the left strand ends at the 5' end, while the opposite strand runs in the 5'→3' direction opposite to the left strand, making that strand's end the 3' end in that orientation.
  • Recognizing bases in a single strand (with a provided sequence image):
    • Cytosine: recognized by the amine group (–NH₂).
    • Adenine: identified as a purine (two-ring structure).
    • Guanine: identified by having an oxygen-containing feature and as a purine.
    • Thymine: identified by the methyl group (–CH₃).
  • Complementarity and anti-parallel arrangement:
    • The second strand is complementary and anti-parallel to the first strand; bases pair as A↔T and G↔C.
    • The bases on the complementary strand would be read in the opposite direction (3' to 5' opposite 5' to 3').
  • Quick recap about RNA vs DNA in the context of structure:
    • RNA is typically single-stranded but can base-pair with other nucleic acids to form structures; RNA molecules in cells are coated with proteins (RNA-protein complexes).

RNA structure (brief summary relevant to course concepts)

  • RNA differs from DNA in form and function but can participate in base pairing:
    • RNA is single-stranded most of the time but can fold back on itself to form structures via intramolecular base pairing.
    • RNA can pair with other RNA or DNA sequences, enabling functions like interference and regulation (to be covered in RNA interference topics).
  • Proteins association:
    • RNA molecules are often coated with proteins, which influences their stability and function in the cell.

Quick reference formulas and principles

  • Base pairing in double-stranded DNA:
    • AextpairswithT, GextpairswithCA ext{ pairs with } T,\ G ext{ pairs with } C
    • [A][T], [G][C][A] \approx [T], \ [G] \approx [C]
    • [A]+[G][C]+[T]50%[A]+[G] \approx [C]+[T] \approx 50\%
  • Melting temperature (two plus four rule):
    • Tm=2×(A+T)+4×(G+C)[C]Tm = 2 \times (A+T) + 4 \times (G+C) \quad [^{\circ}\mathrm{C}]
  • Deamination reaction (example):
    • CdeaminationU\text{C} \xrightarrow{\text{deamination}} \text{U}
  • Structural features:
    • Major groove vs Minor groove: major is wider and contains more sequence information; minor is narrower and can be involved in bending and recognition by certain proteins.