Notes on Transcription and Codon Reading

Codons and Translation Basics

  • Translation involves reading RNA codons in groups of three to specify amino acids.
  • The mapping from codons to amino acids is given by a codon-to-amino acid chart.
  • In this context, the chart uses RNA bases (U, C, A, G), not DNA bases (T).
  • The question asked in the transcript confirms we are looking at RNA: "Are we looking at DNA or RNA?" Answer: RNA.
  • The order of bases in a codon matters; you read from the first base, then the second, then the third to form a codon.
  • The bases are grouped into triplets (codons): each codon specifies one amino acid on the chart.
  • The transcript provides an example sequence fragment to illustrate grouping and translation.

Reading Codons: Order and Frames

  • A codon is a triplet of nucleotides: the reading can be represented as a sequence of three positions:
    • First position: base 1
    • Second position: base 2
    • Third position: base 3
  • Mathematically, a codon can be represented as \text{codon} = (n1, n2, n3) where each base is from the RNA set ni \in {U, C, A, G}.
  • The reading direction is 5' to 3' along the mRNA strand.
  • Codons are read in successive triplets: codon1 uses bases 1–3; codon2 uses bases 4–6; etc.
  • If a sequence ends with fewer than 3 bases (incomplete codon), that remaining segment does not constitute a full codon.

RNA vs DNA in Codon Reading

  • The transcript highlights that the chart is for RNA codons, not DNA, as it uses the RNA bases U, C, A, G.
  • Practical implication: During transcription (DNA -> RNA) thymine (T) is replaced by uracil (U) in RNA; translation then reads the RNA codons.
  • Key takeaway: When interpreting codon charts, you should use RNA sequences (with U) rather than DNA sequences (with T).

Example: Codon GAC (from the transcript)

  • Given sequence fragment: gacg
  • Grouping into codons: gac | g
    • codon1 = GAC
    • remaining base = G (incomplete second codon)
  • Translation for codon1:
    • GAC translates to the amino acid Aspartic acid (Asp), with one-letter code D and three-letter code Asp.
  • If more bases followed, the leftover G would start the next codon with two additional bases to complete it.
  • Important concept illustrated: the first codon is determined by the first three bases of the reading frame; subsequent codons follow in triplets.

Quick Recap of Key Points

  • Codons are read in triplets from 5' to 3' on the RNA sequence.
  • The first, second, and third bases of each codon determine the encoded amino acid via the codon chart.
  • The chart used in the transcript is based on RNA bases: U, C, A, G.
  • In the example, the sequence gacg yields codon GAC (Asp) and a leftover G starting the next codon if extended.
  • The concept of a reading frame is critical: grouping into non-overlapping triplets defines the amino acid sequence.
  • The distinction between transcription (DNA -> RNA) and translation (RNA -> protein) is implied by the RNA codon context.

Notation and Formulas (quick reference)

  • Codon definition: \text{codon} = (n1, n2, n3), \quad ni \in {U, C, A, G}.
  • Codon length: |\text{codon}| = 3.
  • Example mapping: \text{GAC} \rightarrow \text{Asp} \; (D).
  • Reading frame concept: codons are non-overlapping triplets starting from the chosen start position along the 5' to 3' direction.