Notes on Transcription and Codon Reading
Codons and Translation Basics
- Translation involves reading RNA codons in groups of three to specify amino acids.
- The mapping from codons to amino acids is given by a codon-to-amino acid chart.
- In this context, the chart uses RNA bases (U, C, A, G), not DNA bases (T).
- The question asked in the transcript confirms we are looking at RNA: "Are we looking at DNA or RNA?" Answer: RNA.
- The order of bases in a codon matters; you read from the first base, then the second, then the third to form a codon.
- The bases are grouped into triplets (codons): each codon specifies one amino acid on the chart.
- The transcript provides an example sequence fragment to illustrate grouping and translation.
Reading Codons: Order and Frames
- A codon is a triplet of nucleotides: the reading can be represented as a sequence of three positions:
- First position: base 1
- Second position: base 2
- Third position: base 3
- Mathematically, a codon can be represented as \text{codon} = (n1, n2, n3) where each base is from the RNA set ni \in {U, C, A, G}.
- The reading direction is 5' to 3' along the mRNA strand.
- Codons are read in successive triplets: codon1 uses bases 1–3; codon2 uses bases 4–6; etc.
- If a sequence ends with fewer than 3 bases (incomplete codon), that remaining segment does not constitute a full codon.
RNA vs DNA in Codon Reading
- The transcript highlights that the chart is for RNA codons, not DNA, as it uses the RNA bases U, C, A, G.
- Practical implication: During transcription (DNA -> RNA) thymine (T) is replaced by uracil (U) in RNA; translation then reads the RNA codons.
- Key takeaway: When interpreting codon charts, you should use RNA sequences (with U) rather than DNA sequences (with T).
Example: Codon GAC (from the transcript)
- Given sequence fragment: gacg
- Grouping into codons: gac | g
- codon1 = GAC
- remaining base = G (incomplete second codon)
- Translation for codon1:
- GAC translates to the amino acid Aspartic acid (Asp), with one-letter code D and three-letter code Asp.
- If more bases followed, the leftover G would start the next codon with two additional bases to complete it.
- Important concept illustrated: the first codon is determined by the first three bases of the reading frame; subsequent codons follow in triplets.
Quick Recap of Key Points
- Codons are read in triplets from 5' to 3' on the RNA sequence.
- The first, second, and third bases of each codon determine the encoded amino acid via the codon chart.
- The chart used in the transcript is based on RNA bases: U, C, A, G.
- In the example, the sequence gacg yields codon GAC (Asp) and a leftover G starting the next codon if extended.
- The concept of a reading frame is critical: grouping into non-overlapping triplets defines the amino acid sequence.
- The distinction between transcription (DNA -> RNA) and translation (RNA -> protein) is implied by the RNA codon context.
- Codon definition: \text{codon} = (n1, n2, n3), \quad ni \in {U, C, A, G}.
- Codon length: |\text{codon}| = 3.
- Example mapping: \text{GAC} \rightarrow \text{Asp} \; (D).
- Reading frame concept: codons are non-overlapping triplets starting from the chosen start position along the 5' to 3' direction.