RNA Prediction and Analysis of Structures BIOL 4300 Notes
RNA Structure Overview
Primary Structure
- Defined by the sequence of nucleotides (monomers) e.g., ATGCCGTCAC
Secondary Structure
- Refers to the 2D folding of RNA defined by hydrogen bonds among bases.
Tertiary Structure
- The 3D conformation of the RNA molecule.
Quaternary Structure
- Arrangement of multiple folded RNA molecules together.
Non-Coding RNAs (ncRNAs)
- Definition: RNA molecules not translated into proteins.
- Types of ncRNAs:
- tRNA (Transfer RNA)
- Assists in protein synthesis by carrying amino acids.
- rRNA (Ribosomal RNA)
- Component of ribosomes, crucial for protein synthesis.
- snoRNA (Small Nuclear RNA)
- Guides chemical modifications of other RNAs.
- microRNA (miRNA)
- Regulates gene expression.
- siRNA (Short Interfering RNA)
- Involved in RNA interference (RNAi) pathway, silencing gene expression.
- piRNA (PIWI-Interacting RNA)
- Involved in silencing transposons and regulating gene expression in germ line cells.
- Long ncRNAs
- Non-protein coding transcripts larger than 200 nucleotides.
Functions of Non-Coding RNAs
Can influence:
- Gene transcription
- Translation
- Localization
- Replication
- Degradation
Specific functions:
- Protein synthesis (rRNA and tRNA)
- RNA processing (snoRNA)
- Gene regulation (miRNA)
- RNA interference (siRNA)
- Gene silencing (piRNA)
RNA Structural Features
- Single-Stranded:
- RNA is mostly produced as a single-stranded molecule.
- Folding:
- The strand can fold upon itself to form base pairs.
- Base Pairing:
- Regular Watson-Crick base pairing (A-U, G-C).
- Non-Watson-Crick base pairing (e.g., G/U wobble).
Types of RNA Secondary Structures
Loops:
- Bulge Loop
- Interior Loop
- Hairpin Loop
- Pseudoknot
- Junction (Multiloop)
Complex Folds:
- Pseudoknot
- Kissing Hairpins
- Hair-bulge interactions
Approaches to RNA Secondary Structure Prediction
- Energy Minimization:
- Does not require prior sequence alignment; estimates energy contributions for structure.
- Comparative Sequence Analysis:
- Uses sequence alignment to identify conserved residues and covariant base pairs.
- Simultaneous Folding and Alignment:
- Involves structural alignment.
Energy Minimization Assumptions
- Most likely structure is the energetically most stable.
- Energy at any position is influenced only by local sequence and structure.
- Pseudoknots may be neglected in predictions.
Energy Minimization Methods
- Aim: Find structure with the most base pairs.
- Consider only A-U and G-C pairs without distinction.
- Nussinov Algorithm:
- Simple base pair maximization method.
- Energy Minimization Algorithm:
- Predicts secondary structure by minimizing free energy (ΔG).
- ΔG calculated from contributions of loops and stacking interactions.
Minimal Free Energy (MFE) Approach
- The most common method for predicting RNA secondary structure.
- Searches for a structure with the Minimal Free Energy.
Example of RNA Secondary Structure
- Sequence:
- 5’ to 3’ Folding:
- Base pairing example:
- G with C, A with U through hydrogen bonding.
Common Secondary Structures
- Internal Loop
- Hairpin Loop
- Bulge
- Dangling Ends
- Stem