3.2

Introduction to the Transcriptome

  • Overview of the transcriptome

    • The transcriptome encompasses all the expressed RNA in a cell, including:

    • mRNA: Messenger RNA which is translated into proteins.

    • rRNA: Ribosomal RNA, which forms part of the ribosome and is vital in the translation process.

    • tRNA: Transfer RNA, which recognizes codons and delivers specific amino acids during protein synthesis.

    • Non-coding RNA: RNAs that do not code for proteins, serving multiple functions. Research is ongoing to explore the various forms of non-coding RNA.

Definition of Transcriptome

  • The transcriptome is defined as the complete set of RNA transcripts produced in a given cell or population of cells under defined conditions.

    • It consists not only of mRNA but also rRNA, tRNA, and all types of non-coding RNA.

    • Understanding the transcriptome is crucial in eukaryotes as it provides insights on gene expression regulation and function.

Relationship between Transcriptome, Proteome, and Genome

  • Eukaryotic transcription differs from bacterial transcription:

    • Bacteria: Transcription and translation occur simultaneously as they lack a nucleus.

    • Eukaryotes: Transcription occurs in the nucleus with a pre-mRNA transcript which undergoes processing (capping, splicing, and polyadenylation) before being exported for translation.

Processing of Eukaryotic RNA

  • Primary transcripts are formed first and must be processed to create mature mRNA:

    • Processing involves:

    • Addition of a 5' methyl cap.

    • Addition of a poly(A) tail.

  • Mature mRNA is then exported from the nucleus for translation by ribosomes.

Reverse Transcription and cDNA Synthesis

  • To study RNA sequences, researchers typically convert RNA into cDNA (complementary DNA):

    • Reverse Transcriptase: An enzyme that synthesizes cDNA from RNA templates using primers.

    • Oligo(dT) Primer: A primer consisting of a string of thymine nucleotides, allowing binding to the poly(A) tail of mRNAs for cDNA synthesis.

    • Note that while oligo(dT) predominantly enriches for mature mRNA, some non-coding RNAs also possess poly(A) tails, and some mRNAs may lack them.

Difference between Genomic DNA and cDNA

  • cDNA reflects only the expressed sequences (exons) of the RNA, omitting the introns, which separate exons in genomic DNA.

  • Exons are defined as portions of DNA that are expressed in the final mRNA.

Alternative Splicing

  • Alternative splicing significantly increases protein diversity from individual genes in eukaryotes leading to multiple transcript variants.

    • Mechanisms of alternative splicing include:

    • Alternative Polyadenylation: Use of different poly(A) sites affecting the 3' end of transcripts.

    • Alternative Promoter Usage: Different start sites for transcription leading to diverse mRNAs.

    • Exon Inclusion/Exclusion (Exon Cassette): Exons can be included or omitted in transcripts.

    • Mutually Exclusive Exons: Only one out of two or more exons is included in any given transcript.

    • Alternative Splice Sites: Different 5' or 3' splice sites may be utilized.

    • Intron Retention: In some transcripts, introns are retained instead of being spliced out.

Using Transcriptome Data

  • The transcriptome data plays a crucial role in annotating the proteome in eukaryotes.

    • With transcript sequences, researchers can identify open reading frames (ORFs) corresponding to proteins.

    • Analysis of the transcriptome helps reveal where genes are expressed and assists in annotating the genome accordingly.

    • Example provided: Research on a Drosophila gene associated with Down syndrome, showing extensive splicing leading to numerous variants (38,016 unique transcript variants).

Transcriptome Analysis Techniques

  • RNA Sequencing (RNA-Seq): A modern technique for chronically assessing the transcriptome.

    • RNA-Seq allows determination of which genes are transcribed, the exons included in gene transcripts, and the relative expression levels of these genes.

    • Process involves:

    • Lysis of target cells and isolation of RNA.

    • Reverse transcription to form cDNA (with the use of oligo(dT) or random hexamer primers).

    • cDNA is then sequenced using next-generation sequencing technologies, such as Illumina sequencing.

Example of Differential Gene Expression

  • Comparison of eye lens cells and red blood cells illustrates differential gene expression.

    • For example, lens cells express crystalline RNA much more than red blood cells, while red blood cells express hemoglobin significantly.

    • This process detects differentially expressed genes (DEGs) that indicate important biological functions specific to cell types.

Single-Cell RNA Sequencing (scRNA-Seq)

  • A refinement of RNA-Seq that examines individual cells to understand distinct transcriptomes of various cell types within a sample.

    • ScRNA-Seq provides insights into cellular differentiation, development, and perspectives on diseases like cancer.

    • DropSeq: A technology enabling single-cell analysis using a microfluidic device for encapsulating individual cells:

    • Combines cells with barcoded primer beads that contain unique identifiers for tracking transcripts back to individual cells.

    • mRNA from individual cells hybridizes with the attached primers on these beads.

    • Allows researchers to quantify gene expression patterns in various conditions significantly improving resolution compared to bulk RNA-Seq.

Research Application Example

  • Example cited comparing RNA patterns across zebrafish development stages, showing changes in gene expression which correlate with developmental time points.

    • Each data point indicates transcriptome similarity, allowing for the visualization of gene expression shifts during development.

Conclusion

  • The transcriptome includes all expressed RNAs in the cell, showcasing mRNA as a focal point in many transcriptomic studies.

  • RNA conversion to cDNA is a pivotal step for analysis.

  • The concept of alternative splicing highlights how single genes can yield multiple proteins.

  • RNA-Seq and single-cell RNA-Seq methods are vital for elucidating expression profiles in both aggregated and individual cell contexts.