Transcriptomics 1- what, why and how

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Get a hint
Hint

What is the transcriptome?

Get a hint
Hint

The set of all RNA transcripts, including coding and non-coding, in an individual or a population of cells.

Get a hint
Hint

Draw the timeline of the Central Dogma of Molecular Biology.

Get a hint
Hint
knowt flashcard image
Card Sorting

1/31

Anonymous user
Anonymous user
encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

32 Terms

1
New cards

What is the transcriptome?

The set of all RNA transcripts, including coding and non-coding, in an individual or a population of cells.

2
New cards

Draw the timeline of the Central Dogma of Molecular Biology.

knowt flashcard image
3
New cards

Name 6 environmental factors

  • Weather (esp plants)

  • Hormones

  • Diet

  • Drugs

  • Psycological (e.g., stress)

  • Experience, e.g., learning

4
New cards

What 4 processes do transcripts in a cell reflect on?

  • Transcription

  • Splicing

  • Nuclear export

  • Decay

5
New cards

3 reasons why we study the transcriptome?

  1. What genes are expressed in which organism (evolution, gene function)

  2. What genes are expressed in what cells (what makes a specific cell type, function, gene function).

  3. What gene expression changes occur under certain conditions (loss/gain of function, environment, disease mechanism + biomarkers)

6
New cards

Name 5 types of RNA in a cell.

  • mRNA

  • rRNA

  • tRNA

  • noncoding RNA

  • sn/snoRNA

7
New cards

First, second and third most abundant RNA by mass?

  1. rRNA

  2. tRNA

  3. mRNA

8
New cards

What is the most abundant RNA by number of molecules?

tRNA.

9
New cards

What 2 methods can sequence mRNA and ncRNA?

PolyA enrichment

rRNA depletion

10
New cards

What happens during PolyA enrichment?

Oligo d (T25) magnetic bead moves along mRNA from 3’ to 5’ cap

<p>Oligo d (T25) magnetic bead moves along mRNA from 3’ to 5’ cap</p>
11
New cards

What happens during rRNA depletion? (3)

  1. Probe hybridization to rRNAs

  2. Capture of DNA-RNA hybrids with magnetic beads.

  3. Removal of rRNAs.

<ol><li><p>Probe hybridization to rRNAs</p></li><li><p>Capture of DNA-RNA hybrids with magnetic beads.</p></li><li><p>Removal of rRNAs. </p></li></ol><p></p>
12
New cards

How is sequence data for the transcriptome generated? (3)

  1. Extract RNA, then remove the rRNA (or purify mRNA).

  2. Fragment, reverse transcribe, add adapters, sequence.

  3. Send for next-generation sequencing (high throughput sequencing).

13
New cards

What technology produces:

  1. 100-150 bp per end

  2. 300 bp per end

  3. 10kb

  1. Illumina HiSeq

  2. Illumina MiSeq

  3. PacBio/ONT

14
New cards

What is sequencing depth?

The total number of reads obtained from a high-throughput sequencing run.

15
New cards

What is coverage?

The ratio of the total number of bases obtained by sequencing to the size of the genome.

16
New cards

What 3 things is identification of RNA molecules dependent on?

  • Length

  • Abundance

  • Number of sequence reads.

17
New cards

What are the 4 bulk RNA-seq current standards?

  1. Average library insert size is 200 base pairs

  2. 2 or more replicates

  3. Each replicate should have 30 million aligned reads

  4. Replicate concordance

18
New cards

What does replicate concordance mean?

The gene level quantification should have a Spearman correlation of <0.9 between isogenic replicates and >0.8 between anisogenic replicates (replicates from different donors).

19
New cards

What is the single-cell isolation followed by RNA-seq specific standards?

Each replicate requires only 5 million aligned reads.

20
New cards

Name 3 adaptations to the technology to look at low abundance transcripts.

  1. Pol II- associated RNA enrichment

  2. Run-on RNA enrichment

  3. Metabolic RNA labelling

21
New cards

How is a transcriptome generated with a reference genome?

Map the sequence data to the reference genome. Use splice aligning of DNA/pre-mRNA to produce spliced mRNA and RNA sequenced reads.

<p>Map the sequence data to the reference genome. Use splice aligning of DNA/pre-mRNA to produce spliced mRNA and RNA sequenced reads. </p>
22
New cards

When generating the transcriptome with a reference genome, why are there sequence differences between reads and reference? (2)

  • Alternative splicing

  • SNPs or mutation

23
New cards

When generating a transcriptome without a reference genome, how do you assemble the sequence data? (2)

  1. Overlap graphs OR

  2. De Bruijn graphs

24
New cards

Role of overlap graphs? Is it memory intensive?

Simply align all the reads against each other to find overlap, and extend the reads to form transcripts.

Memory intensive.

25
New cards

Role of De Bruijn graphs? Is it memory intensive?

They identify all the k-mers (of length k) in the reads, and assemble these by searching for k-1 overlap. Less memory intensive.

26
New cards

How do you measure expression? (3)

  • Each sample is mapped back to the reference using aligners and Quasi-mappers

  • Identification of novel transcripts

  • Gene expression can be estimated by counting the number of reads that come from the gene after normalising for total read number

27
New cards

What are aligners?

They align reads and reports SNPs.

28
New cards

What are Quasi-Mappers?

Just gives counts per gene.

29
New cards

What is differential gene expression?

Difference in abundance of gene transcripts within a transcriptome

30
New cards

At 5% significance (P<0.05), how many are false positives?

1 in 20

31
New cards

What does FDR stand for?

False discovery rate

32
New cards

What is the q value?

FDR-corrected p value. Gives more confidence.