Genome, Transcriptome Assembly, and Annotations

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/29

flashcard set

Earn XP

Description and Tags

Flashcards reviewing key concepts from the lecture including genome and transcriptome assembly, annotations, sequencing technologies, and relevant algorithms.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

30 Terms

1
New cards

Zoonemia

A project assembling genomes from 240 different mammals.

2
New cards

Genome Assembly

Taking sequences and putting them together for comparison.

3
New cards

Illumina Sequencing

Fragmented pieces of DNA are sequenced, and mapped to a reference genome if available. If not, assembly is done de novo.

4
New cards

Repetitive Elements

A major challenge in genome assembly due to their repetitive nature.

5
New cards

Human Genome

The only reference genome that has been fully completed.

6
New cards

T to T

Telomere to telomere; a complete version of the human genome.

7
New cards

Telomeres

The ends of chromosomes.

8
New cards

Genome Annotation

Adding meaning to the genome, including structural and functional aspects.

9
New cards

Structural Annotation

Locating genes within the genome (e.g., central region of chromosome 1).

10
New cards

Functional Annotation

Determining the functional importance of different parts of the genome; involves gene ontology and pathway analysis.

11
New cards

Genome Assembly Steps

Reads are assembled into contigs, and then into scaffolds.

12
New cards

N50 Value

A statistical measurement indicating the minimum contig length needed to cover 50% of the genome.

13
New cards

Reference-Based Assembly

Assembly using an existing genome as a template (e.g., chimpanzee genome using the human genome).

14
New cards

De Novo Assembly

Assembly performed without a reference genome, requiring assembly from scratch (e.g., for endemic species).

15
New cards

Sequencing Steps

Quality control, trimming, contamination removal, k-mer counting, genome size estimation, error correction, and De Novo assembly.

16
New cards

De Novo Assembly Algorithms

Algorithms used for De Novo assembly; examples include Velvet (using the Beyond graph algorithm) and Trinity (for RNA).

17
New cards

Combining Sequencing Technologies

Short read next-generation sequencing, like Illumina, provides abundant data, while long-read sequencing (PacBio or Oxford) fills gaps.

18
New cards

SAIC

Algorithm example that uses overlap layout.

19
New cards

Velvet

Algorithm example that uses overlap beyond graph.

20
New cards

Genome Properties Affecting Assembly

Genome size, heterozygosity, and GC content.

21
New cards

In Silico Annotations

Refers to predicting open reading frames, signaling, coding/non-coding regions, and splicing regions.

22
New cards

Experimental Validation

Uses experimental data to validate predictions and determine gene functions.

23
New cards

Mature mRNA

Five prime untranslated regions, start codon, stop codon and polyadenylation signal.

24
New cards

Splicing

Alternative splicing events and different transcript availability occur during this.

25
New cards

Transcriptome Assembly

Assembly of RNA sequences to identify isoforms and splicing architectures, using Trinity for De Novo assembly or cufflinks for reference-based assembly.

26
New cards

Trinity Strategies

Three module that use a KD DeBjorn graph approach: Inchworm, Chrysalis and Butterfly.

27
New cards

Transcriptome Assembly Analysis

Requires paired-end RNA sequencing, followed by De Novo assembly and alignment.

28
New cards

Allen Wilson and Marie Claire King Publication

Expression gives additional information, which is useful for comparisons.

29
New cards

Engene Myers

Developed the first assembly program. (1995)

30
New cards

David Hausler

Scientific Director still working on UCSC Genome Browser.