DL 5: Genomics, Bioinformatics, and Functional Annotation

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/53

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

54 Terms

1
New cards

Genomics

The study of the DNA sequence

2
New cards

Recombinant DNA

Technology that enables cloning and manipulation of genes; foundational for genomics.

3
New cards

cDNA Cloning

Uses reverse transcriptase to synthesize complementary DNA (cDNA) from mRNA templates.

4
New cards

Reverse Transcriptase

Enzyme that converts single-stranded RNA into double-stranded DNA (cDNA).

5
New cards

Poly-T Primer

Oligonucleotide primer that binds to the poly-A tail of eukaryotic mRNA for cDNA synthesis.

6
New cards

cDNA Library

Collection of cDNA molecules representing all mRNAs expressed in a cell or tissue.

7
New cards

gDNA Library

Collection of genomic DNA fragments

8
New cards

Coverage

The average number of times each base is sequenced in a dataset; determines sequencing depth.

9
New cards

Library Coverage Calculation

Calculated by multiplying the number of clones by average insert size and dividing by genome size.

10
New cards

Unequal cDNA Representation

mRNAs vary in abundance

11
New cards

Sanger Sequencing

Uses dideoxynucleotides (ddNTPs) to terminate DNA synthesis and determine base order.

12
New cards

Whole-Genome Shotgun (WGS) Sequencing

Randomly shears DNA

13
New cards

Contigs

Overlapping DNA fragments assembled into continuous stretches of sequence.

14
New cards

Coverage in WGS

Typically 10–12× genome length to ensure complete assembly.

15
New cards

Repetitive Sequences

Make de novo genome assembly difficult due to ambiguous alignments.

16
New cards

Illumina Sequencing

Next-generation sequencing method that sequences millions of short fragments in parallel.

17
New cards

Flow Cell

Surface where Illumina sequencing occurs

18
New cards

Read Length Comparison

Sanger: ~900 bp; Illumina: ~150 bp paired-end.

19
New cards

Throughput Comparison

Sanger produces hundreds of reads per run; Illumina produces billions.

20
New cards

Cost Comparison

Sanger >$1M per gigabase

21
New cards

De Novo Assembly

Building genomes from scratch without a reference sequence; challenging for repetitive genomes.

22
New cards

Assembly by Reference

Aligning reads to a known reference genome to identify SNPs and small indels.

23
New cards

Human Genome Sequencing

Achieved through both clone-by-clone and WGS approaches; completed in 2003.

24
New cards

Genome Coverage Today

Human genome sequencing can reach ~20× coverage for <$1

25
New cards

Metagenomics

Sequencing mixed DNA from environmental samples to study uncultured microbial communities.

26
New cards

Metagenome

The collective genome of microbial populations from a given environment.

27
New cards

Bioinformatics

The computational analysis and annotation of biological data such as DNA sequences.

28
New cards

Genome Annotation

Identifying genes

29
New cards

Santa Cruz Genome Browser

Online tool for viewing

30
New cards

Gene Prediction

Computational identification of coding regions using ab initio (in silico) or experimental methods.

31
New cards

Ab Initio Gene Prediction

Statistical models predict gene structure directly from DNA sequence; fast but may be inaccurate.

32
New cards

Functional Annotation

Experimental verification of predicted genes via RNAseq and other molecular methods.

33
New cards

RNAseq

Sequencing-based method to measure gene expression and identify exons and splice variants.

34
New cards

RNAseq Output

High read density at exons; splice junction reads define intron–exon boundaries.

35
New cards

Functional Gene Annotation

Integrates RNAseq

36
New cards

Comparative Genomics

Uses cross-species sequence comparison to identify conserved functional regions.

37
New cards

Evolutionary Signatures

Conserved sequences indicate functional importance (e.g.

38
New cards

Phylogenetic Conservation

Higher conservation across species suggests essential function.

39
New cards

Conservation Example

Exons conserved between humans and rhesus monkeys more than with chickens or dogs.

40
New cards

Comparative Gene Expression

Measures expression across tissues or timepoints to infer gene function.

41
New cards

Gene Expression Profiling

Uses arrays or RNAseq to quantify expression levels across samples.

42
New cards

Heatmap

Graphical representation of gene expression patterns; clusters genes with similar expression.

43
New cards

GTEx Database

Provides tissue-specific gene expression data across human tissues.

44
New cards

ChIP-seq

Chromatin Immunoprecipitation Sequencing; identifies protein-DNA binding sites such as enhancers and promoters.

45
New cards

DNAse I Hypersensitivity Sequencing

Identifies open chromatin regions accessible to regulatory proteins.

46
New cards

Open vs Closed Chromatin

Open regions are transcriptionally active; closed chromatin wrapped around histones is inactive.

47
New cards

Enhancers

Regulatory DNA elements marked by histone modification (e.g.

48
New cards

1000 Genomes Project

Large-scale effort to catalog common human genetic variation by re-sequencing 1

49
New cards

OMIM

Online Mendelian Inheritance in Man

50
New cards

Repeat Masker

Computational tool identifying repetitive elements such as transposable elements in the genome.

51
New cards

HEXA Gene

Mutations in HEXA cause Tay–Sachs disease

52
New cards

Impact of Genomics

Technologies like sequencing and bioinformatics have revolutionized our understanding of human biology and disease.

53
New cards

Functional Genomics

Combines genomics and experimental methods to determine gene function.

54
New cards

The Upshot

Genomic and bioinformatic tools now allow disease analysis and discovery from a laptop or smartphone.