Bioinformatics Lec2

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/21

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

22 Terms

1
New cards

Volcano Plots

  • Show which genes are significantly differentially expressed in a condition

2
New cards

Direct functions

Actual biological pathways of a gene

3
New cards

Indirect functions

Larger scale impacts of the pathway a gene is involved in

4
New cards

Genes can be associated into a set based on ____ or _____

pathway involvement, similarity

5
New cards

What is a gene set?

A priori defined set of genes

6
New cards

______, ______, and ______, are sources of gene sets

Literature and experimental evidence, curated databases, computational predictions

7
New cards

Gene ontologies are _____ with _______relationships

gene sets; hierarchal relationships

8
New cards

Three aspects of gene ontology

  • Biological processes

  • Molecular function

  • Cellular component

9
New cards

The GO structure is ____ due to many __________ relationships

acyclic, parent-child

10
New cards

Molecular function

  • Catalytic/enzymatic “activity”

11
New cards

Cellular component

  • Structural components/complexes

12
New cards

Biological Processes

Pathways (ie. exocytosis)

13
New cards

What is the motivation for gene set analysis?

Is my hitlist enriched in any function?

14
New cards

Differences between gene set analysis compared to individual gene analysis

  • Genes analyzed as sets

  • Individual genes don’t need to be significantly differentially expressed

  • Gene sets must be enriched for the finding to be statistically significant

15
New cards

Over-representation analysis (ORA) takes a list of ______ genes and sees if any gene sets are _______

significant; overrepresented

16
New cards

Pro of ORA

Easy to understand and run

17
New cards

Cons of ORA

  • Does not account for genes that are borderline on the p-value cutoff

  • Possible that no genes are significant to begin with

18
New cards

GSEA step 1

Rank ALL genes from high-low expression in one condition

19
New cards

GSEA step 2

  • Identify individual genes within the data set and ranked list

  • Black lines indicate genes that are present in the gene set of question

  • Determine if there is enrichment

20
New cards

A positive enrichment score indicates

Upregulated genes in X pathway are enriched

21
New cards

A negative enrichment score indicates

Downregulated genes in X pathway are enriched.

22
New cards

Problems with ontology and enrichment

  • Ontologies are incomplete (based on experimental evidence and statistics)

  • Ontologies are always changing