BIO 331 - Structural and Functional Annotation

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/53

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 9:42 PM on 3/15/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

54 Terms

1
New cards

Genome annotation

Attaches biological information to sequences

2
New cards

Structural annotation

Process of identifying coding genes (an intron-exon structures) and non-coding genes (e.g. tRNAs)

3
New cards

Functional annotation

Attaches metadata structural annotations (e.g. which product is encoded in the gene)

4
New cards
<p>This image describes the workflow for ___</p>

This image describes the workflow for ___

Genome annotation

5
New cards

Programs for finding repeats and masking/annotating them

RepeatMasker, emboss NUCLEIC REPEATS

6
New cards
<p>This output is from ___</p>

This output is from ___

RepeatMasker

7
New cards
<p>This program is ___</p>

This program is ___

RepeatMasker

8
New cards
<p>This program is ___</p>

This program is ___

emboss NUCLEIC REPEATS

9
New cards
<p>This output is from ___</p>

This output is from ___

emboss NUCLEIC REPEATS

10
New cards

Structural annotation helps identify these genomic elements

Non-coding genes, regulatory motifs/promoters, coding genes

11
New cards

ncRNAs

Non-coding RNAs; RNA molecule that is not translated into a protein

12
New cards
<p>tRNAscan-SE</p>

tRNAscan-SE

Finds tRNAs

13
New cards
<p>RNAmmer</p>

RNAmmer

Finds rRNA

14
New cards
<p>Rfam</p>

Rfam

Finds tRNA, rRNA, mtRNA, snRNA, miRNA, etc.

15
New cards

This output is from ___

Rfam

16
New cards

What does ab initio mean?

From the beginning

17
New cards

Ab initio signals

Specific sequences that indicate the presence of a gene nearby (e.g. promoter)

18
New cards

Ab initio content

Properties of protein-coding sequence itself (e.g. start & stop

codons)

19
New cards

Tools for ab initio discovery

GLIMMER, GeneMarkS, Prodigal, ORF finder

20
New cards
<p>What site is this?</p>

What site is this?

GLIMMER

21
New cards
<p>This output is from ___</p>

This output is from ___

GLIMMER

22
New cards
<p>What site is this?</p>

What site is this?

GeneMarkS

23
New cards
<p>This output is from ___</p>

This output is from ___

GeneMarkS

24
New cards
<p>Why do split reads exist in eukaryotic RNASeq data?</p>

Why do split reads exist in eukaryotic RNASeq data?

Intron splicing

25
New cards

Average vertebrate gene is ___ kb long

30

26
New cards

Average vertebrate coding sequence is approx. ___ kb long

1

27
New cards

Average vertebrate coding region consists of ___ exons of about ___ bp each

6, 200

28
New cards

Exon boundaries can be defined through ___

RNASeq

29
New cards

FOr highest accuracy, one should use ___ data rather than screening public databases

Experimental

30
New cards

Databases for determining gene function

UniProt, RefSeq, Pfam/TIGERFAM, user-provided set of annotated proteins

31
New cards
<p>What website is this?</p>

What website is this?

UniProt

<p>UniProt</p>
32
New cards
<p>This output is from ___</p>

This output is from ___

UniProt

<p>UniProt</p>
33
New cards

True or false, UniProt provides hits to specific proteins?

True

34
New cards

Pfam is used to identify the ___ your protein belongs to

Family

35
New cards

Superfamily vs subfamily

Superfamily: large group of distantly related proteins

Subfamily: small group of closely related proteins

36
New cards

Building Pfam families

Seed alignment used to build a profile HMM

Profile HMM is searched against sequence databases

All matches scoring equal to or grater than a given threshold are considered as true members of the protein family

These members are added to the seed alignment to generate the full alignment and from there a consensus HMM and a consensus sequence

37
New cards

HMMs

Hidden Markov Models; used in many BI applications (gene/protein prediction, phylogenetic analysis, alignments)

38
New cards

HMMs are systems that move from state to state with ___ probabilities. Each state produces a new possible outcome.

Finite

39
New cards

Issue with HMMs

States that are responsible for a possible outcome are unobservable; only final outcome is observable

40
New cards

Two probabilities associated with HMMs

Transition and emissions

41
New cards
<p>Transition probability</p>

Transition probability

Probability of transitioning from one state (e.g. AA) to another

42
New cards
<p>Emissions probability</p>

Emissions probability

Probability that a given AA exists at that position in the alignment

43
New cards

Comparing highly divergent protein sequences is best achieved through ___

HMMs

44
New cards

BLAST is based on ___ comparisons

Pairwise

45
New cards

Seq2EC

Takes an unknown protein and can predict the EC that would match the protein

46
New cards

BRENDA

Takes EC number and gives names of the protein that is usually assigned

47
New cards
<p>This website is ___</p>

This website is ___

StructRNA finder

48
New cards
<p>This output is from ___</p>

This output is from ___

StructRNA finder

49
New cards
<p>This website is ___</p>

This website is ___

InterPro scan

50
New cards
<p>This output is from ___</p>

This output is from ___

InterPro scan

51
New cards
<p>This website is ___</p>

This website is ___

BlastKOALA

52
New cards

BlastKOALA provides information about ___

KEGG orthologs (KOs)

53
New cards

InterPro scan provides information about ___

Gene ontology (GO)

54
New cards
<p>This output is from ___</p>

This output is from ___

BlastKOALA

<p>BlastKOALA</p>

Explore top notes

note
Quadratic sequences
Updated 1158d ago
0.0(0)
note
AP Bio: Unit 2 Chemistry of Life
Updated 690d ago
0.0(0)
note
Chemistry Honors: Final Review
Updated 299d ago
0.0(0)
note
War of the worlds plot summary
Updated 33d ago
0.0(0)
note
French Unit 3 Study Guide
Updated 1236d ago
0.0(0)
note
Hormones and behavior
Updated 1355d ago
0.0(0)
note
Electricity
Updated 1232d ago
0.0(0)
note
Quadratic sequences
Updated 1158d ago
0.0(0)
note
AP Bio: Unit 2 Chemistry of Life
Updated 690d ago
0.0(0)
note
Chemistry Honors: Final Review
Updated 299d ago
0.0(0)
note
War of the worlds plot summary
Updated 33d ago
0.0(0)
note
French Unit 3 Study Guide
Updated 1236d ago
0.0(0)
note
Hormones and behavior
Updated 1355d ago
0.0(0)
note
Electricity
Updated 1232d ago
0.0(0)