Bioinformatics + Homology Modelling

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/35

flashcard set

Earn XP

Description and Tags

Things you need to know! • Databanks vs. databases • The difference between a primary and a secondary databank • Examples of primary and secondary databanks • The definition of homology / orthology / paralogy and their importance • The difference between a databank (like SwissProt) and a search tool (like BLAST) • Local and global alignment • Basic ideas of sequence searching and the meaning of the e-value • Problems of annotation

Last updated 9:02 PM on 3/12/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

36 Terms

1
New cards

Define bioinformatics

Aiding the Biologist in creating, storing, searching and analyzing biological data (particularly sequences and structures, but also 'omics data), presenting it in a way Biologists can use and applying the analysis to make predictions

2
New cards

What are we trying to achieve with bioinformatics?

knowt flashcard image
3
New cards

What is a key point both homologous proteins/genes share?

  • Have descended from a common ancestor

  • Homologues have derived from a common ancestor hence they will have similar characteristics

4
New cards

“It is WRONG to refer to ‘percentage homology’”, why?

  • An old term used before the evolutionary angle took over the word homology

  • Percentage sequence identity is used instead to infer homology

5
New cards

What is the criteria for a protein sequence to be always homologues? (and vice versa)

100AA with >35% sequence identity = Homologues and same function still

Lower sequence identity = May be homologues

6
New cards

Which source can you use to compare structures of homologues and determine which type of homologue?

Structure comparison

  • PDB Database

  • GPCRdb

Identifying type

  • NCBI Homologene

  • EBI

7
New cards

What are the two types of homologues?

  • Orthologues

  • Paralogues

8
New cards

What are orthologues?

Homologues resulting from a speciation event

i.e. the same gene in two different species

9
New cards

What are paralogues? Give an example

Result from gene copying within a species

E.g. globin genes (beta and alpha globin)

10
New cards

What is protein modelling, why use it, and why not just rely on software predictions?

Protein modelling = Using instrumentation to work out the actual protein structure, i.e. cryo-EM

You need to validate things experimentally to verify that what you’ve computed is correct (support the computational prediction), and you get the results you expected

11
New cards

What are the two types of storages of data?

Databank & database

12
New cards

Define databank

A collection of data (normally in simple text files) without a fixed associated query tool

  • You can download all the raw data and handle it however you wish

13
New cards

What are the types of databanks?

Primary

  • Raw sequence/structure data

  • Possibly with detailed annotations

Secondary = Curates the primary data

  • Derived data - sequence profiles, etc

  • Generally highly annotated

14
New cards

Give examples of primary databanks, and what they (+may) contain

DNA: Genbank, EMBL-ENA, DDBJ

Protein: UniProtKB/SwissProt

  • Simply contain sequence data (DNA or protein)

  • May also have ‘feature’ information (splice sites, signal sequences, disulphides, active sites, etc., etc.)

  • DNA databanks may also contain translations (known or predicted)

15
New cards

What is the main primary databanks for structural data and enzyme classifications (EC numbers)?

  • PDB

  • Enzymes

16
New cards

Give examples of primary databanks, and what they (+may) contain

PROSITE, PRINTS, BLOCKS, INTERPRO

  • Contain derived information

  • Patterns that characterize a protein family

  • Detailed annotation

NOTE: Can be used for finding homologues

17
New cards

Where would you find annotations of proteins/genes from the source?

  • References

  • Methods

  • Cross-links to other databases

  • ‘Feature tables’

    • Segments of biological significance (e.g. coding regions, active sites)

  • More general descriptions

    • Computer parsable?

    • May use a ‘restricted dictionary’ or ‘ontology’

  • Authors

NOTE: Just because an information isn’t there doesn’t necessarily mean it is not known

18
New cards

How are most sequences predicted and why are gene-prediction methods imperfect?

  • Predicted from genomic data

  • Most of the the time the software only makes a sensible prediction after more biological information is put into the database

“A protein identified from genome data is hypothetical until verified by experiment”

19
New cards

Define database

A structured collection of data hidden behind a fixed search tool

20
New cards

What is protein/peptide sequencing largely replaced by?

DNA sequencing (now automated)

21
New cards

What are the two important things to remember about quality?

  • The quality of raw data is as good as the methods that produce it

  • The quality of annotations is as good as the curators

22
New cards

How were annotations in the pre-genome world different to annotation in the genome world?

PRE

  • DNA sequence came from a single group with particular interest in that gene

  • Annotations grounded in experiment

  • Written by experts in that gene

POST

  • Rarely experimental confirmation of expression of putative genes or characterization of products

  • Annotation based on computer analysis

  • The weakest link of genomics!

  • Getting it right is labour intensive and underfunded

  • Poor annotation negates the benefit of having genome sequence data!

23
New cards
24
New cards
25
New cards
26
New cards
27
New cards
28
New cards
29
New cards
30
New cards
31
New cards
32
New cards
33
New cards
34
New cards
35
New cards
36
New cards

Explore top notes

note
Fluid flow
Updated 507d ago
0.0(0)
note
Properties of Matter!
Updated 1520d ago
0.0(0)
note
Neurobiology Lab Quiz 1 Concepts
Updated 1131d ago
0.0(0)
note
Introduction to Animals
Updated 1171d ago
0.0(0)
note
Week 6 & 7 Readings
Updated 1167d ago
0.0(0)
note
Fluid flow
Updated 507d ago
0.0(0)
note
Properties of Matter!
Updated 1520d ago
0.0(0)
note
Neurobiology Lab Quiz 1 Concepts
Updated 1131d ago
0.0(0)
note
Introduction to Animals
Updated 1171d ago
0.0(0)
note
Week 6 & 7 Readings
Updated 1167d ago
0.0(0)

Explore top flashcards

flashcards
Soph L1 Vocab
300
Updated 704d ago
0.0(0)
flashcards
POE Lesson 3-2 Key Terms
34
Updated 1129d ago
0.0(0)
flashcards
Types of Energy
25
Updated 474d ago
0.0(0)
flashcards
western civ test 1
100
Updated 569d ago
0.0(0)
flashcards
CSCI 2100 Final Exam
40
Updated 341d ago
0.0(0)
flashcards
Chemistry Unit 2
46
Updated 1278d ago
0.0(0)
flashcards
APHUG Unit 6
98
Updated 697d ago
0.0(0)
flashcards
Birds M1TN3
79
Updated 109d ago
0.0(0)
flashcards
Soph L1 Vocab
300
Updated 704d ago
0.0(0)
flashcards
POE Lesson 3-2 Key Terms
34
Updated 1129d ago
0.0(0)
flashcards
Types of Energy
25
Updated 474d ago
0.0(0)
flashcards
western civ test 1
100
Updated 569d ago
0.0(0)
flashcards
CSCI 2100 Final Exam
40
Updated 341d ago
0.0(0)
flashcards
Chemistry Unit 2
46
Updated 1278d ago
0.0(0)
flashcards
APHUG Unit 6
98
Updated 697d ago
0.0(0)
flashcards
Birds M1TN3
79
Updated 109d ago
0.0(0)