L18 Bioinformatics I - multiple sequence alignment

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/26

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

27 Terms

1
New cards

what is a sequence alignment

same set of sequences with zero or more gaps inserted into them, such that

  • all sequences have the same length

  • there is no alignment position where every sequence contains a gap

2
New cards

what is a pairwise alignment

find the optimal alignment for any pair of biological sequences

3
New cards

what is a global alignment

aligns whole sequences end to end

4
New cards

give an example of a global alignment

needle

  • implementation of Needleman-Wunsch alignment

5
New cards

what is local alignment

focusses on the best matching part of sequences

6
New cards

give an example of local alignment

water

  • implementation of Smith-Waterman alignment

7
New cards

when does pairwise alignment not scale

when comparing many sequences

8
New cards

what is the equation to find the comparisons in a multiple alignment

  • 3 sequences = 6 comparisons

  • 8 sequences = 28 comparisons

<ul><li><p>3 sequences = 6 comparisons</p></li></ul><ul><li><p>8 sequences = 28 comparisons</p></li></ul><p></p>
9
New cards

what is a multiple sequence alignment viewer

JalView

10
New cards

constructing alignments by eye does not scale, though…

there are tools that enable this (JalView, Seaview, CINEMA)

11
New cards

what is an algorithm

a process or set of rules followed to solve a problem

12
New cards

what approaches are there to solve multiple sequence alignment

heuristic approaches

13
New cards

whats a heuristic approach

a pragmatic approach that is not optimal, but is good enough

14
New cards

what are the 2 main classes of multiple sequence alignment algorithms

progressive and iterative

15
New cards

when was clustal multiple sequence alignment

old method, lots of versions (V 1992, W 1994, X 1997, Omega 2011)

16
New cards

when was T-coffee multiple sequence alignment

released in 2000

  • progressive alignment followed by optimisation

17
New cards

when was MAFFT multiple sequence alignment

released in 2002

  • progressive alignment with iterative refinement

18
New cards

when was Muscle multiple sequence alignment

released in 2004, draft alignment, improved alignment, refinement

19
New cards

what are the 3 steps of Clustal algorithm of multiple sequence alignment

  • compare sequences to obtain a similarity matrix

  • make a guide tree that relates all the sequences

  • perform progressive alignments, adding new sequences according to the guide tree

20
New cards

how do we compare sequences to obtain a similarity matrix in Clustal multiple sequence alignment

  • long vector alignment

  • clustering using a standard algorithm

  • genetic distances between each pair of sequences are computed- the number of mismatched positions in the MSA divided by the total number of matched positions

21
New cards

how do we compute genetic distances

the number of mismatched positions in the MSA divided by the total number of matched positions

22
New cards

how do we use a similarity matrix to make a guide tree that relates all the sequences in Clustal multiple sequence alignment

  • the genetic distances from the alignments are used to form a phylogenetic tree for the sequences

  • the tree is termed a ‘guide tree’ and it is used to control the order in which sequences are added progressively to the multiple alignment

  • the relative contributions of sequences to the alignment are weighted according to the evolutionary positions of the sequences in the tree

23
New cards

how do we perform progressive alignments, adding new sequences according to the guide tree

  • sequences are aligned progressively

  • most closely related pair or pairs are aligned first

  • the next closely related sequences are then added by aligning them with the existing alignment

24
New cards

describe how Clustal alignments, a progressive alignment strategy, would be performed

  • 4 with 5 → alignment 4-5

  • 1 with 2 → alignment 1-2

  • alignment 4-5 with 3 → alignment 3-4-5

  • alignment 3-4-5 with alignment 1-2 → alignment 1-2-3-4-5

25
New cards

what does MAFFT mean

multiple alignment using fast fourier transform

26
New cards

what is MAFFT

a flexible alignment method that constructs a progressive alignment that improves iteratively → i.e. a hybrid between the two main approaches

27
New cards

how does MAFFT add new protein sequences to the alignment

uses a substitution matrix