Bioinformatices Midterm

0.0(0)
Studied by 0 people
call kaiCall Kai
Locked
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/3

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 7:46 PM on 5/26/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai
Chat

No analytics yet

Send a link to your students to track their progress

4 Terms

1
New cards

Why DNA sequencing?

compare similarities between organisms to see conservation to see which sequences are the more important.

Make phylogentic trees.

Pretty much everything in bioinformatics

Find mutations such as SNPs

Alpha fold

2
New cards

BLAST steps

  1. remove low-complexity regions

  2. make a dictionary of all words that are 3 amino acids long or 11 necleotides long.

  3. augment list / dictionary to include similar words (this will include sequences that are similar but not identitcal)

  4. scan database for occurrences of words

  5. connect nearby occurrences

  6. extend matches in both directions to see how far they align

  7. prune the list of matches using a score threshold. This gets rid of sequences that matched just due to chance.

  8. evaluate significance of each remainging match. This is done by looking at the E value and further removing sequences that matched just due to chance.

  9. perform smith-waterman to get alingment. With the sequences left from the above steps we can run a dynamic programming (smith-waterman) to align them.

3
New cards

GWAS

Comparing SNPs in people with a disease verses people without a disease. You need to have at least 1000 people in both the control and test group to do this. You use SNP chips to compare many SNPs at once not just one.

4
New cards

Alpha Fold

  1. input amino acid sequence

  2. runs template search, MSA, and gets a general idea of the protien shape.

  3. based on the things in the above steps a machine learning algotithm learns relationships between the amino acids.

  4. Models the protien structure

  5. Evaulate peformance with legend and dot plot and Ramachandrin plot (shows areas where we expect amino acids to be)

  6. Structure helps you infer function and visa virsa