UNIT TWO

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/39

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 2:57 PM on 4/13/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

40 Terms

1
New cards

what are two problems that standard BLAST cannot solve?

  • it cannot find homologs that are too distantly related

  • it’s not built for (too detailed for) large queries (10,000 bp)

2
New cards

what does PSI BLAST stand for?

position specific iterated BLAST

3
New cards

what is the purpose of PSI BLAST?

used to detect weak but biologically meaningful relationships between proteins

4
New cards

What are the steps of PSI BLAST?

  1. select a query and search it against a protein database

  2. PSI-BLAST constructs a multiple sequence alignment then created a specialized position specific scoring matrix (PSSM)

  3. the PSSM is used as a query against the database

  4. PSI-BLAST estimates the statistical significance

  5. repeat step 3 and 4 iteratively, typically five times. At each new search, a new profile is used as the query.

5
New cards

What does PSSM stand for?

position-specific scoring matrix

6
New cards

what are the steps to calculate the PSSM?

  1. create a multiple sequence alignment

  2. calculate raw frequencies - at each position the computer calculates how many times each of the 20 amino acids appear at each position

  3. calculate overall frequency - divide the specific frequency by the overall frequency

  4. log value of frequencies - find the log base 2 of all the values

7
New cards

what are and how to interpret the results of PSI-BLAST?

  • iteration - the number of BLAST you are on

  • number of hits

  • number of hits > threshold - how many hits were significant

8
New cards

what does it mean to approach convergence?

as you move down the iterations, the rate of growth slows down, till no more new sequences are found

9
New cards

What is corruption?

presence of at least one false positive alignment with an e value < 10^-4 after 5 iterations

10
New cards

what are approaches to stopping corruption?

  1. apply filtering of biased composition regions

  2. adjust E value from 0.001 to a lower E value such as E = 0.0001

  3. visually inspect the output from each iteration. remove suspicious hits by unchecking the box.

11
New cards

What is the result of one false positive?

one false positive can be amplified to many and it becomes permanent due to the nature of the math for the PSSM.

12
New cards

why proteomics?

-proteins are the functional (almost every diseases is the result of a protein failing not the gene)

-transcriptome information is only loosely related to protein levels

13
New cards

what does ELISA stand for?

enzyme linked immunosorbent assays

14
New cards

what is ELISA used to do?

used to detect and measure specific proteins or antibodies in a liquid sample.

15
New cards

steps of ELISA

1 - antibody is added; only binds to target protein

2 - secondary antibody is added; it looks for first antibody

3 - substrate is added

4 - enzyme catalyzes; produces color; the more color the more protein

16
New cards

steps of single protein analysis

1 - gel based separation: each black dot is a different protein

2 - spot excision: cut out one specific dot; same dot, same protein

3 - digestion: add on enzyme to chop up protein into smaller pieces called peptides

4 - MS analysis

17
New cards

what does protein sequence analysis allow for?

protein classifcation

18
New cards

What is the primary purpose of analyzing protein sequences using computational (in silico) methods?

characterize protein structures in silico and allows the prediction of protein structure and function

19
New cards

If a BLAST search shows high sequence homology between two proteins, what can you safely assume about their similarities?

may not have the same function but most always has the same structural fold

20
New cards

steps of shotgun analysis

1 - digestion of protein mixture

2 - liquid chromatography

3 - MS analysis

21
New cards

what are protein sequence databases?

-atlas of protein sequence and structure (currently known as Protein Information Resource (PIR)

-protein data bank

-UniProt

22
New cards

What can patterns of conservation in protein sequences tell us about specific amino acid residues?

They help determine which residues are under selective constraints, meaning those residues are critical for the protein's function

23
New cards

What is the defining characteristic of homologous proteins?

They share a common ancestor.

24
New cards

Different proteins evolve at ____ depending on their functional importance and structural requirements.

different

25
New cards

protein analysis is ____ sensitive than DNA analysis

more

26
New cards

How is the amino acid sequence of a protein typically generated for comparison?

It is generated from proteomics experiments

27
New cards

How is the % similarity between two protein sequences determined?

By aligning the two sequences and counting the number of identical residues or using an index of similarity (like a substitution matrix)

28
New cards

What is the primary method used to compare the 3D shapes of different proteins?

Superimposition (or structural overlay). This involves physically rotating and shifting the 3D models to see how well their "skeletons" match up.

29
New cards

pairwise alignment

30
New cards

multiple sequence alignment

31
New cards
32
New cards
33
New cards
34
New cards
35
New cards
36
New cards
37
New cards
38
New cards
39
New cards
40
New cards