Genomics and Evolution Final

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/76

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

77 Terms

1
New cards

Studies of the genome must include what?

  1. The transcriptome and proteome

  2. How the transcriptome and proteome establish and coordinate the network of linked biochemical pathways in a living cell

2
New cards

What is the most direct way to study the transcriptome? What are the cons to this?

convert all the mRNA to cDNA, then sequence every clone in the library

The cons are that it is laborious and time-consuming

3
New cards

How do you generally characterize the transcriptome?

identify all the mRNAs present and quantify their abundances

4
New cards

What are two tests that are used to study the transcriptome?

  1. Serial analysis of gene expression (SAGE)

  2. Microarray or DNA chip analysis

5
New cards

How does SAGE address the issue of the labor intensity of characterizing the transcriptome?

Instead of studying the whole cDNA strands, SAGE uses 12bp sequences of the cDNA for identification of the genes coding at that moment

6
New cards

What are the steps for SAGE?

  1. mRNA is immobilized in a chromatography column by annealing the poly(A) tails to oligo (dT) strands that are attached to cellulose beads

  2. mRNA is converted to cDNA and cut is a restriction enzyme (Alu 1) that cut frequently in each cDNA

  3. the terminal restriction fragment remains attached to the bead while the other fragments are eluted

  4. An oligo with a BsmFI recongition sequence is ligated to the free end of each cDNA

  5. BsmFI will cut 10-14 bp downstream of the recognition sequence

  6. The fragments cut off the bead are then collected, ligated, and sequenced. The tagged sequences are compared with the sequences of genes in the genome

7
New cards

What is the difference between a microarray and a DNA chip analysis?

A DNA chip carries an array of immobilized oligos on a wafer or silicon

A microarray us made up of DNA molecules (PCR products) on a glass slide or nylon membrane

8
New cards

What are the two objectives of studying the transcriptome and how does a microarray accomplish them?

  1. Identify the genes who mRNA are present ← all relevant genes must be represented by at least one probe on the array

  2. Determine the relative amounts of various mRNA ← Each position in the microarray contains up to 10^9 copies of the probe molecule

9
New cards

Why are both the 5’ and 3’ end of mRNA on a microarray?

It is to look at the 5’/3’ ratio. If the ratio is significantly different from one than there is likely something wrong with the RNA (or the gene is truncated)

10
New cards

What are the steps of a microarray?

1. RNA Isolation

2. RNA quantitation

3. cDNA synthesis

4. Labelling (kind of included in the cDNA synthesis)

5. Hybridization

6. Washing unhybridized material

7. Scanning (where we obtain the data)

8. Data normalization

11
New cards

What are the two limitations of microarrays and chip analysis?

  1. Insufficient specificity between some of the mRNA can lead to cross-hybridization (this can happen with paralogous genes)

  2. Two or more mRNA may be derived from the same gene via alternative splicing

12
New cards

What happens when saturation happens in a microarray and how do you prevent it?

When saturation happens, it leads to underestimation of genes being expressed. This can be prevented by ensuring there are enough probe spaces on a chip

13
New cards

What is used as a positive control for microarrays?

the actin gene: it will never change expression with changing experimental conditions

14
New cards

When comparing transcriptomes, what are some biases that could look like differences in levels of mRNA but are not? Then how do you prevent these biases?

  1. The amount of target DNA

  2. the efficiency in which the probe has been labelled

  3. the effectiveness of hybridization

These can be avoided with

  1. Using controls like the actin gene

  2. scanning arrays at the same wavelength

  3. Data normalization: reduce background signal

15
New cards

What is a dendrogram?

Any sort of tree diagram

16
New cards

What study system is ideal for researching the transcriptome?

Yeast is an ideal study system, specifically looking at genes for sporulation

17
New cards

What has been studied in the human transcriptome?

  1. transcriptome for different cell types have been mapped onto the genome

  2. Above has provided global view of patterns of gene expression for whole chromosomes

  3. Led to discovery of transcripts from regions of genomes where no genes are known to exist

18
New cards

What was the benefit of the transcriptome analysis of human chromosomes 21 and 22?

10,000 transcribed sequences have been identified from regions not previously thought to contain genes, signifying the role of transcriptome analysis in genome annotation

19
New cards

What are the advantages to studying the proteome?

  1. Proteome plays a central role as the link between genome and biochemical pathways of the cell

  2. Proteome shows how the genome operates and how its dysfunction can lead to diseases

  3. Some proteins undergo physical and/or chemical modifications before becoming functional

20
New cards

What is the study of the proteome called?

proteomics

21
New cards

What are the two methods of identifying proteins in the proteome? (protein profiling)

  1. protein electrophoresis

  2. mass spectrometry (MALDI-TOF)

22
New cards

What kind of gel is used for identifying proteins in the proteome?

Polyacrylamide gel

23
New cards

What chemical is used to separate proteins by their molecular mass and how?

(SDS) Sodium dodecyl sulfate: This denatures the proteins and confers a negative charge about equivalent to the length of the unfolded peptide

24
New cards

What is isoelectric focusing?

These separate proteins using the function between charge and pH until proteins hit the isolelectric point

25
New cards

What is the isoelectric point?

The position of a protein in a gradient where its net charge is zero

26
New cards

How does two-dimensional gel electrophoresis work?

  1. The protein sample is loaded to the top left corner of the gel

  2. First electrophoresis is done to separate proteins by isoelectric focusing

  3. the gel is spun 90 degrees so line of proteins are on top and Na dodecyl sulfate (SDS) is added

  4. A second electrophoresis is ran so proteins are then separated by molecular mass

  5. Protein spots are visualized by staining with a silver solution

27
New cards

What are the steps for matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF)

  1. protein of interest in purified and digested with a sequence-specific protease like trypsin

  2. following ionization, the mass-to-charge ratio of a peptide is deduced from its time-of-flight within the mass spectrometer as it passes the ionization source to the detector

  3. mass-to-charge ratio can be used to calculate molecular mass, allowing the amino acid composition of the peptide to be deduced

  4. The resulting compositional sequence is related to the genome sequence in order to identify the gene that specifies for that protein

28
New cards

How does trypsin work?

Trypsin cuts following arginine or lysine residues to produce peptides 5-75 amino acids in length

29
New cards

How does mass spectrometry work for MALDI-TOV?

Peptides are ionized by a pulse of energy from a laster and then accelerated down the column to the reflector and onto a detector

Time-of-flight is related to its mass-to-charge ratio

The computer contains a database of the predicted molecular masses of every trypsin fragment of every protein encoded by the genome of the organism under study

Computer compares mass of detected peptide with database to identify the most likely source protein

30
New cards

What is the point of isotope coded affinity tag (ICAT) in proteomics?

It is used to analyze two proteomes: Each proteome is labelled with a different fluorescent marker before being put through MALDI-TOF

31
New cards

What is the chemical label commonly used in ICAT?

deuterium

32
New cards

What are the two common methods for identifying proteins that interact with one another?

  1. Phage display

  2. Yeast-two hybrid studies

33
New cards

What is phage display?

  1. Phage display uses bacteriophage lambda or M13

  2. Gene is cloned in so that is fuses with one of the phage coat proteins (via a unique restriction enzyme site located within a gene for a coat protein)

34
New cards

Disadvantages to phage-display?

  1. Time-consuming

  2. Requires prior info on possible interactions

35
New cards

What is a phage display library? and how does it work for testing protein interactions?

A collection of clones displaying a range of proteins

  1. The test protein is immobilized within the well of a microtiter plate

  2. Phage display library is added to plate

  3. plate is washed and all proteins that interact with test protein will remain in the plate

36
New cards

How does yeast two-hybrid screening (Y2H) work?

A gene of a human protein is ligated to the gene for a DNA-binding domain of a yeast activator, creating a construct that is part human and part yeast activator

A variety of fusion proteins for the actual yeast activation domain are created using different human DNA fragments

These two constructs are mixed into the yeast for screening

If there is protein-protein interaction, then RNA polymerase will be activated to express the reporter gene

37
New cards

How do you identify the components of a multiprotein complex?

With affinity chromatography

38
New cards

How does standard affinity chromatography work?

  1. Test protein is attached to the chromatography resin and placed in a column

  2. Cell extract is placed in a low-salt buffer, which causes formation of hydrogen bonds that hold proteins together in a complex

  3. Proteins that interact with the test protein are retained in the column

  4. Interacting proteins are then eluted with a high-salt buffer

39
New cards

What are two disadvantages to affinity chromatography? (one minor, one major)

The need to purify the test protein is time-consuming and difficult to use in large screening programs

Major: a single member of the protein complex is used as bait, so a member of the complex may not be isolated if it does not interact with the bait

40
New cards

How does Tandem-affinity Purification (TAP) work?

Similar to affinity chromatography

test protein is modified so that c-terminal extension binds to calmodulin

The resin in the column contains the modified test proteins

Modified test proteins trap protein complex

41
New cards

How are proteins identified in affinity chromatography and TAP?

Through mass spectrometry

42
New cards

What is a metabolome?

The complete collection of metabolites present in a cell or tissue under a particular set of conditions

43
New cards

What is metabolomics? (aka biochemical profiling)

Gives a precise description of the biochemistry underlying different physiological states (Ex. disease states)

44
New cards

What is a phylogeny?

A classification scheme indicating similarities among species and their evolutionary relationship

45
New cards

How are evolutionary relationships among a group of organisms illustrated?

With a phylogenetic tree

46
New cards

What are the main two portions of a phylogenetic tree?

nodes and branches

47
New cards

What are internal versus external nodes?

internal nodes represent taxonomic units such as species or genes

External nodes (at the end of branches) represent living organisms

48
New cards

What are two examples of what branch length can represent in a phylogenetic tree?

  1. It can represent elapsed time

  2. It can represent the number of molecular changes (aka mutations)

49
New cards

What are operational taxnomic units (OTUs)?

The extant (aka not extinct) taxonomic units under comparison

50
New cards

What are hypothetical taxonomic units (HTUs)?

Extant species whose relations are not based on empirical data

51
New cards

What is the difference between a rooted and unrooted tree?

Rooted trees have a root (aka most recent common ancestor) of all taxonomic units under study and each path corresponds to evolutionary time

An unrooted tree simply specifies the degree of kinship among taxonomic units

52
New cards

What is the different between a scaled and unscaled tree?

A scaled tree uses branch length proportionally to degree of relation, while an unscaled degree does not

53
New cards

What is the topology of the phylogenetic tree?

This refers to the branching pattern of a phylogenetic tree

54
New cards

Does a DNA or protein tree yield more phylogenetic information and why?

DNA yields more phylogenetic information than proteins

With the redundancy of codons, different DNA segments lead to the same amino acids, so for example. while three organisms may have the same amino acid sequences (which would show they are equally related on a tree), two may have the same DNA sequence, while one has a different DNA sequence, showing that two organisms are more related than the third one

55
New cards

What is an outgroup?

An OTU which we have external knowledge the clearly show them to have diverged from the common ancestor prior to all the other OTUs (ingroup taxa)

56
New cards

What are the differences between monophyletic, clade, paraphyletic, and polyphyletic? Can you show it in a diagram?

Monophyletic - two or more organisms or DNA sequences that are derived from a single ancestor

Clade - group of monophyletic organisms

Paraphyletic - a group of sequences or taxa that exclude some members of a clade

Polyphyletic - a group of DNA sequences derived from two or more distinct ancestral sequences

57
New cards

What is a gene tree?

Gene tree is constructed from comparting the sequences of orthologous genes and used to make inferences about the evolutionary history of the species from which the genes were obtained

58
New cards

Is a gene tree the same as a species tree?

While a gene tree is a more accurate reflection of a species tree than a tree based on morphological data, it does not mean that a gene tree is always the same as a species tree

59
New cards

What is a molecular clock?

It is used to date the time at which a gene divergence took place

With mutation not occurring simultaneously with speciation events, the molecular clock cannot give an accurate time of speciation

60
New cards

What is random genetic drift?

The change in allele frequency in a population

61
New cards

What are the two types of molecular phylogenetic data?

Character data and distance data

62
New cards

What is character data?

A well-defined feature that in a taxonomic unit can assume one out of two or more mutually exclusive character states

63
New cards

What is a character state?

The value or the character in a particular OTU

64
New cards

What is the different between a quantitative or qualitative character?

quantitative characters show a normal distribution and are continuous

qualitative characters are discrete, meaning they are either one thing or another and nothing in between

65
New cards

HYPOTHETICAL: We are looking at the length of middle fingers between male and female gorillas. We find that female fingers are 8 inches while male fingers are 9 inches. Can you tell me what is the character, character states, and OTU?

Character: finger length

Character state: 8 and 9 inches

OTU: gorillas

66
New cards

What is distance data?

A quantitative statement concerning the dissimilarity between two OTUs

67
New cards

What is the degree of divergence?

It is the proportion of differences between two organisms or DNA sequences. It is calculated with the number of nucleotide differences (n) divided by the sequence length (N), multiplied by 100

68
New cards
<p>Looking at a sequence from Organism 1 and Organism 2, can you calculate the degree of divergence between them? What can you conclude from the degree of divergence?</p>

Looking at a sequence from Organism 1 and Organism 2, can you calculate the degree of divergence between them? What can you conclude from the degree of divergence?

n= 3 N= 19: (3/19)*100 = 15.79%

The degree of divergence is pretty low, indicating that they are closely related

69
New cards

What are the two major types of phylogenetics trees?

A cladogram and a phenogram

70
New cards

What is the difference between cladistics and phenetics?

Cladistics: a tree that expresses ancestor-descendent relationship (rooted)

Phenetics: a tree that establishes the relationship among a group of organisms on the basis of the degree of similarity (unrooted)

71
New cards

What are four methods of tree reconstruction?

maximum parsimony

UPGMA

transformed distance method

neighbor-joining method

72
New cards

Unweighted pair-group method with arithmetic means (UPGMA)

UPGMA employs a sequential clustering algorithm, in which local topological relationships are identified in order of decreasing similarity, and the phylogenetic tree is built in a step-wise manner

Simple terms, Identify two OTUs that are the most similar. Then from that pair (aka composite OTU) the next most similar OTU is paired up with the composite, and repeat

73
New cards

How do you calculate the distance matrix between the composite OTU of A and C (d(AC)) and D? AKA how do you calculate d(AC)D? BONUS How do you calculate d(ACD)B?

d(AC)D= (dAD+dCD)/2

BONUS

d(ACD)B=(dAB+dCB+dDB)/3

74
New cards

What is an assumption you are making when calculating distance matrix?

That multiple substitutions did not occur

75
New cards

Steps of the neighbor-joining method

Start with a star- pattern tree where all OTUs stem from same ancestor

Pick a pair at random

Calculate branch length for this new pair

Repeat with all possible pair combinations

Final tree will have the shortest total branch length

76
New cards

What is the maximum parsimony method

The identification of tree topology that requires the smallest number of evolutionary changes to explain the observed differences among the OTUs under study

77
New cards

What is the bootstrap method

A computational technique used as a means to estimate the confidence level of phylogenetic hypotheses using a null hypothesis and a series of pseudosamples