Biostats

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/164

There's no tags or description

Looks like no tags are added yet.

Last updated 3:43 PM on 3/20/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

165 Terms

New cards

Multiple Sequence Alignment

New cards

Multiple Sequence Alignment

combines both optimal (global/local) and heuristic alignment; cannot compare DNA to protein due to different scoring matrices

New cards

Profile Alignment

profile is created by taking a finished alignment and counting the frequency of every letter and gap and each location; progressively aligns all sequences pairwise, starting with the most similar

New cards

ClutalW

cluster alignment weighted; progressive alignment strategy; neighbor joining guide tree; lower accuracy; medium speed; use for small datasets with similar sequences

New cards

T-Coffee

tree-based consistency objective function for alignment evaluation; consistency-based alignment strategy; neighbor-joining and consistency weights guide tree; medium accuracy; lower speed; use for small datasets

New cards

MUSCLE

multiple sequence comparison by log-expectation; iterative progress and refinement alignment strategy; UPGMA guide tree; higher accuracy; higher speed; use for medium-large datasets

New cards

MAFFT

multiple alignment using fast fourier transform; progressive and iterative refinement alignment strategy; UPGMA/NJ guide tree; highest accuracy; highest speed; use for large datasets

New cards

Molecular Evolution

New cards

Mutation Types

New cards

Single Base Substitutions

AKA point mutations; a single base is replaced by another

New cards

Transition

same class of nucleotide; purine to purine or pyrimidine to pyrimidine

New cards

Transversion

different class of nucleotide; purine to pyrimidine or pyrimidine to purine

New cards

Synonymous

encodes for the same amino acid

New cards

Silent Mutation

the new nucleotide alters the codon but does not alter the amino acid for which it encodes

New cards

Nonsynonymous

encodes for a different amino acid

New cards

Missense Mutation

the new nucleotide alters the codon to produce an altered amino acid in the protein product (ex

New cards

Nonsense Mutation

the new nucleotide changes a codon that specified an amino acid to a stop codon; translation of the mRNA transcribed from this mutant gene will stop prematurely

New cards

Indels

the addition or subtraction of extra base pairs; creates a change in the reading frame

New cards

Frameshift

change in the reading frame

New cards

Genome Rearrangements

large scale chromosome structure changes; can alter phenotype by 1) destroying gene function, 2) change in expression via influence of different promoters and enhancers, or 3) creating hybrid genes

New cards

Deletion and Duplication

occurs on the same chromosome

New cards

Inversion (Reversal)

occurs on the same chromosome

New cards

Translocation

occurs between different chromosomes; usually between paternal and maternal

New cards

Homolog

a gene related to other genes by evolutionary descent from a common ancestral DNA sequence

New cards

Identity

New cards

((number of identical residues))/((number of residues and gaps in th? alignment)) x 100

New cards

Similarity

some amino acid substitutions have similar side chains, leading to a smaller effect in the final protein

New cards

((number of similar residues))/((number of residues and gaps in th? alignment) ) x 100

New cards

Point Accepted Mutation (PAM)

quantifies the rate at which amino acids change over evolutionary time; assumes constant rate of change for amino acids

New cards

Constant Rate

mutations occur at a relatively steady pace over time

New cards

Independence

each amino acid position mutates independently of its neighbor

New cards

Natural Selection

only count "accepted" mutations that don't break down the protein's function and are passed down

New cards

Matrices

PAM matrices are a series, as the number increases the evolutionary distance grows

New cards

PAM #

of mutations per 100 amino acids

New cards

PAM 1

very conserved; observable mutation; small-scale evolution

New cards

PAM 250

same amino acid mutation repeatedly; not observable but extrapolated; has error associated with it; large-scale evolution

New cards

Block Substitution Matrices (BLOSUM)

based on observed alignments; aligned sequences from functional domains (blocks) of proteins; look at domains (blocks) rather than looking at entire sequence

New cards

Blocks

represents highly conserved regions that have survived natural selection

New cards

Matrices

BLOSUM matrices represent the minimum percentage identity of the sequences used to build it

New cards

Lower #

distant relatives; BLOSUM45 used for very divergent sequences

New cards

Higher #

close relatives; BLOSUM80 used for very similar sequences

New cards

Similarity Score

not all amino acid matches produce the same similarity score; add all numbers for individual score, the higher the better

New cards

Ortholog

a gene present in different species that evolved from a common ancestral gene by speciation; retain the same/similar function in the course of evolution; speciation to give two separate species

New cards

Paralog

one gene of a set of genes that underwent a duplication event in a common ancestor; evolve new functions (can be related to the original function); gene duplication and divergence

New cards

Phylogenetic Trees

New cards

Phylogenetics

method of classification of organisms based upon their evolutionary history

New cards

Phylogenetic Tree

shows the evolutionary relationships among various species or other entities that likely have a common ancestor; multiple trees possible showing multiple plausible evolutionary scenarios

New cards

Gene-Specific Phylogenies

different genes may show different phylogenetic histories; can avoid this by using multiple genes and many single-gene analyses then concatenating them

New cards

Neutral Marker

genes under similar positive selection regimes in different taxa can result in convergent evolution; can make confusing phylogenetic analysis

New cards

Connected Graph

graph containing at least one path between any two nodes

New cards

Tree

type of connected graph in which there is exactly one path between every two nodes

New cards

Rooted Tree

shows evolutionary history of the taxa; single unique node which is the ancestor of all other nodes; directed tree which shows change over time; best done by using an outgroup

New cards

Outgroup

a species or molecule that is known to be more distantly related than everything else in the tree

New cards

Ingroup

taxa being analyzed to view relationships

New cards

Unrooted Tree

shows evolutionary relationships between the taxa; can't make any statement about the direction of evolution, only the closeness of relationships

New cards

Nodes

common ancestor; rotating a tree at a node does not change the relationships between the taxa, only the way those relationships are visualized; each node called an operational taxonomic unit

New cards

Branches

evolutionary lineages

New cards

Tips/Leaves

the most recent taxa in the analysis

New cards

Cladogram

branch lengths do not represent time; branching is determined by distinguishing characteristics which identify a particular clade

New cards

Phylogram

explicitly represents number of character changes through its branch lengths; indicates the amount of evolutionary time separating taxa

New cards

Distance-Based Methods

calculate the genetic distance between pairs of taxa and construct a tree based on these distances

New cards

Unweighted Pair Group Method with Arithmetic Mean (UPGMA)

determination of phylogenetic relationships are explicitly non-historical; simply based on similarity/dissimilarity; assumes an ultrametric tree in which the distances from the root to every branch tip are equal

New cards

Steps

New cards

(1) create tree by first selecting the most closely related sequences and insert a node to represent their common ancestor

New cards

(2) then replace the selected sequences by a set containing both and replace the distances from the pair to the others by the average distances

New cards

(3) repeat

New cards

Neighbor-Joining

clustering creates an additive unrooted tree using pairwise distances; all the taxa do not diverge from a most common ancestor; does not assume that all sequences have the same rate of substitution; fast and often used as a starting point in phylogenetic analyses

New cards

Steps

New cards

(1) determine the pairwise distances between all the sequences

New cards

(2) identify the two sequences closest to each other based on their distances

New cards

(3) combine these two sequences into a single node

New cards

(4) update the distances between this new node and the other sequences

New cards

(5) repeat until all sequences are joined into a single tree

New cards

Strengths

New cards

Weaknesses

New cards

Cladistic Methods

consider the various possible trees and choose the best possible tree; tree selection criteria varies depending on the approach; slower than neighbor joining, but usually more accurate

New cards

Maximum Parsimony

finds the tree that requires the fewest number of evolutionary changes to explain the observed data

New cards

Strengths

New cards

Weaknesses

New cards

Maximum Likelihood

finds the tree that has the highest probability of producing the observed data given a specific model of evolution

New cards

Strengths