Overlapping sequences that can be assembled to make larger pieces
2
New cards
Consensus sequence
A sequence that has pieces from multiple sequences
3
New cards
WGS sequence
Traditional or next generation, method for constructing the human genome
4
New cards
Genomic libriaries
Collections of randomly sampled DNA fragments that are cloned into plasmids. Used in traditional WGS sequencing
5
New cards
Steps for next generation sequencing
* Adaptor sequences attached to small DNA fragments (reads) * Amplify fragments via flow cell * Sequencing by DNA synthesis (uses DNA polymerase) with dyes
6
New cards
Illumina sequencing
2nd generation sequencing that produces lots of data, assembly of the genome is challenging
7
New cards
PacBio
3rd generation sequencing that produces few, long reads and is easier to assemble
8
New cards
Repetitive elements
Short, repetitive tracks for many many base pairs. Primary challenge to sequence assembly
9
New cards
Genome annotation
identification of all the functional elements of a particular genome
10
New cards
Open reading frames
DNA segment with start and stop codons
11
New cards
Complementary DNA
cDNA, derived from mRNA. Needed for annotation because we can’t sequence RNA
12
New cards
Proteome
Inventory of the proteins a genome encodes
13
New cards
transposable elements
repetitive DNA sequences that can copy, cut, and paste themselves around the genome
14
New cards
Pseudogenes
Genes that have mutated and no longer function
15
New cards
Phylogeny
the evolutionary history of a group depicted by a tree, can be inferred from genomic data
16
New cards
Homologs
Any genes that share ancestral history
17
New cards
Orthologs
Genes found in different species that are related due to originating from an ancestral species
18
New cards
Paralogs
Multiple genes in 1 species that are related due to a gene duplication event in the past
19
New cards
Synteny
Regions of different species’ genomes that are arranged in similar blocks due to common ancestry. Useful for studying model organisms
20
New cards
Barbara McClintock
Discovered transposable elements in Maize
21
New cards
Nonautonomous transposable element
Ds (disassociation) locus, can move around the genome, leading to different phenotypes
22
New cards
Activator sequence
Ac locus, encodes Transposase
23
New cards
IS elements
can move around but can't take other genes with them
24
New cards
Transposons
Can move around and take other genes with them
25
New cards
transposase
enzyme required for movement
26
New cards
Composite transposons
Can carry other genes around with them. Has 2 IS elements, each with transposase genes
27
New cards
Replicative transposition
Transposons replicate, number of transposons increases
28
New cards
Conservative transposition
Transposons cut and paste, number of transposons does not increase
29
New cards
Retrotransposons
Derived from retroviruses, all have reverse transcriptase
30
New cards
Reverse transcriptase
Enzyme that converts mRNA to cDNA
31
New cards
P elements in drosophila
Similar to bacterial IS elements, ends are short inverted repeats that flank a transposase gene.
Individuals with these elements have silencing factors in their cytoplasm
32
New cards
Hybrid dysgenesis
Production of biologically deficient hybrids from certain crosses
33
New cards
C-value paradox
Genome size varies dramatically among eukaryotes, but this is not explained by complexity
34
New cards
Haplotype
A haploid segment of a chromosome defined by its sequence
35
New cards
Haplotype network
Graphical representation of haplotype relationships
36
New cards
Hardy Weinberg rules
Random mating only, no selection, free mating, no mutations, infinite population size