bioinformatics final exam broad quizlet

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall with Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/168

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No study sessions yet.

169 Terms

1
New cards
2
New cards
3
New cards
4
New cards
5
New cards
EXPASY
Portal linking many bioinformatics tools, including UniProt and Prosite.
6
New cards
UniProt
Main site for full protein information: sequence, function, domains, variants.
7
New cards
Swiss-Prot
Manually reviewed, high-quality UniProt protein entries.
8
New cards
Prosite
Database of protein domains, families, and functional sites.
9
New cards
PDB
Protein structure database with 3D models from X-ray, NMR, cryo-EM, and AlphaFold uploads.
10
New cards
PDB-101
Educational site explaining how to interpret protein structures.
11
New cards
BLAST
Fast local alignment tool using heuristic seed-and-extend search.
12
New cards
Heuristic search
Fast method using rules and short matches instead of full comparisons.
13
New cards
E-value
Expected number of random matches; lower = more significant.
14
New cards
BLASTN
Compares nucleotide query to nucleotide database.
15
New cards
BLASTP
Compares protein query to protein database.
16
New cards
BLASTX
Translates nucleotide sequence and searches protein databases.
17
New cards
TBLASTN
Searches a protein query against translated nucleotide databases.
18
New cards
Ortholog
Genes in different species from a common ancestor; usually same function.
19
New cards
Paralog
Duplicated genes within the same species; may gain new functions.
20
New cards
Conserved domain
Shared functional region of a protein preserved across evolution.
21
New cards
Local alignment
Matches only the most similar region between sequences.
22
New cards
Compare splice variants (BLASTN)
Run BLASTN, check coding region coordinates, compare exon differences.
23
New cards
Find human ortholog (BLASTP)
BLAST yeast protein vs human; pick lowest E-value + longest alignment.
24
New cards
Identify paralogs
BLASTP in human database; multiple strong matches = paralogs.
25
New cards
Find conserved domains
Use conserved domain section of BLAST result to see matched regions.
26
New cards
Evaluate BLAST hits
Check E-value, identity, coverage, length of alignment, domain overlap.
27
New cards
Find protein length
Open UniProt entry → see aa count under “Sequence.”
28
New cards
Find transmembrane domains
Check “Topology” or “Domains” section.
29
New cards
Find protein family
See “Family and Domains” section.
30
New cards
Find interactions (BioGRID)
Scroll to “Interactions” → follow BioGRID link.
31
New cards
Check similar proteins in humans
Use “Homology” section.
32
New cards
How to solve 6.2–6.6
Use the correct database or BLAST tool, record lengths/E-values/domains, compare sequences.
33
New cards
Systems biology

uses large data sets to understand behavior of collective components

34
New cards
Holistic approach
Looks at entire system behavior and emergent properties rather than isolated components.
35
New cards
Interdisciplinary
Combines biology, math, computing, engineering, and data science.
36
New cards
Modeling
Uses computational and mathematical models to simulate pathways, reactions, and system behavior.
37
New cards
Data integration
Combines genomics, transcriptomics, proteomics, and metabolomics to analyze whole systems.
38
New cards
Applications
Used for medicine, pathway analysis, target discovery, metabolic engineering, synthetic biology.
39
New cards
Complex system
Biological system made of interacting components like genes, proteins, and metabolites with feedback loops and boundaries.
40
New cards
Inputs and outputs
Inputs include nutrients or signals; outputs include growth, metabolism, hormone release.
41
New cards
Feedback control
Positive and negative loops that maintain system balance.
42
New cards
Boundary
The physical limit of a system (such as a cell membrane) separating inside from outside.
43
New cards
Chaos
Highly sensitive system behavior where small changes cause major differences over time.
44
New cards
Emergent property
A property that appears only when parts work together (e.g., muscle contraction from multiple tissues).
45
New cards
Attractor
A stable long-term state a biological system tends to return to.
46
New cards
Tipping point
The threshold where a system shifts into a new state or behavior.
47
New cards
Therapeutic window
Range between helpful dose and harmful dose of a drug; tipping points mark its edges.
48
New cards
Genomics
Study of all DNA and gene sequences in an organism.
49
New cards
Transcriptomics
Study of gene expression using mRNA levels.
50
New cards
Proteomics
Study of proteins present in a cell, including amounts and functions.
51
New cards
Metabolomics
Study of small molecules and metabolic reactions in cells.
52
New cards
Omics integration
Combining all "omes" to understand how changes at one level affect the entire system.
53
New cards
Examples of system layering
Changes in genes affect transcripts, proteins, and metabolic outputs.
54
New cards
GLP-1
Glucagon-like peptide involved in glucose regulation and insulin signaling.
55
New cards
Modeling GLP-1 agonists
Simulates how GLP-1 drugs affect metabolism, insulin release, pathways, and dosing.
56
New cards
Benefits of GLP-1 modeling
Predicts glucose changes, tests doses, finds tipping points, reduces lab experiments, improves diabetes treatment.
57
New cards
System components
Genes, proteins, metabolites, and pathways that interact to produce behavior.
58
New cards
System modularity
Systems are built from smaller interacting modules (e.g., pathways, tissue types).
59
New cards
Chaos vs predictability
Systems can be predictable short-term but chaotic long-term.
60
New cards
Pioneers of systems science
Alan Turing, Benoit Mandelbrot, Leroy Hood.
61
New cards
Systems biology software
Includes Cytoscape for networks, COPASI for model simulations, Python/Jupyter for data, and KBase for large-scale analysis.
62
New cards
circulating tumor DNA (ctDNA)
Small DNA fragments shed by tumors into the bloodstream; used for noninvasive cancer detection.
63
New cards
How to analyze ctDNA
Compare mutations, methylation, and fragment size to normal DNA; use databases like COSMIC and TCGA.
64
New cards
ctDNA differences
Somatic mutations, shorter fragment sizes, abnormal methylation, copy number changes, rearrangements.
65
New cards
Why some cancers chosen for ctDNA
Common cancers with well-studied, targetable mutations and high levels of ctDNA shedding.
66
New cards

67
New cards
Evolution
Genetic change in a population over time; descent with modification.
68
New cards
Main mechanisms of evolution
Mutation, natural selection, genetic drift, gene flow.
69
New cards
Gradualism
Slow, steady evolutionary change.
70
New cards
Punctuated equilibrium
Rapid bursts of change with long periods of stasis.
71
New cards
Allopatric speciation
New species form in isolated populations.
72
New cards
Sympatric speciation
New species arise in the same geographic area.
73
New cards
Parsimony
In phylogenetics, the simplest tree requiring the fewest mutations is preferred.
74
New cards
Cladistics
Groups species by shared derived traits (synapomorphies).
75
New cards
Outgroup
Determines what traits are ancestral in a tree.
76
New cards
Molecular clock
Uses mutation rates to estimate divergence times.
77
New cards
Limit of molecular clocks
Mutation rates vary across genes and lineages, reducing accuracy.
78
New cards
Distance method
for trees
79
New cards
Informative site
Shared derived base supporting a specific grouping.
80
New cards
Noninformative site
Same in all taxa; gives no relationship information.
81
New cards
Metagenomics
Sequencing all DNA directly from an environment to study whole communities.
82
New cards
CGP
Comprehensive genomic profiling; broad NGS test detecting many mutation types.
83
New cards
Advantage of CGP
Detects SNVs, indels, CNVs, fusions, splice variants using small tissue and data can be reanalyzed.
84
New cards
SNV
Single base substitution.
85
New cards
Indel
Small insertion or deletion.
86
New cards
CNV
Gene amplification or deletion.
87
New cards
Fusion
Rearranged gene joining two gene parts.
88
New cards
Splice variant
Altered RNA isoform caused by changes in splicing.
89
New cards
TMB
Number of coding mutations; predicts immunotherapy response.
90
New cards
MSI
Instability in short DNA repeats; important biomarker for IO therapy.
91
New cards
ctDNA
Tumor DNA fragments in blood; used when tissue biopsy isn’t available.
92
New cards
Advantages of matching therapy
Improves response rates, survival, and side-effect profiles.
93
New cards
Most targeted NSCLC gene
EGFR has the most targeted drugs.
94
New cards
Hotspot panel limit
May miss ~81% of mutations and often fails to detect splice variants.
95
New cards
NCCN and ESMO
Groups that set cancer testing guidelines.
96
New cards
Proteins conserved because
Changes would disrupt essential or critical biological function.
97
New cards
Using DNA for phylogeny
DNA differences reveal species relationships and divergence timing.
98
New cards
How to build a tree
Align sequences, find informative sites, apply parsimony or distance methods.
99
New cards
Why emergent properties occur
Interactions among components create new behaviors not seen in individual parts.
100
New cards