L12 - Proteomics
Intro to Bioinformatics
bioinformatics - a scientific subdiscipline that involves computer tech to collect, store, analyze, and disseminate biological data and info such as DNA and amino sequences, etc.
Applications include:
id new drug targets
understanding disease mechanisms
designing new drugs
predicting interactions btwn compound and enzyme
predicting drug responses/safety
streamlining clinical trials
shortening timeline and reducing cost of drug development
reducing risk of side effects
fostering growth of personalized medicine
Bioinformatics Tools:
Databases - sequence storage
archival - genbank
curated - uniprot
computational models - mathematical models to describe biological systems
software
sequence alignment
function analysis
image analysis
clinical
machine learning
protein structure predictions
drug response predictions
UniProt - Knowledgebase has lots of info
Techniques used to determine protein conformation:
x-ray crystallography - uses diffracted x-rays from a protein crystal to generate electron density map, indicating the atomic positions of the protein (only for those that can readily crystallize)
NMR spectroscopy - reveals structure + dynamics of proteins in solution by iding protons in close proximity
Cryo-electron microscopy - a rapidly developing method that can elucidate the structures of large multimeric complexes at increasingly higher resolutions
Protein Structure Prediction
one the most important goals of computational biology (very challenging)
Approaches:
Ab initio - without prior knowledge; calculations that attempt to minimize the free energy of a structure
Knowledge-based - an unknown primary structure is examined for compatibility with known protein structures/fragments
Predictions + Analysis of 3D biomolecules
blah blah blah
Proteomics encompasses the large-scale study of proteins, particularly their functions and structures, utilizing advanced techniques such as mass spectrometry and bioinformatics to elucidate protein interactions and modifications.
Genomics is the study of a person’s genes
Methods for sequencing DNA
Sanger (Dideoxy) sequencing - uses DNA polymerase + chain-terminating nucleotides (dideoxyribonucleoside triphosphates) to block further elongation of a DNA strand once incorporated
does not have the 3’ OH that would have allowed extension
Next Generation Sequencing (NGS) - Illumina; allows sequencing of millions of DNA molecules simultaneously
high speed, reduced costs
whole genome can be broken into millions of fragments then sequences are amplified on a surface w/ covalently attached linkers
RNA-seq - detects the presence and quantitation of all the RNA molecules in a cell under specific conditions
step 1: isolate RNA from cell/tissue of interest
step 2: select for RNA by filtering for sequences containing poly(A) tails
step 3: synthesize cDNA using reverse transcriptase
step 4: sequence cDNA using NGS
step 5: using computational algorithms to assemble the sequencing data