Key Concepts in Bioinformatics and Protein Structure

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/34

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

35 Terms

1
New cards

Hidden Markov Model (HMM)

A statistical model used for sequence analysis; can model protein domains from multiple sequence alignments.

2
New cards

Position Weight Matrix (PWM)

Represents the frequency of each nucleotide or amino acid at each position in a sequence motif; used in motif discovery.

3
New cards

Inferential Statistics

Statistical testing to determine if gene expression differences between conditions are significant (e.g., t-tests, FDR correction).

4
New cards

Normalization (RNA-seq)

Adjusts RNA-seq read counts for sequencing depth and gene/transcript length (e.g., TPM, RPKM, DESeq2).

5
New cards

Clustering Analysis

Groups genes/samples with similar expression patterns; does not imply causality or statistical significance.

6
New cards

Differentially Expressed Genes (DEGs)

Genes with statistically significant differences in expression between conditions. Identified using inferential stats.

7
New cards

RNA-Seq

Sequencing method for transcriptome profiling; detects novel transcripts and isoforms.

8
New cards

DNA Microarray

Chip-based expression analysis; limited to known genes/transcripts.

9
New cards

DAVID

Identifies overrepresented functional terms (GO, KEGG) in gene lists.

10
New cards

GEO2R

Online tool to find differentially expressed genes from GEO datasets.

11
New cards

MEME

Discovers sequence motifs (e.g., TF binding sites) in DNA/protein sequences.

12
New cards

Cufflinks

Assembles transcripts and quantifies RNA-seq data; not for functional enrichment.

13
New cards

Domain

A structurally/functionally independent unit in a protein.

14
New cards

Fold

The core 3D structure of a protein domain.

15
New cards

Secondary Structure

Local structures (alpha helices, beta sheets), stabilized by hydrogen bonds.

16
New cards

Tertiary Structure

The complete 3D shape of a single polypeptide.

17
New cards

Quaternary Structure

The 3D arrangement of multiple polypeptides in a complex.

18
New cards

Intrinsically Disordered Proteins (IDPs)

Proteins/regions lacking fixed 3D structure; still functional.

19
New cards

RCSB Protein Data Bank (PDB)

Database of experimentally-determined 3D structures of proteins/nucleic acids.

20
New cards

VAST (Vector Alignment Search Tool)

Compares 3D protein structures to find structural similarities.

21
New cards

CATH Database

Classifies protein structures by: Class, Architecture, Topology, and Homologous superfamily.

22
New cards

DALI

Compares 3D protein structures via inter-residue distance matrices.

23
New cards

AlphaFold

DeepMind's AI tool for accurate 3D protein structure prediction.

24
New cards

Molecular Docking

Predicts molecular interactions (e.g., ligand-protein binding); useful in drug discovery.

25
New cards

Homology Modeling

Predicts protein structure using a known template with sequence similarity.

26
New cards

Ab initio Prediction

Predicts protein structure from scratch, without templates.

27
New cards

Artificial Neural Network

Used for secondary structure prediction (e.g., PSIPRED).

28
New cards

Phylogenetic Tree

Diagram showing evolutionary relationships.

29
New cards

Neighbor-Joining

Distance-based method for tree building.

30
New cards

Maximum Likelihood

Builds tree based on most probable evolutionary model.

31
New cards

Maximum Parsimony

Builds tree with the fewest evolutionary changes.

32
New cards

k-means Clustering

Groups data using ML; not used for phylogenetic trees.

33
New cards

Scale-Free Network

Network with hub nodes; follows power-law distribution.

34
New cards

Small-World Property

Most nodes connected through a small number of steps; common in biological systems.

35
New cards

Random Network

Edges randomly distributed; lacks structure of biological networks.