PCA (Principal Component Analysis) - Video Notes Vocabulary Flashcards

0.0(0)

Studied by 0 people

View linked note

Knowt Play

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/14

Earn XP

Description and Tags

Vocabulary flashcards covering key PCA concepts from the lecture notes, including dimensionality reduction, PCA components, eigenvalues/eigenvectors, standardization, covariance, scores, and practical examples.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

15 Terms

New cards

Principal Component Analysis (PCA)

A dimensionality reduction technique that transforms a set of possibly correlated variables into a smaller set of uncorrelated variables called principal components, capturing most of the data's variance.

New cards

Dimensionality reduction

The process of reducing the number of random variables under consideration, typically by obtaining a smaller set of principal variables that retain most of the information.

New cards

Unsupervised data mining technique

A data mining method that does not use labeled outcomes; PCA is unsupervised and focuses on capturing structure/variance in the data.

New cards

Principal Component (PC)

A linear combination of original variables that explains a portion of the total variance; PCs are ordered by explained variance (PC1, PC2, …), and are uncorrelated.

New cards

Eigenvalue

A scalar indicating how much variance is captured by its corresponding eigenvector in PCA; used to rank principal components.

New cards

Eigenvector

A weight vector that defines the direction of maximum variance for a principal component; columns form the eigenvectors matrix.

New cards

Loadings

The contributions of the original variables to a principal component; elements of an eigenvector showing how much each variable contributes.

New cards

Variance explained (explainedvarianceratio_)

The proportion of total variance explained by a given principal component (e.g., PC1 explains 46.62%).

New cards

Cumulative variance

The running total of explained variance across principal components; indicates how many components are needed to reach a desired information threshold.

New cards

Uncorrelated (orthogonal) PCs

Principal components are constructed to be uncorrelated with each other, meaning their pairwise covariances are zero.

New cards

Standardization (Z-score) before PCA

Scaling variables to zero mean and unit variance because PCA is sensitive to the scale of variables.

New cards

Covariance (co-variation) matrix

Matrix of covariances between pairs of variables; its eigenvalues/eigenvectors are used to compute principal components.

New cards

PC scores

The coordinates of observations in the PC space; computed as a weighted sum of standardized variables using PC weights.