Vocabulary flashcards for key concepts in machine learning, focusing on ensemble methods, evaluation metrics, and techniques for handling imbalanced datasets.
Ensemble Learning
A method that combines the predictions of multiple models to improve overall prediction accuracy.
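A minimal sketch of the idea using scikit-learn's VotingClassifier; the dataset, base models, and settings are illustrative, and scikit-learn is assumed to be installed.

```python
# Ensemble learning sketch: combine three different models by majority vote.
from sklearn.datasets import make_classification
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

ensemble = VotingClassifier(
    estimators=[
        ("lr", LogisticRegression(max_iter=1000)),
        ("dt", DecisionTreeClassifier()),
        ("nb", GaussianNB()),
    ],
    voting="hard",  # hard voting = majority vote over predicted labels
)
ensemble.fit(X_train, y_train)
print("Ensemble accuracy:", ensemble.score(X_test, y_test))
```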
Bagging
A technique that reduces variance by training multiple models independently using random subsets of data.
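A minimal bagging sketch with scikit-learn's BaggingClassifier (the default base learner is a decision tree); the data and hyperparameters are illustrative.

```python
# Bagging sketch: many independent models, each trained on a bootstrap sample.
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier

X, y = make_classification(n_samples=500, random_state=0)

# 50 base learners, each fit on a random sample of the data drawn with replacement.
bagger = BaggingClassifier(n_estimators=50, bootstrap=True, random_state=0)
bagger.fit(X, y)
print("Training accuracy:", bagger.score(X, y))
```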
Boosting
A sequential technique that reduces bias by training models one after another, focusing on examples that previous models misclassified.
Bootstrapping
A sampling technique used to create multiple subsets of data from a single dataset, with replacement.
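A small NumPy sketch of bootstrapping; the dataset and number of samples are illustrative.

```python
# Bootstrapping sketch: draw subsets with replacement from a single dataset.
import numpy as np

rng = np.random.default_rng(0)
data = np.arange(10)  # original dataset

# Each bootstrap sample has the same size as the original; values can repeat
# within a sample because sampling is done with replacement.
bootstrap_samples = [rng.choice(data, size=len(data), replace=True) for _ in range(3)]
for i, sample in enumerate(bootstrap_samples):
    print(f"bootstrap sample {i}: {sample}")
```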
Bias
The error due to overly simplistic assumptions in the learning algorithm.
Variance
The error due to excessive sensitivity to small fluctuations in the training set.
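For reference, these two error terms come from the standard bias-variance decomposition of expected squared error; a sketch of that decomposition:

```latex
% For a target y = f(x) + \varepsilon with noise variance \sigma^2
% and a learned predictor \hat{f}(x):
\mathbb{E}\big[(y - \hat{f}(x))^2\big]
  = \underbrace{\big(\mathbb{E}[\hat{f}(x)] - f(x)\big)^2}_{\text{bias}^2}
  + \underbrace{\mathbb{E}\big[(\hat{f}(x) - \mathbb{E}[\hat{f}(x)])^2\big]}_{\text{variance}}
  + \underbrace{\sigma^2}_{\text{irreducible error}}
```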
Random Forest
An ensemble method that trains many decision trees on bootstrap samples, using random feature subsets at each split, and aggregates their predictions for classification and regression.
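A minimal random forest sketch with scikit-learn; the data and hyperparameters are illustrative.

```python
# Random forest sketch: 100 trees on bootstrap samples, random feature subsets per split.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=500, random_state=0)

forest = RandomForestClassifier(n_estimators=100, max_features="sqrt", random_state=0)
forest.fit(X, y)
print("Training accuracy:", forest.score(X, y))
```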
AdaBoost
An ensemble method that adjusts the weights of instances based on previous classifiers’ errors.
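A minimal AdaBoost sketch with scikit-learn; parameters are illustrative.

```python
# AdaBoost sketch: each new weak learner (a shallow tree by default) is trained
# on instance weights increased for examples the previous learners got wrong.
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier

X, y = make_classification(n_samples=500, random_state=0)

ada = AdaBoostClassifier(n_estimators=100, learning_rate=0.5, random_state=0)
ada.fit(X, y)
print("Training accuracy:", ada.score(X, y))
```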
Gradient Boosting
A method where new models are added to correct errors made by existing models.
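A minimal gradient boosting sketch with scikit-learn; parameters are illustrative.

```python
# Gradient boosting sketch: each new tree is fit to the errors (loss gradient)
# left by the current ensemble, then added to it.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier

X, y = make_classification(n_samples=500, random_state=0)

gbm = GradientBoostingClassifier(
    n_estimators=100, learning_rate=0.1, max_depth=3, random_state=0
)
gbm.fit(X, y)
print("Training accuracy:", gbm.score(X, y))
```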
XGBoost
An optimized gradient boosting framework that is widely used for its performance.
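A minimal sketch using the xgboost package's scikit-learn-style estimator; it assumes the xgboost package is installed, and the parameters are illustrative.

```python
# XGBoost sketch via its scikit-learn-compatible classifier.
from sklearn.datasets import make_classification
from xgboost import XGBClassifier

X, y = make_classification(n_samples=500, random_state=0)

model = XGBClassifier(n_estimators=100, learning_rate=0.1, max_depth=3)
model.fit(X, y)
print("Training accuracy:", model.score(X, y))
```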
Confusion Matrix
A table that summarizes the performance of a classification algorithm by comparing predicted vs actual classifications.
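A minimal confusion matrix sketch with scikit-learn; the toy labels are made up for illustration.

```python
# Confusion matrix sketch: rows are actual classes, columns are predicted classes.
from sklearn.metrics import confusion_matrix

y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

# For binary labels [0, 1] the layout is:
# [[TN, FP],
#  [FN, TP]]
print(confusion_matrix(y_true, y_pred))  # -> [[3 1] [1 3]]
```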
True Positive (TP)
Instances correctly predicted as the positive class.
False Positive (FP)
Instances incorrectly predicted as the positive class.
True Negative (TN)
Instances correctly predicted as the negative class.
False Negative (FN)
Instances incorrectly predicted as the negative class.
Precision
The ratio of true positive predictions to the total predicted positive cases.
Recall
The ratio of true positive predictions to the total actual positive cases.
F1 Score
The harmonic mean of precision and recall, giving a single score that balances the two.
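A minimal sketch computing precision, recall, and F1 with scikit-learn, reusing the toy labels from the confusion-matrix example above.

```python
# Precision, recall, and F1 on the toy labels.
from sklearn.metrics import precision_score, recall_score, f1_score

y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

# precision = TP / (TP + FP), recall = TP / (TP + FN),
# F1 = 2 * precision * recall / (precision + recall)
print("precision:", precision_score(y_true, y_pred))  # 3 / (3 + 1) = 0.75
print("recall:   ", recall_score(y_true, y_pred))      # 3 / (3 + 1) = 0.75
print("f1:       ", f1_score(y_true, y_pred))          # 0.75
```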
Imbalanced Dataset
A dataset where the distribution of classes is not uniform, affecting model performance.
Oversampling
The process of increasing the number of instances in the minority class in an imbalanced dataset.
Undersampling
The process of reducing the number of instances in the majority class in an imbalanced dataset.
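A minimal sketch of random oversampling and undersampling using sklearn.utils.resample; the toy imbalanced data is made up, and dedicated libraries such as imbalanced-learn offer richer strategies.

```python
# Over- and undersampling sketch on a toy imbalanced dataset.
import numpy as np
from sklearn.utils import resample

X = np.arange(20).reshape(-1, 1)
y = np.array([0] * 15 + [1] * 5)  # majority class 0, minority class 1

X_min, y_min = X[y == 1], y[y == 1]
X_maj, y_maj = X[y == 0], y[y == 0]

# Oversampling: duplicate minority instances (with replacement) up to the majority size.
X_min_up, y_min_up = resample(
    X_min, y_min, replace=True, n_samples=len(y_maj), random_state=0
)

# Undersampling: drop majority instances (without replacement) down to the minority size.
X_maj_down, y_maj_down = resample(
    X_maj, y_maj, replace=False, n_samples=len(y_min), random_state=0
)

print("oversampled class counts:", np.bincount(np.concatenate([y_maj, y_min_up])))    # [15 15]
print("undersampled class counts:", np.bincount(np.concatenate([y_maj_down, y_min])))  # [5 5]
```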