17.1 Notes - Machine Learning & Logistic Regression (Vocabulary Flashcards)

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/42

flashcard set

Earn XP

Description and Tags

Vocabulary flashcards covering fundamental ML concepts, logistic regression, model evaluation, and hyperparameter tuning from the lecture notes.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

43 Terms

1
New cards

Machine Learning

A branch of artificial intelligence that enables algorithms to learn from data to make predictions or decisions without explicit programming.

2
New cards

Artificial Intelligence

A broad field focused on making machines perform tasks that require human intelligence; includes ML, DL, and rule-based systems.

3
New cards

Data preprocessing

Cleaning, transforming, and preparing data (handling missing values, scaling) before feeding it to a model.

4
New cards

Training data

Labeled data used to fit a machine learning model.

5
New cards

Test data

Data used to evaluate model performance after training.

6
New cards

Model

A mathematical representation built from data to make predictions or decisions.

7
New cards

Hyperparameter

A model setting chosen before training that controls learning behavior and model capacity.

8
New cards

Parameter

A quantity learned from data during training (e.g., coefficients in regression).

9
New cards

GridSearchCV

A tool that exhaustively searches over hyperparameter combinations using cross-validation to find the best settings.

10
New cards

Cross-validation

A model evaluation method that splits data into training and validation sets multiple times to estimate performance.

11
New cards

K-fold cross-validation

A type of cross-validation where data is split into k folds; each fold serves as validation once.

12
New cards

Feature scaling

Preprocessing to standardize or normalize features so they contribute equally to the model.

13
New cards

Standardization

Scaling features to zero mean and unit variance.

14
New cards

Normalization

Scaling features to a fixed range, typically [0, 1].

15
New cards

Logistic Regression

A supervised statistical technique to model the probability of a dependent variable using a logistic (sigmoid) function.

16
New cards

Sigmoid function

An S-shaped function mapping real numbers to a probability in the range 0 to 1.

17
New cards

Logistic model equation

p = 1 / (1 + e^{-(β0 + β1 x)}), the predicted probability.

18
New cards

Binary Logistic Regression

Logistic regression where the dependent variable has two possible outcomes/classes.

19
New cards

Multinomial Logistic Regression

Logistic regression for a dependent variable with three or more unordered categories.

20
New cards

Ordinal Logistic Regression

Logistic regression for an ordinal dependent variable with ordered categories.

21
New cards

Decision boundary

The boundary in feature space that separates predicted classes.

22
New cards

Confusion Matrix

A table comparing predicted vs actual class labels (TP, FP, TN, FN).

23
New cards

True Positive

Predicted positive and actual positive.

24
New cards

True Negative

Predicted negative and actual negative.

25
New cards

False Positive

Predicted positive but actually negative.

26
New cards

False Negative

Predicted negative but actually positive.

27
New cards

Accuracy

Proportion of correct predictions: (TP + TN) / (TP + TN + FP + FN).

28
New cards

Precision

Proportion of predicted positives that were actually positive: TP / (TP + FP).

29
New cards

Recall (Sensitivity)

Proportion of actual positives that were correctly predicted: TP / (TP + FN).

30
New cards

F1 Score

Harmonic mean of precision and recall: 2 * (Precision * Recall) / (Precision + Recall).

31
New cards

AUC-ROC

Area Under the Receiver Operating Characteristic Curve; measures classifier’s ability to distinguish classes.

32
New cards

Threshold

Probability cutoff used to assign a class; changing it affects precision and recall.

33
New cards

Odds

Ratio of the probability of an event to its complement: p / (1 - p).

34
New cards

Log odds

Natural logarithm of the odds; the link used in logistic regression.

35
New cards

Odds ratio

Exp(β); the multiplicative change in odds for a one-unit increase in a predictor.

36
New cards

Intercept

β0; the log odds when all predictors are zero.

37
New cards

Coefficient

βi; the change in log odds per one-unit increase in predictor i, holding others constant.

38
New cards

Multicollinearity

High correlation among independent variables; detected by metrics like VIF.

39
New cards

Variance Inflation Factor (VIF)

A metric to detect multicollinearity by quantifying variance inflation.

40
New cards

Box-Tidwell test

Test for linearity of the logit with respect to continuous predictors.

41
New cards

Likelihood interpretation of coefficients

Coefficients reflect log odds; exponentiated coefficients give odds ratios.

42
New cards

Decision boundary concept

A boundary that the model uses to separate classes and make predictions.

43
New cards

Feature scaling (overview)

Standardization or normalization to improve model performance and convergence.