Base ML

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/185

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 2:49 PM on 5/17/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

186 Terms

1
New cards

What is machine learning?

A way to learn patterns from data so a system can make predictions or decisions without being explicitly programmed for every case.

2
New cards

What is a feature?

An input variable used by a model to make a prediction.

3
New cards

What is a label?

The target value a supervised model is trained to predict.

4
New cards

What is a training example?

One row or instance of data containing features and usually a label.

5
New cards

What is supervised learning?

Learning from examples where the correct target values are known.

6
New cards

What is unsupervised learning?

Finding patterns or structure in data without known target labels.

7
New cards

What is semi-supervised learning?

Training with a small amount of labeled data plus a larger amount of unlabeled data.

8
New cards

What is reinforcement learning?

Learning actions through rewards and penalties from interaction with an environment.

9
New cards

What is regression?

A supervised task where the model predicts a numeric value.

10
New cards

What is classification?

A supervised task where the model predicts a category or class.

11
New cards

What is binary classification?

Classification with two possible classes such as fraud or not fraud.

12
New cards

What is multiclass classification?

Classification with more than two possible classes where each example belongs to one class.

13
New cards

What is multilabel classification?

Classification where one example can belong to multiple classes at the same time.

14
New cards

What is a model?

A mathematical function or learned system that maps inputs to outputs.

15
New cards

What are model parameters?

Values learned during training such as weights in linear regression or a neural network.

16
New cards

What are hyperparameters?

Settings chosen before or during training that control how learning happens.

17
New cards

What is a training set?

Data used to fit model parameters.

18
New cards

What is a validation set?

Data used to tune model choices and hyperparameters during development.

19
New cards

What is a test set?

Held-out data used once near the end to estimate final model performance.

20
New cards

Why split data into train validation and test sets?

To estimate how well the model generalizes to new data and avoid evaluating on data used for training decisions.

21
New cards

What is generalization?

A model's ability to perform well on new unseen data.

22
New cards

What is overfitting?

When a model learns noise or quirks of the training data and performs poorly on new data.

23
New cards

What is underfitting?

When a model is too simple or poorly trained to capture the real pattern.

24
New cards

What is bias in the bias variance tradeoff?

Error from assumptions that are too simple and miss the true relationship.

25
New cards

What is variance in the bias variance tradeoff?

Error from being too sensitive to the particular training data.

26
New cards

What is the bias variance tradeoff?

Reducing bias often increases variance and reducing variance often increases bias; good models balance both.

27
New cards

What is data leakage?

When training data includes information that would not be available at prediction time.

28
New cards

Why is data leakage dangerous?

It makes validation results look better than real-world performance.

29
New cards

What is target leakage?

A feature accidentally contains the answer or information created after the target is known.

30
New cards

What is train test contamination?

The same or closely related examples appear in both training and evaluation data.

31
New cards

What is cross validation?

A resampling method that trains and evaluates a model on multiple splits of the data.

32
New cards

What is k fold cross validation?

The data is split into k parts; each part is used once as validation while the rest is used for training.

33
New cards

What is stratified sampling?

Sampling that preserves the class proportions across train validation and test splits.

34
New cards

What is time series split validation?

A validation method that trains on past data and evaluates on later data to respect time order.

35
New cards

Why should time series data not be randomly shuffled for validation?

Random shuffling can let future information influence training and create leakage.

36
New cards

What is point in time correctness?

Using only data that would have been known at the exact time a prediction was made.

37
New cards

What is a baseline model?

A simple reference model used to judge whether a more complex model is actually useful.

38
New cards

What is a dummy classifier?

A simple classifier that predicts using a basic rule such as the most common class.

39
New cards

What is linear regression?

A model that predicts a numeric target as a weighted sum of input features.

40
New cards

What is logistic regression?

A classification model that estimates class probability using a linear score passed through a sigmoid or similar link.

41
New cards

What is a decision tree?

A model that makes predictions by following a sequence of feature-based splits.

42
New cards

What is a random forest?

An ensemble of decision trees trained on random samples and random feature subsets.

43
New cards

What is gradient boosting?

An ensemble method that builds trees sequentially so each new tree corrects errors from previous trees.

44
New cards

What is XGBoost LightGBM or CatBoost?

Popular high-performance gradient boosting libraries for tabular data.

45
New cards

What is k nearest neighbors?

A method that predicts based on the labels or values of the closest training examples.

46
New cards

What is support vector machine?

A model that tries to find a separating boundary with the largest margin between classes.

47
New cards

What is naive Bayes?

A probabilistic classifier that uses Bayes rule with a strong feature independence assumption.

48
New cards

What is a neural network?

A model made of layers of connected units that learn nonlinear patterns from data.

49
New cards

What is a perceptron?

A basic neural network unit that computes a weighted sum and applies an activation function.

50
New cards

What is an activation function?

A function that adds nonlinearity to a neural network such as ReLU sigmoid or tanh.

51
New cards

What is backpropagation?

The algorithm that computes gradients through a neural network so weights can be updated.

52
New cards

What is gradient descent?

An optimization method that updates parameters in the direction that reduces loss.

53
New cards

What is stochastic gradient descent?

Gradient descent using one example or a small batch at a time rather than the full dataset.

54
New cards

What is a learning rate?

A hyperparameter that controls how large each parameter update is during optimization.

55
New cards

What happens if the learning rate is too high?

Training may be unstable and fail to converge.

56
New cards

What happens if the learning rate is too low?

Training may be very slow or get stuck before reaching a good solution.

57
New cards

What is a loss function?

A function that measures how wrong a model prediction is during training.

58
New cards

What is mean squared error?

A regression loss or metric that averages squared prediction errors.

59
New cards

What is mean absolute error?

A regression metric that averages the absolute size of prediction errors.

60
New cards

What is root mean squared error?

The square root of mean squared error which puts the error back in target units.

61
New cards

What is R squared?

A regression metric that measures the share of target variation explained by the model.

62
New cards

What is accuracy?

The fraction of predictions that are correct.

63
New cards

When can accuracy be misleading?

When classes are imbalanced and predicting the majority class looks good despite poor usefulness.

64
New cards

What is a confusion matrix?

A table showing counts of true positives false positives true negatives and false negatives.

65
New cards

What is a true positive?

A positive example correctly predicted as positive.

66
New cards

What is a false positive?

A negative example incorrectly predicted as positive.

67
New cards

What is a true negative?

A negative example correctly predicted as negative.

68
New cards

What is a false negative?

A positive example incorrectly predicted as negative.

69
New cards

What is precision?

Among predicted positives the fraction that were truly positive.

70
New cards

What is recall?

Among actual positives the fraction the model correctly found.

71
New cards

What is sensitivity?

Another name for recall or true positive rate.

72
New cards

What is specificity?

Among actual negatives the fraction the model correctly predicted as negative.

73
New cards

What is F1 score?

The harmonic mean of precision and recall.

74
New cards

When is F1 score useful?

When you need to balance precision and recall especially with imbalanced classes.

75
New cards

What is ROC AUC?

A threshold-independent ranking metric based on true positive rate versus false positive rate.

76
New cards

What is PR AUC?

Area under the precision recall curve often useful for rare positive classes.

77
New cards

What is log loss?

A classification metric that penalizes confident wrong probability predictions.

78
New cards

What is calibration?

How well predicted probabilities match actual outcome frequencies.

79
New cards

What is class imbalance?

A situation where one class appears much more often than another.

80
New cards

How can class imbalance be handled?

Use better metrics resampling class weights threshold tuning or more data for the rare class.

81
New cards

What is a classification threshold?

The probability cutoff used to turn predicted probabilities into class labels.

82
New cards

Why tune a classification threshold?

To match business costs such as preferring fewer false negatives or fewer false positives.

83
New cards

What is regularization?

A technique that discourages overly complex models to reduce overfitting.

84
New cards

What is L1 regularization?

A penalty based on absolute weight size that can drive some weights to zero.

85
New cards

What is L2 regularization?

A penalty based on squared weight size that shrinks weights toward zero.

86
New cards

What is dropout?

A neural network regularization method that randomly disables units during training.

87
New cards

What is early stopping?

Stopping training when validation performance stops improving.

88
New cards

What is feature scaling?

Transforming numeric features so they are on comparable ranges.

89
New cards

When is feature scaling important?

It is important for models based on distances or gradients such as KNN SVM and neural networks.

90
New cards

What is standardization?

Scaling a feature to have mean zero and standard deviation one.

91
New cards

What is normalization?

Scaling values to a fixed range often zero to one.

92
New cards

What is one hot encoding?

Representing categories as separate binary indicator columns.

93
New cards

What is ordinal encoding?

Representing categories with ordered integers when the order is meaningful.

94
New cards

What is target encoding?

Encoding a category using the average target value for that category with care to avoid leakage.

95
New cards

What is imputation?

Filling in missing values using a rule or model.

96
New cards

What is feature engineering?

Creating or transforming input variables to make patterns easier for a model to learn.

97
New cards

What is feature selection?

Choosing a subset of useful features and removing irrelevant or harmful ones.

98
New cards

What is dimensionality reduction?

Reducing the number of features while trying to preserve important information.

99
New cards

What is PCA?

Principal component analysis finds new directions that capture as much variance as possible.

100
New cards

What is clustering?

Grouping similar examples without using target labels.