152 Machine Learning Semester 2

0.0(0)
Studied by 1 person
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/125

flashcard set

Earn XP

Description and Tags

Machine learning vocabulary

Last updated 6:56 AM on 4/28/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

126 Terms

1
New cards

Machine Learning

A field of artificial intelligence that uses statistical techniques to give computer systems the ability to learn from data.

2
New cards

Deep Learning

A subset of machine learning that uses neural networks with many layers to analyze various levels of data abstraction.

3
New cards

Keras

A high-level API for building and training neural networks, supporting multiple backend libraries like TensorFlow and PyTorch.

4
New cards

TensorFlow

A neural network library developed by Google for research purposes, known for its capabilities in deep learning and production use.

5
New cards

PyTorch

A neural network library developed by Meta, popular in research for its dynamic computation graph and debugging capabilities.

6
New cards

JAX

A machine learning framework that aims to follow NumPy's syntax and style, enabling high-performance numerical computing.

7
New cards

Scikit-learn

An industry-standard library for machine learning in Python, primarily focusing on classical machine learning algorithms.

8
New cards

Model.compile()

A Keras method used to configure the learning process by specifying optimizers, loss functions, and metrics.

9
New cards

Model.fit()

A Keras method that trains the model for a fixed number of epochs (iterations on a dataset).

10
New cards

Backends in Keras

Function sets or libraries like TensorFlow or PyTorch that Keras relies on to execute training and inference operations.

11
New cards

Sequential Model

A type of model in Keras where layers are defined in a linear stack, starting with the input layer and ending with the output layer.

12
New cards

Activation functions

Mathematical functions applied to the output of each layer in a neural network, determining the output based on the input.

13
New cards

Epochs

One complete presentation of the dataset to be estimated during training, often leading to adjusting the model's weights.

14
New cards

Batch size

The number of training examples utilized in one iteration of the training process.

15
New cards

Loss function

A function that measures the difference between the predicted values and the actual values during training.

16
New cards

Metrics

Quantitative measures used to evaluate the performance of a model during training and testing.

17
New cards

Training neural networks

The process of adjusting a neural network's weights based on training data to improve its predictions.

18
New cards

Linear regression

A statistical method for modeling the relationship between a dependent variable and one or more independent variables using a linear equation.

19
New cards

Least squares

A method for minimizing the differences between observed values and those predicted by a model.

20
New cards

Multilayer perceptron

A type of neural network consisting of multiple layers of nodes that are fully connected.

21
New cards

Forward propagation

The process of inputting data into a neural network and calculating the output based on current weights.

22
New cards

Backward propagation

The process of adjusting the weights in a neural network to minimize the loss function.

23
New cards

Batch size

The number of training instances used in one iteration to update the weights of a neural network.

24
New cards

Epoch

A full pass through the entire training dataset during training of a neural network.

25
New cards

Regularization

A technique used to prevent overfitting in predictive models by adding a penalty for larger weights.

26
New cards

Dropout

A regularization technique where a fraction of neurons are randomly ignored during training to reduce overfitting.

27
New cards

Dendrite

A branched extension of a neuron that receives signals from other neurons.

28
New cards

Node of Ranvier

A gap in the myelin sheath of a nerve, allowing for the rapid conduction of impulses.

29
New cards

Axon

A long, slender projection of a neuron that conducts electrical impulses away from the cell body.

30
New cards

Multilayer Perceptron

A type of neural network that consists of multiple layers of nodes, including an input layer, hidden layers, and an output layer.

31
New cards

Activation Function

A mathematical function that introduces non-linearity into the output of a node in a neural network, determining the activated output of that node.

32
New cards

ReLU Activation Function

An activation function defined as f(z) = max(0, z), commonly used for hidden layers in neural networks.

33
New cards

Sigmoid Activation Function

An activation function defined as f(z) = rac{1}{1 + e^{-z}}, often used for binary classification in neural networks.

34
New cards

Softmax Activation Function

An activation function used in the output layer for multi-class classification that converts logits to probabilities.

35
New cards

Weights in Neural Networks

Parameters that define the strength of the connection between nodes in a neural network, adjusted during the training process.

36
New cards

Feedforward Neural Network

A type of neural network where connections between the nodes do not form cycles, allowing data to flow in one direction.

37
New cards

Random Forests

An ensemble model that uses decision trees fitted to a subset of input features as the base models.

38
New cards

Ensemble Model

A model that combines multiple individual models to produce a better predictive performance.

39
New cards

Bootstrap Sample

A random sample of data points taken from a dataset with replacement, used in training models.

40
New cards

Feature Sampling

The selection of a random sample of features from the input features for training a model.

41
New cards

Decision Tree

A type of model that makes decisions based on asking a series of questions about the input features.

42
New cards

Prediction Aggregation

The process of combining predictions from multiple models to produce a final prediction, such as averaging or voting.

43
New cards

Feature Importance

A measure of the contribution of each input feature to the overall prediction of the model.

44
New cards

Correlated Input Features

Features that are related to each other, which can impact the performance of machine learning algorithms.

45
New cards

Gini Index

A metric used to measure the impurity or purity of a dataset; a lower Gini index indicates a more pure set.

46
New cards

Hyperparameters

Parameters that are set before the learning process begins, which can affect the performance of the model.

47
New cards

Data Cleaning

The process of addressing or fixing missing values, duplicate data, and incorrectly formatted data.

48
New cards

Data Integration

The process of combining data from different sources into a unified view.

49
New cards

Data Reduction

The process of reducing the dimensionality of the dataset, simplifying the data.

50
New cards

Data Transformation

The process of converting features into a format suitable for specific models or algorithms.

51
New cards

Imputation

A group of techniques used to replace missing values in a dataset with a reasonable estimate.

52
New cards

Univariate Imputation

Replaces missing values for a feature using only non-missing values for that same feature.

53
New cards

Multivariate Imputation

Provides more accurate imputations compared to univariate methods, especially when complex dependencies between features exist.

54
New cards

K-nearest Neighbors Imputation (KNN)

Uses the K most similar instances to a data point to impute the missing values, can handle numeric and categorical data.

55
New cards

Iterative Imputation

Uses regression to predict the missing values based on other features in the data.

56
New cards

SimpleImputation()

An initial imputation method used in iterative imputation to obtain estimates for missing values.

57
New cards

Stopping Criterion

Conditions that dictate when the iterative imputation process should stop, such as maximum iterations or specified tolerance.

58
New cards

Decision tree regressor

A decision tree used for regression tasks that divides input feature space using inequalities.

59
New cards

Leaf

A terminal node in a decision tree where the predicted value is computed from training instances.

60
New cards

Mean Squared Error (MSE)

A common measure of the average squared difference between predicted and actual values.

61
New cards

Mean Absolute Error (MAE)

A metric that measures the average magnitude of errors between predicted and actual values, without considering their direction.

62
New cards

Friedman's Improvement (FI)

A measure used to evaluate the performance of a decision tree regression model, comparing the differences in predictions.

63
New cards

Cost Complexity Pruning

A technique used in decision trees to remove nodes that do not significantly improve the model's performance.

64
New cards

Error estimate

An assessment of the accuracy of a model based on its predictions compared to actual values.

65
New cards

Performance evaluation

The process of assessing how well a decision tree regressor performs using metrics like MSE and MAE.

66
New cards

Subtree

A section of a decision tree that can be analyzed separately.

67
New cards

Cost complexity formula

Rx(T) = R(T) + alpha * |T|, where R(T) is the error estimate of the tree and |T| is the number of leaves.

68
New cards

Bootstrapping

The process of generating simulated samples by repeatedly sampling instances with replacement from an existing sample.

69
New cards

Sampling with replacement

A sampling method where some instances are selected multiple times while others may not be selected at all.

70
New cards

Out-of-bag sample (OOB)

A subset of data points that are not selected for training in a bootstrap sample, typically containing about 37% of the original unique data points.

71
New cards

Cross-validation with bootstrap

A technique where bootstrap samples are used as training sets in each iteration, and corresponding out-of-bag samples are used as validation sets.

72
New cards

Population

The complete set of items or individuals that are being studied or sampled from, in this case, comprising 60000 instances.

73
New cards

Bootstrap sample

A subset created from the original dataset by randomly sampling with replacement, utilized in bootstrapping to estimate the distribution of a statistic.

74
New cards

Satisfaction

A measure of contentment, often categorized in surveys as satisfied or unsatisfied based on responses.

75
New cards

Survey

A method of gathering information from individuals, often used to assess opinions or satisfaction in a sample population.

76
New cards

Ensemble Model

A model that combines predictions from multiple models trained on bootstrap samples, resulting in a stronger aggregated model.

77
New cards

Base Model

An individual model within an ensemble, often a decision tree used for its computational efficiency and suitability for classification and regression.

78
New cards

Bootstrap Sample

A sample drawn with replacement from the original dataset, used in the bagging algorithm to train base models.

79
New cards

Bagging Algorithm

A method involving bootstrapping to generate samples, fitting base models on those samples, and aggregating their predictions to form an ensemble.

80
New cards

Overfitting

A modeling error that occurs when a model learns the noise in the training data instead of the actual signal, leading to poor performance on unseen data.

81
New cards

Underfitting

A modeling error that occurs when a model is too simple to capture the underlying pattern of the data, resulting in poor performance even on training data.

82
New cards

Estimators

In the context of bagging, the number of base models used in the ensemble, which typically ranges between 10 and 25 for optimal performance.

83
New cards

Bias-Variance Tradeoff

The balance between the error introduced by the bias and the error introduced by the variance in a model, influencing overall prediction accuracy.

84
New cards

Averaging

The method of averaging the predicted outputs of different models to form an ensemble prediction, commonly used for regression problems in bagging.

85
New cards

Decision Tree

A hierarchical structure made up of nodes and edges used for classification tasks.

86
New cards

Node

An object in a decision tree representing a question or decision point.

87
New cards

Edge

A directed link from a parent node to a child node.

88
New cards

Leaf

A node that has no outgoing edges to child nodes and signifies a final decision.

89
New cards

Root Node

The top node of a decision tree with no parent.

90
New cards

Depth of a Node

The number of edges that must be followed to reach that node from the root node.

91
New cards

Decision Tree Classifier (DTC)

A decision tree specifically built for classification tasks, where each leaf corresponds to a class.

92
New cards

Hyperplane

A decision boundary used to divide feature space in classification tasks.

93
New cards

Gini Impurity Measure

A measure of impurity in a node given by the formula: Gini=P0(1P0)+P1(1P1)Gini = P_0 (1-P_0) + P_1 (1-P_1).

94
New cards

Overfitting

A condition where a model learns the details of the training data too well, leading to poor generalization on unseen data.

95
New cards

Early Stopping

A technique used to prevent further growth of the tree based on node statistics, aimed at reducing overfitting.

96
New cards

Cost Complexity Pruning

A method to prevent overfitting by reducing the size of an already trained decision tree.

97
New cards

Probability Estimate in DTC

The proportion of instances from the predicted class in the resulting leaf, provided by the method DTC.predictextproba(X)DTC.predict ext{-}proba(X).

98
New cards

Data Cleaning

The process of addressing and fixing missing values, duplicate data, and incorrectly formatted data.

99
New cards

Data Integration

The process of combining data from different sources into a unified view.

100
New cards

Data Reduction

The process of reducing the dimensionality of the dataset, simplifying the data.