AI exam 2

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/72

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

73 Terms

1
New cards

machine learning workflow steps

  1. data preprocessing

  2. model training

  3. model evaluation

  4. model selection

  5. model testing

2
New cards

data sets used for training AI

  1. training dataset

  2. validation dataset

  3. testing dataset

3
New cards

data preprocessing

preparing raw data for model training and analysis including

  • data cleaning

  • data transformation

  • feature engineering

  • encoding categorical data

  • feature selection

  • data splitting

4
New cards

data cleaning

addressing missing values through imputation or removal, correcting errors, and addressing outliers

5
New cards

data transformation

standardizing numerical data to ensure consistency in scale and distribution

6
New cards

feature engineering

creating new, informative features, or modifying existing ones, based on domain knowledge

7
New cards

encoding categorical data

converting non-numerical categories into a numerical format

8
New cards

feature selection

identifying and retaining the most relevant features to reduce data dimensionality

9
New cards

data splitting

dividing the preprocessed data into training, validation, and test sets

10
New cards

model training

focuses on the feeding the training data into machine learning algorithms to train models by tuning parameters like weights to minimize loss

11
New cards

model training process

  • feeding a training dataset

  • comprising features (input variables)

  • labels (output variables)

12
New cards

model training objective

adjusting the parameters of the model so it can accurately map the input to the output, typically involves minimizing a loss function

13
New cards

loss function

quantifies the difference between the predicted outputs of the model and the actual outputs in the training dataho

14
New cards

how to minimize the loss function

optimization algorithms, they iteratively adjust the models parameters like weights in a neural network to minimize the difference between predictions and actual results

15
New cards

model evaluation

assesses the training models on the validation set using relevant metrics for the problem

16
New cards

how is model performance measured

accuracy, precision, recall, F1 score for classification tasks, and MSE and MAE for regression tasks

17
New cards

hyperparameter tuning

a part of the model evaluation process, the configuration settings used to structure the machine learning model to enhance its performance or generalization

18
New cards

examples of hyperparameters

  • learning rate

  • number of layers in the neural network

  • number of trees in a random forest

19
New cards

goal of hyperparameter tuning

find the “sweet spot” where the model is complex enough to capture the underlying patterns in the data, but not so complex that it overfits to the training dataset

20
New cards

parameter vs hyperparameter

a parameter is a variable that is learned from the data during the training process, a hyperparameter is a variable that is set before the training period begins to control the learning process

21
New cards

overfitting

when the model learns the training data too well leading to poor generalization on new data

22
New cards

underfitting

when the model is too simple to capture the underlying patterns in the data, resulting in poor performance on both training sets and test sets

23
New cards

model selection

picking the best validation set performer, if multiple models are similar the simplest model is preferred

24
New cards

goal of model selection

to pick the optimal model for deployment

25
New cards

factors that are used to find the optimal model for deployment

  • model simplicity

  • training time

  • resource requirements

  • ease of interpretation

26
New cards

model testing

evaluating the chosen model on an unseen test dataset, which is a representative subset of the data that is expected to show up in the actual environment

27
New cards

dataset splitting goal

developing models that not only perform well on known data but also generalize efficiently to new, unseen data

28
New cards

training dataset

largest portion of the data and is used to train the model, it allows the model to learn patterns and relationships within the data by adjusting its parameters

29
New cards

validation dataset

used to tune hyperparameters and evaluate the model’s performance during training time, helps prevent overfitting and allows for model selection and optimization

30
New cards

test dataset

kept completely separate and is only used to assess the final model’s performance, it helps detect any overfitting and ensures the model will work well when deployed in real-life scenarios

31
New cards

the importance of splitting

  1. preventing overfitting

  2. model selection, choosing between different models and hyperparameters unbiased

  3. unbiased performance estimation

  4. iterative improvement, refine the model without contaminating the test dataset

32
New cards

the challenge of overfitting

adjusting the parameters solely based on the training dataset can lead to the model learning not only the general patterns, but also the peculiarities specific to the dataset

33
New cards

how to address overfitting

we continue training the model while monitoring its performance on both the training and validation datasets

34
New cards

what indicates overfitting

errors on the validation dataset may start to increase while errors on the training dataset continue to drop

35
New cards

why do we need a separate test set

  1. detects indirect optimization, a subtle form of overfitting

  2. information leakage

  3. unbiased final evaluation

  4. detecting validation set overfitting

  5. reliable performance reporting

  6. enhancing credibility

36
New cards

neural network

designed to simulate the way a human brain analyzes and processes information

37
New cards

neural network function

consist of layers of interconnected nodes, each of which perform a simple computation; output of these is passed through an activation function which helps standardize the output for the next layer

38
New cards

layers of a neural network

  • input layer

  • hidden layers

  • output layer

39
New cards

input layer

inputs are fed into the network

40
New cards

output layer

`

41
New cards

hidden layer

performs most of the computations, transforming inputs into features the model uses to make predictions; sits between the input and the output, processing data with weights and activation functions to learn patterns

42
New cards

activation functions

determine the output of a node, primarily to introduce non-linearity to the output as most real-world data is non-linear (couldn't handle complex patterns in data)

43
New cards

sigmoid (logistic) function

takes any real-valued number and maps it into a value between 0 and 1, used for model where we need to predict the probability of an output

44
New cards

tanh (hyperbolic tangent) function

maps real-valued numbers to values in the range of -1 to 1, useful when the model needs to predict values that are normalized

45
New cards

ReLU (rectified linear unit) function

allows only positive values to pass through it, and negative values are mapped to zero, allows models to converge faster and reduce the likelihood of vanishing gradient

46
New cards

output layer activation function

chosen according to the specific task, shapes the output into a form that matches the problem statement

47
New cards

regression

type of output layer activation function, no activation function is used or linear if the output is a real-valued prediction

48
New cards

binary classification

type of output layer activation function, the sigmoid function is typically used because it maps predictions to a probability distribution between two classes

49
New cards

multiclass classification

type of output layer activation function, the softmax function is used to produce a probability distribution over multiple classes

50
New cards
51
New cards
52
New cards
53
New cards
54
New cards
55
New cards
56
New cards
57
New cards
58
New cards
59
New cards
60
New cards
61
New cards
62
New cards
63
New cards
64
New cards
65
New cards
66
New cards
67
New cards
68
New cards
69
New cards
70
New cards
71
New cards
72
New cards
73
New cards