AI midterm

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/52

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

53 Terms

1
New cards
Linear Regression
Models the relationship between a dependent variable (y) and independent variables (x1, x2,..., xn).
2
New cards
Residuals
The differences between observed values and predicted values (ei = yi - ŷi).
3
New cards
Sum of Squared Errors (SSE)
Objective function that minimizes the total squared errors in a regression model.
4
New cards
Mean Squared Error (MSE)
The average of the squared errors; measures the quality of a regression model.
5
New cards
Gradient Descent
An optimization algorithm used to minimize the loss function by iteratively adjusting model parameters.
6
New cards
Learning Rate (α)
A parameter indicating how much to change the model in response to the estimated error.
7
New cards
k-Fold Cross-Validation

assess the generalization of a model by dividing the dataset into k parts.

8
New cards

9
New cards
Ridge Regression
A regression method that adds L2 penalty to the loss function, reducing the magnitude of the coefficients.
10
New cards
Lasso Regression
A regression method that adds L1 penalty to the loss function, allowing for coefficient shrinkage and feature selection.
11
New cards
Elastic Net
A regression technique that combines both L1 and L2 penalties for regularization.
12
New cards
Logistic Regression
A method for binary classification predicting the probability of binary outcomes.
13
New cards
Sigmoid Function
A function that maps any real-valued number into the (0, 1) interval, used to model probabilities.
14
New cards
Log-Odds
The logarithm of the odds, representing the likelihood of an event occurring.
15
New cards
Maximum Likelihood Estimation (MLE)
A method used in statistics to estimate the parameters of a statistical model.
16
New cards
Gradient Ascent
An optimization algorithm that updates parameters to maximize the log-likelihood function.
17
New cards
One-vs-Rest (OvR) Approach
A method where multiple binary classifiers are trained to distinguish one class from the rest.
18
New cards
Softmax Function
A function that converts raw scores into probabilities that sum to one in multi-class classification.
19
New cards
Confusion Matrix
A tabular layout that allows visualization of the performance of an algorithm.
20
New cards
Accuracy
The ratio of correctly predicted instances to the total instances.
21
New cards
Precision
The ratio of true positives to the total predicted positives.
22
New cards
Recall
The ratio of true positives to the total actual positives.
23
New cards
F1-Score
The harmonic mean of precision and recall, balancing both metrics.
24
New cards
One-Hot Encoding
A method for transforming categorical variables into a binary matrix representation.
25
New cards
Decision Trees
A model that makes decisions based on a tree-like structure of nodes representing feature splits.
26
New cards
Gini Index
A measure of impurity used in decision trees to create splits that minimize impurity.
27
New cards
Entropy
A measure of disorder or uncertainty used in decision trees to create splits.
28
New cards
Overfitting
When a model learns noise and details in the training data to the extent that it negatively impacts performance on new data.
29
New cards
Pruning
The process of removing parts of a tree that do not provide significant power in prediction.
30
New cards
Random Forests
An ensemble learning method that constructs multiple decision trees for improved accuracy.
31
New cards
Bootstrap Samples
Random sampling of data with replacement to create subsets for each decision tree in a random forest.
32
New cards
Out-of-Bag (OOB) Error
An estimate of the model's generalization performance based on data points not included in the bootstrap sample.
33
New cards
Threshold
A value that separates predicted outcomes in classification algorithms.
34
New cards
Hyperplane
A subspace of one dimension less than its ambient space, used in regression and classification.
35
New cards
L2 Penalty
A regularization term that helps prevent overfitting by penalizing large coefficients.
36
New cards
L1 Penalty
A regularization term that can shrink some coefficients to zero for feature selection.
37
New cards
Feature Selection
The process of selecting a subset of relevant features for model training.
38
New cards
Decision Boundary
The surface that separates instances of different classes in classification problems.
39
New cards
Collinearity
A situation in which some predictor variables are highly correlated, potentially complicating modeling.
40
New cards
Feature Engineering
The process of using domain knowledge to create features that make machine learning algorithms work.
41
New cards
Model Generalization
The ability of a model to perform well on unseen data.
42
New cards
Numerical Stability
The property of an algorithm to avoid significant numerical errors during computations.
43
New cards
Heuristic
A rule of thumb or strategy to simplify decision-making processes.
44
New cards
ID3 Algorithm
An algorithm used to construct decision trees based on information gain.
45
New cards
Pruning Techniques
Methods to reduce the complexity of decision trees by cutting back on branches.
46
New cards
Bagging
An ensemble method that improves the stability and accuracy of machine learning algorithms through averaging.
47
New cards
Top-Down Approach
Building a decision tree by starting with the whole dataset and splitting it down.
48
New cards
Binary Classification
Classifying instances into one of two distinct classes.
49
New cards
Model Evaluation
The process of assessing how well your machine learning model performs.
50
New cards
Training Set
The dataset used to train a machine learning model.
51
New cards
Test Set
The dataset used to evaluate the performance of a trained machine learning model.
52
New cards
Multicollinearity
Occurs when two or more predictor variables in a regression model are highly correlated.
53
New cards

=