Gradient Descent and Model Optimization

0.0(0)
studied byStudied by 0 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/11

flashcard set

Earn XP

Description and Tags

Flashcards covering key concepts of gradient descent, model optimization, feature selection, and related statistical measures.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

12 Terms

1
New cards

Gradient Descent

An optimization algorithm that iteratively adjusts parameters to minimize a cost function.

2
New cards

Cost Function

A function that measures the error or loss in predictions of a model; it needs to be minimized.

3
New cards

Learning Rate (Alpha)

A hyperparameter that determines the size of the steps taken in the direction of the gradient in gradient descent.

4
New cards

Gradient

The vector of partial derivatives indicating the direction of the steepest increase in a function, used to find minima.

5
New cards

Convex Function

A function where a line segment between any two points on the graph of the function lies above the graph; ensures a single global minimum.

6
New cards

Global Minimum

The lowest point on a cost function graph, representing the best possible prediction accuracy for a model.

7
New cards

Local Minimum

A point on a cost function where the function has a lower value than its neighbors but may not be the lowest overall.

8
New cards

Mean Squared Error (MSE)

A common loss function used in regression that squares the errors between predicted and actual values.

9
New cards

P-Value

A statistical measure that indicates the probability of obtaining the observed results assuming that the null hypothesis is true.

10
New cards

Backward Elimination

A feature selection technique that removes the least significant features based on p-values from the model until all remaining features are statistically significant.

11
New cards

Overfitting

A modeling error that occurs when a model is too complex and captures noise rather than the underlying pattern.

12
New cards

Feature Scaling

A method used to standardize the range of independent variables or features of data, essential for many machine learning algorithms.