Gradient Descent

Last updated 1:29 PM on 3/4/26

11 Terms

1. Gradient Descent Idea

  1. sample an input and its target

  2. measure the error

  3. adapt the model parameters, stepping in the direction of lower error

  4. repeat until the error is low
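The four steps above can be sketched as a minimal training loop. This is illustrative only: a single-parameter model y ≈ w·x, squared error, and a finite-difference gradient estimate stand in for whatever model and loss are actually used.

```python
import random

random.seed(0)

# Toy data generated from y = 3x, so the loop should learn w close to 3.
data = [(x, 3.0 * x) for x in [0.5, 1.0, 1.5, 2.0]]

w = 0.0    # single adaptable parameter
lr = 0.1   # learning rate

def loss(w, x, y):
    return (w * x - y) ** 2   # squared error on one sample

for step in range(200):
    x, y = random.choice(data)                     # 1. sample input and target
    err = loss(w, x, y)                            # 2. measure the error
    eps = 1e-6                                     # 3. estimate the gradient
    grad = (loss(w + eps, x, y) - err) / eps       #    (finite difference)
    w -= lr * grad                                 #    step toward lower error
                                                   # 4. repeat until error is low
print(round(w, 2))
```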

2. Parameter Update

(flashcard image)
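The card's image is not recoverable as text; the standard gradient-descent parameter update it presumably showed is:

```latex
\theta \leftarrow \theta - \eta \, \nabla_{\theta} \mathcal{L}(\theta)
```

where 𝜂 is the learning rate and ∇𝜃𝓛 is the gradient of the loss with respect to the parameters.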
3. Gradient

(flashcard image)
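The card's image is not recoverable as text; the standard definition it presumably showed: for a loss 𝓛 with parameters 𝜃 ∈ ℝⁿ, the gradient is the vector of partial derivatives,

```latex
\nabla_{\theta} \mathcal{L} =
  \left( \frac{\partial \mathcal{L}}{\partial \theta_1},
         \dots,
         \frac{\partial \mathcal{L}}{\partial \theta_n} \right)
```

It points in the direction of steepest ascent, which is why gradient descent steps against it.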
4. Dataset

  • each element is an n-dimensional vector x

  • the prediction target y is a tensor, which we call the ground truth

5. Model

  • The model has a set of adaptable parameters, 𝜽 ∈ 𝚯, generally real numbers: 𝜽 ∈ ℝ.

  • We write: a model with parameters 𝜃 is 𝑓𝜃: 𝑋 → 𝑌

  • the parameters control the behaviour of the model
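A tiny sketch of the notation 𝑓𝜃: 𝑋 → 𝑌, using an illustrative linear model (the names `f` and `theta` are assumptions, not from the card):

```python
# A model f_theta: X -> Y whose behaviour is controlled by its parameters.
def f(theta, x):
    w, b = theta              # adaptable parameters: real numbers
    return w * x + b          # a simple linear model as an example

theta = (2.0, 1.0)            # changing theta changes the model's behaviour
print(f(theta, 3.0))
```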

6. Learning Algorithm

  • the parameters are adapted to reduce a loss function

  • goal: minimize the loss function

  • low loss = low error = high accuracy

7. Linear Regression

Goal: minimize the difference between y (the actual value) and ŷ (the prediction)

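A sketch of linear regression trained by gradient descent on the mean squared error between y and ŷ. The data and hyperparameters here are illustrative assumptions:

```python
# Fit y_hat = w*x + b by full-batch gradient descent on the MSE.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]          # generated from y = 2x + 1

w, b, lr = 0.0, 0.0, 0.05
n = len(xs)

for _ in range(2000):
    # Analytic gradients of MSE = (1/n) * sum((w*x + b - y)^2)
    grad_w = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
    grad_b = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
    w -= lr * grad_w
    b -= lr * grad_b

print(round(w, 2), round(b, 2))    # close to the true 2 and 1
```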
8. Convex Loss Function

  • a single global minimum

  • can often be optimized much faster than with gradient descent (e.g., with a closed-form solution)

9. Learning Rate

  • determines how fast we adapt the parameters

  • high value = faster learning = risk of overshooting the minimum

  • low value = slower learning = approaches the minimum more precisely
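The trade-off can be seen on the simplest convex loss, f(w) = w² (gradient 2w); the learning-rate values here are chosen just to demonstrate the two regimes:

```python
# Gradient descent on f(w) = w^2, whose gradient is 2w.
def descend(lr, steps=20, w=1.0):
    for _ in range(steps):
        w -= lr * 2 * w
    return w

print(descend(0.1))   # small step: shrinks steadily toward the minimum at 0
print(descend(1.1))   # too large: each step overshoots and |w| grows
```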

10. Logistic Regression

  • using regression for classification

  • idea: encode the probability of belonging to a class as a numeric value

  • fit an (inverse) logistic function (a.k.a. sigmoid) to the data

  • based on the predicted value, we assign a class
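A minimal sketch of the last two bullets: squash a linear score through the sigmoid to get a probability, then threshold it to assign a class. The parameter values are hypothetical stand-ins for a trained model:

```python
import math

def sigmoid(z):
    # logistic function: maps any real score into (0, 1)
    return 1.0 / (1.0 + math.exp(-z))

def predict(theta, x):
    w, b = theta
    p = sigmoid(w * x + b)         # probability of the positive class
    return 1 if p >= 0.5 else 0    # assign a class from the predicted value

theta = (2.0, -1.0)                # illustrative, "already trained" parameters
print(predict(theta, 2.0), predict(theta, -1.0))
```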

11. Gradient Descent Use

  • can be applied to any differentiable model

  • used to train linear/logistic regression models

  • SVMs

  • neural networks

  • large language models