VL VI - Training Neural Networks

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/17

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 9:18 AM on 6/22/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

18 Terms

1
New cards

Properties L2 Loss:

  • Sum of squared differences

  • Prone to outliers

  • Compute-efficient optimization

2
New cards

Properties L1 Loss

  • Sum of absolute differences

  • Robust

  • Costly to optimize

3
New cards

Different learning rate decays

  • Step decay (linear decay)

  • Exponantional decay

4
New cards

What is learning DL

  • Generalization to unkown dataset

  • optimized parameters give similar result

5
New cards

What is the validation set used for

  • Hyperparameter optimization

  • Check generalization progress

6
New cards

Whats underfitting

Training and validation losses decrease even at the end of training

7
New cards

Whats overfitting

Training loss decreases and validation loss increases

8
New cards

Ideal training

Small gap between training and validation loss

9
New cards

Bad signs during training

  • Training error not going down

  • validation error not going down

  • Performance on validation better than on training

  • Tests on train set different than during training

10
New cards

Bad practice

use test data during training

11
New cards

Possible Hyperparameters

  • Number Layers

  • Number of iterations

  • Learning rate

  • Regularization

  • Batch size

12
New cards

Methods for Hyperparameter tuning

  • Manual search

  • Grid search

    • Use grid to search hyperparameters

  • Random search

    • Like grid but no structure

13
New cards

Whats a good learning rate

  • use training data with small weight decay

  • Makes loss drop significantly within 100 iteration

14
New cards

How to check if data loading is correct

  • Overfit to a single training sample

    • loss should go to 0

  • Increase training samples gradually

15
New cards

How to Debug Network

  • Train and Val Curves necessary

  • only make one change at a time

16
New cards

most common neural net mistakes

  • didn’t overfit single batch

  • toggle train/eval mode

  • forgot zero_grad() in pytorch

  • passed softmax outputs to a loss

  • didnt use bias false

17
New cards
<p>What can be said about this graph</p>

What can be said about this graph

Parameters are overfitted

18
New cards
<p>What can be said about this curve</p>

What can be said about this curve

Underfitting, because loss still decreasing