Introduction to AI & Linear regression

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/32

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 10:38 AM on 6/12/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

33 Terms

1
New cards

In nn.Linear(64, 256), what does 256 represent?

The number of output neurons.

2
New cards

How many trainable parameters does nn.Linear(10, 5) contain?

55

3
New cards

A model with more parameters is always more accurate.

False.

4
New cards

How many trainable parameters does nn.Linear(10, 5) contain? (Formula)

(10 × 5) + 5 = 55.

5
New cards

LayerNorm normalizes each feature across samples, while BatchNorm normalizes all features within a single sample.

False.

6
New cards

What happens if we stack two linear layers without an activation function?

The model is equivalent to a single linear layer.

7
New cards

Why are convolutional layers better suited for images?

They preserve spatial structure.

8
New cards

A batch size of 100 is used on a dataset of 1000 samples. How many gradient updates occur in one epoch?

10.

9
New cards

What is the main role of a ReLU activation?

Introduce non-linearity.

10
New cards

What does MaxPool2D keep from each local window?

The strongest (maximum) value.

11
New cards

What is the purpose of pooling layers?

Reduce spatial dimensions while retaining important features.

12
New cards

What do convolutional filters learn?

Local patterns/features such as edges and textures.

13
New cards

Which of the following is NOT an activation function?

SoftmaxPool.

14
New cards

What is the role of an optimizer?

Minimize the loss function by updating parameters.

15
New cards

What can happen if the learning rate is too high?

Training becomes unstable and may not converge.

16
New cards

What is overfitting?

The model memorizes training data and performs poorly on unseen data.

17
New cards

What is the purpose of the validation set?

Tune hyperparameters and monitor generalization.

18
New cards

What is the purpose of the test set?

Provide an unbiased final evaluation of the model.

19
New cards

Which metric is commonly used for classification?

Accuracy.

20
New cards

What is the goal of gradient descent?

Minimize the loss function.

21
New cards

Which learning paradigm uses unlabeled data?

Unsupervised learning.

22
New cards

Predicting house prices is an example of:

Regression.

23
New cards

What does a low loss value generally indicate?

Predictions are close to the target values.

24
New cards

What is Deep Learning?

Machine learning using neural networks with multiple layers.

25
New cards

What advantage do mini-batches provide?

A balance between efficiency and stable gradient estimates.

26
New cards

What is transfer learning?

Using knowledge from a pretrained model on a new task.

27
New cards

What is Adam?

An optimization algorithm.

28
New cards

What is the main goal of Machine Learning?

To enable systems to learn patterns from data and make predictions or decisions.

29
New cards

What is the purpose of Dropout?

Reduce overfitting by randomly deactivating neurons during training.

30
New cards

What is the purpose of Batch Normalization?

Normalize activations to stabilize and speed up training.

31
New cards

What is the purpose of Softmax?

Convert outputs into class probabilities that sum to 1.

32
New cards

What is the role of the output layer?

Produce the final prediction.

33
New cards

What is the purpose of padding in CNNs?

Preserve border information and control output size.