Deep learning - Chapter 3: CNN Concepts

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/29

Last updated 5:54 PM on 3/30/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

30 Terms

1
New cards

What is a CNN?

A neural network designed to process grid-like data like images using convolutional layers.

2
New cards

What is the main advantage of CNNs over fully connected networks for images?

They preserve spatial relationships and dramatically reduce parameters through weight sharing.

3
New cards

What is a filter/kernel in a CNN?

A small matrix of weights that slides across the image to detect specific features.

4
New cards

How does 2D convolution work on a 3D RGB image?

The 3D filter slides horizontally and vertically (not along depth) because filter depth matches input depth.

5
New cards

What happens when multiple filters are applied to an input?

They produce an output volume where depth equals the number of filters used.

6
New cards

What is the formula for output size after convolution?

(Input Size - Filter Size + 2×Padding) / Stride + 1

7
New cards

What is padding?

Adding zeros around the input image to control output size and preserve border information.

8
New cards

What is stride?

The step size (number of pixels) the filter moves each time it slides across the image.

9
New cards

What happens when stride increases?

The output size decreases and computations are reduced.

10
New cards

 What is the purpose of pooling?

To down sample feature maps, reducing spatial dimensions, parameters, and computational load.

11
New cards

What is max pooling?

Taking the maximum value from each region of the feature map.

12
New cards

What is the typical structure of a CNN?

Conv → ReLU → Pooling → Conv → ReLU → Pooling → Flatten → Fully Connected → Output.

13
New cards

What makes VGG architecture unique?

It uses only small 3×3 filters stacked very deep in a simple, uniform pattern.

14
New cards

What is the main drawback of VGG?

It has a huge number of parameters (~138 million), making it computationally expensive.

15
New cards

What is the key innovation of GoogleNet (Inception)?

The Inception module that applies multiple filter sizes (1×1, 3×3, 5×5) in parallel.

16
New cards

What is the key innovation of ResNet?

Skip connections (residual connections) that allow gradients to flow directly through very deep networks.

17
New cards

What problem do skip connections solve?

The vanishing gradient problem, enabling training of networks with over 100 layers.

18
New cards

What is transfer learning?

Reusing a model pre-trained on a large dataset as a starting point for a new, related task.

19
New cards

Why is transfer learning effective?

It leverages pre-learned features, requiring less data and training time.

20
New cards

What are the three main CNN models covered?

VGG (simple and deep), GoogleNet (multi-scale filters), and ResNet (skip connections).

21
New cards

 What is the receptive field?

The region of the input image that influences a particular output neuron.

22
New cards

What is a feature map?

The output produced by applying a single filter to the input.

23
New cards

What is the role of the activation function (ReLU) after convolution?

To introduce non-linearity, allowing the network to learn complex patterns.

24
New cards

What is flattening in a CNN?

Converting the 3D output of convolutional layers into a 1D vector for fully connected layers.

25
New cards

What is the relationship between filter size and padding to maintain output size?

Padding needed = (Filter Size - 1) / 2 for same output size.

26
New cards

How do early CNN layers differ from later layers?

Early layers detect simple features (edges, colors); later layers detect complex features (shapes, objects).

27
New cards

What is weight sharing in CNNs?

The same filter weights are used across all positions of the input, reducing parameters.

28
New cards

What is translation invariance in CNNs?

The ability to recognize objects regardless of their position in the image.

29
New cards

What is the output of a CNN for classification?

A probability distribution over classes, typically using Softmax activation.

30
New cards

What is the difference between valid padding and same padding?

Valid padding gives smaller output; same padding gives output equal to input size when stride=1.

Explore top notes

note
US History Chap. 11
Updated 921d ago
0.0(0)
note
homeostasis
Updated 1341d ago
0.0(0)
note
1.3: Cell membranes and transport
Updated 1052d ago
0.0(0)
note
SourceOfFinance
Updated 390d ago
0.0(0)
note
PDHPE
Updated 516d ago
0.0(0)
note
Chapter 37 - The Eisenhower Era
Updated 1295d ago
0.0(0)
note
Korean Food & Drink
Updated 432d ago
0.0(0)
note
2024Chem. IMFs ↓↑
Updated 591d ago
0.0(0)
note
US History Chap. 11
Updated 921d ago
0.0(0)
note
homeostasis
Updated 1341d ago
0.0(0)
note
1.3: Cell membranes and transport
Updated 1052d ago
0.0(0)
note
SourceOfFinance
Updated 390d ago
0.0(0)
note
PDHPE
Updated 516d ago
0.0(0)
note
Chapter 37 - The Eisenhower Era
Updated 1295d ago
0.0(0)
note
Korean Food & Drink
Updated 432d ago
0.0(0)
note
2024Chem. IMFs ↓↑
Updated 591d ago
0.0(0)

Explore top flashcards

flashcards
Envol 5 - Unité 7
46
Updated 992d ago
0.0(0)
flashcards
Respiration and Excretion
85
Updated 292d ago
0.0(0)
flashcards
TỪ VỰNG LESSON 8
33
Updated 929d ago
0.0(0)
flashcards
CSUF Biol 151 Exam 1
66
Updated 764d ago
0.0(0)
flashcards
Vocab list #1
50
Updated 173d ago
0.0(0)
flashcards
Medical Terminology: Chapter 4
75
Updated 1285d ago
0.0(0)
flashcards
Envol 5 - Unité 7
46
Updated 992d ago
0.0(0)
flashcards
Respiration and Excretion
85
Updated 292d ago
0.0(0)
flashcards
TỪ VỰNG LESSON 8
33
Updated 929d ago
0.0(0)
flashcards
CSUF Biol 151 Exam 1
66
Updated 764d ago
0.0(0)
flashcards
Vocab list #1
50
Updated 173d ago
0.0(0)
flashcards
Medical Terminology: Chapter 4
75
Updated 1285d ago
0.0(0)