Intro to Image Segmentation

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/23

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

24 Terms

1
New cards

What is image segmentation

Task in which an image is partitioned into subgroups of pixels called segments or regions. Doing this reduces the image’s complexity, making analysis simpler.

2
New cards

What applications exist for image segmentation?

  • Self-driving cars (used to identify lanes and other necessary information)

  • Virtual try-on (need exact points of a given vĂȘtement and then, you can replace it)

  • Background subtraction

  • Handwritten text recognition

  • Biomedical Image Understanding: MRI & Cancerogenous cell segmentation

3
New cards

Semantic Segmentation

Each object within segmentation has a pre-established class that you are looking for, aka each subregion has a semantic meaning (specific class)

Required to find a class for each pixel

4
New cards

What type of data do we have?

in Supervised Image Segmentation there are two pieces of data:

  • image, I, which can be grayscale or RGB

  • A segmentation map (same size as I), each entry is a label corresponding to pixel in I

5
New cards

What type of Neural Network do we need here?

  • Our input should have the same height and width as the input image

  • A Fully Convolutional Network is needed

    • Here, our input tensor is an RGB image of shape (3, W, H) and the output tensor will be of shape (K, W, H), where K is the number of classes within a given image

    • Dense layers are not needed because we are not flattening output anymore, but end to end working in tensors

6
New cards

U-Net

  • Gets its shape from the process of downsampling and then upsampling that results in a U shape

  • Number on top of each strip is number of channels and numbers on sides are heights and widths

  • We start with an RGB image (3 Channels) and end up with a tensor of K channels (one segment of each class)

7
New cards

Downsampling

  • The idea of the original image getting smaller, but thicker in terms of number of channels

  • Here, the image is being compressed/ encoding the visual information of the most important features

8
New cards

Upsampling

  • Process of getting thinner and larger, ensuring that the only useful information of the image is being ‘decompressed’/decoded

  • Done via Transpose Convolutions

9
New cards

Skip connections

Why are they used?

  • Mitigate the issue of vanishing gradients

  • Add extra information to the decoder that might be lost because of downsampling on encoder side of network


10
New cards

Transpose Convolution Operation

Typical way of increasing size of a tensor is via Transpose Convolution Operations

  • if we have two matrices A * B, you multiply each element within A by all the elements in B.

  • If there is a stride defined, that represents the number of squares you add to both the width and height of the original matrix

    • Should there be overlap, just sum

  • Usually, the easiest transpose convolution to perform is with a stride of 2

11
New cards

Transpose Convolution and U-Net in Pytorch

Pytorch has module nn.ConvTranspose2d() for transpose convolutions

  • weights learned in this layer are the ones in the kernel and amount does not change with input size, only with input’s number of channels

12
New cards

Instance Segmentation

Special type of segmentation

  • Not only are you dividing image into relevant subgroups, you are further classifying instances of the same object class

13
New cards

Applications of Instance Segmentation

Useful when distinct objets of similar type are present and need to be monitored separately

  • Self-driving cars: keeping track of individual pedestrians and cars in videos

  • Medical scans: segment different nuclei

  • Satellite imagery: detection and counter of cars, etc.

14
New cards

Data in Instance Segmentation

Following data needed:

  • images

  • object-bounding boxes

  • instance-level segmentation ground-truth (MS COCO)

15
New cards

Revisiting Faster R-CNN

  • A CNN network

16
New cards

Mask R-CNN

  • adapts Fast R-CNN to Instance Segmentation

  • Main diff is that Mask R-CNN adds CNN module for segmentation

  • Also uses Rol Align module

17
New cards

Advantages of Mask R-CNN

  • Simplicity

  • Efficiency

  • Flexibility

18
New cards

Panoptic Segmentation

  • Task to detect and segment all objets in picture, including background, and distinguish different instances as well

19
New cards
20
New cards
21
New cards
22
New cards
23
New cards
24
New cards