Computer Vision lectures 6.1-8

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/30

There's no tags or description

Looks like no tags are added yet.

Last updated 11:30 PM on 3/15/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

31 Terms

New cards

Equation of circle

(xi - a)² + (yi - b)² = r²

New cards

If the radius is known

2D Hough Space - Accumulator array = A(a,b)

New cards

If the radius is not known

3D Hough Space - Accumulator array = A(a,b,r)

New cards

Using Gradient Information

can save lots of computation,

edge detection (xi, yi)

edge direction (phi)

a = x - r cos(phi)

b = y - r sin(phi)

New cards

Can you use Hough Transforms for other objects,

beyond lines and circles?

Yes

New cards

Generalized Hough Transform

Find the object center (xc,yc) given edges (xi,yi,phi)

create accumulator A(xc,yc)

initialize A(xc,yc) = 0 and ∀(xc,yc)

for each edge point (xi,yi,phi)
for each entry rk in table, compute

xc = xi + rk cos ak

yc = yi + rk sin ak

increment accumulator A(xc,yc) = A(xc,yc)+1

find local maxima in A(xc,yc)

New cards

Application of Hough Transforms

Detecting shape features

New cards

Hough transform

deals with occlusion, detects multiple instances, robust to noise

New cards

Template

an example of how an object looks

New cards

Sum of Squared Differences

A way to measure similarities using sum of squared differences

SIGMA(d=1,D) (vd-wd)²

New cards

Defining Similarity Using SSD

we need to measure the SSD between each sub window and the template (good matches correspond to low ssd scores)

New cards

Shortcomings of SSD

works if the face in the image has similar brightness and contrast

works if the face in the image has the same size as the face in template

works if the image is rotated in the same way as the template

works if the face in the image is fully visible

New cards

Normalizing for brightness

we can subtract from every window its average intensity value

template is normalized by subtracting from its average intensity

each sub window is normalized by subtracting from its average intensity right before measuring its SSD with the normalized template

DO NOT normalize the entire input image in single step

New cards

Normalize for Contrast

Brightness normalization - we subtract from every window its average intensity value

Contrast normalization (done after brightness normalization) - we divide each window by the standard deviation of values in that window

New cards

Normalized Cross-correlation

provides alternative way to find matches of a template with an image (instead of looking for lowest SSD score we look for highest normalized cross-correlation score)

New cards

Detection at Different Scales

SSD and normalized cross-correlation assume that the object that we want to detect is about as large as the template. Therefore we need to fix the scaling

New cards

Scaling the Image

Approach by scaling the image down by a factor of 3 (1/3), and then get normalized cross-correlation scores between scaled down image and the template. Or we scale up by a factor of 3.

New cards

Scaling the Template

Alternative to scaling the image, same method but with the template instead.

New cards

Multiscale Search

used when we do not know the size of the object we want to detect. (has more potential false matches)

New cards

In-Plane Rotations

do not change what is visible, change orientation of the visible parts.

New cards

Out-of-Plane Rotations

the object rotates in a way that changes what is visible.

New cards

Handling Rotations

Done in similar way for multi-scale (search at multiple rotation and grab high score)

New cards

Number of Detection Results

Set a threshold on the score

Any score above the threshold qualifies as a detection (except non-maxima suppression)

New cards

Cropping

beneficial in a way so that we can keep parts of a template that is most likely to be present in the image.

New cards

Measuring Accuracy

we need to use some dataset of images. For every image in that dataset, we need to set annotations that specify how many objects of interest are present and the bounding box of every object of interest.

New cards

Is a Detection Correct?

measure the intersection over union (IoU) score between the detection and the ground truth

# of pixels in A ∩ B
—————————
# of pixels in A U B

New cards

True positive

a detection box whose IoU score with a ground truth box is over the threshold

New cards

False positive

detection box whose maximum IoU score with all ground truth boxes is under the threshold.

New cards

False negative

a ground truth box G is a false negative if its maximum IoU score with all detection boxes is under the threshold

New cards

Thresholds

2 types required:
IoU threshold - usually 0.5
detection threshold

New cards

detection threshold

Changing the detection threshold usually leads to one of the two numbers (true positives or false positives) getting better and the other one getting worse