cv prelim 1

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/88

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 9:57 PM on 3/25/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

89 Terms

1
New cards

image

function mapping coordinates to intensity values

2
New cards

mean filter

all weights are 1/(2k + 1)², blurring the image

3
New cards

convolution

rotate kernel horizontally and vertically, then cross-correlate

4
New cards

full convolution

compute if any kernel overlap exists (use zero-padding) —> m + k - 1 output

5
New cards

same convolution

compute if kernel center is on image —> m output

6
New cards

valid convolution

compute only if kernel entirely on image —> m - k + 1 output

7
New cards

What’s the idea behind the sharpening filter, and how does the kernel work?

blurring removes fine detail, so add the detail you lost (original - blurred) to the original
kernel: 2 * identity - mean_filter

8
New cards

shift equivariance

doesn’t matter if you shift input then filter or filter first then shift input

9
New cards

Gaussian filter

weight pixels by a Gaussian - neighbors closer to center get high weight

10
New cards

How does the sigma parameter control the Gaussian?

small sigma —> narrower —> less blurring (very nearby pixels weighted)
large sigma —> wider —> more blurring

11
New cards

Difference of Gaussians

subtract two Gaussian-blurred images: sharp smaller blur - broad larger blur leaves behind only edges/fine detail

12
New cards

separable filters

decomposing 2D kernels as outer product of two 1D vectors, then apply those in sequence

13
New cards

inverse mapping

for all output pixels, fetch where it came from in the input

14
New cards

interpolation

estimating the value of inverse mapping results if it’s between input pixels (non-int coords)

15
New cards

bilinear interpolation

weighted average of 4 surrounding pixels based on opposing rectangle

16
New cards

aliasing

high-frequency detail (e.g. edges) getting misrepresented as a false low-frequency pattern

17
New cards

Nyquist Theorem

to avoid aliasing, your sampling rate must be >= 2x the highest frequency in the signal

18
New cards

sampling rate

how many samples you take per unit of space (e.g. 2 samples at peak/trough for sine wave)

19
New cards

Gaussian pre-filtering

blur first with a Gaussian to remove any high-frequency details, then safely subsample —> smooths out any rapid variation that would have been aliased

20
New cards

How do you determine where an edge exists?

pixel intensity changes rapidly at an edge, so take derivative of an image function along a row —> edges are extrema/peaks

21
New cards

What do the gradient magnitude and gradient direction tell you?

gradient magnitude (magnitude of the gradient vector [df/dx, df/dy]) tells you edge strength
gradient direction (arctan(dx/dy)) tells you which way intensity increases the most and is perpendicular to the edge

22
New cards

How to handle noise when differentiating?

differentiation amplifies high frequencies (includes noise), so smooth first then differentiate

23
New cards

Sobel filter

smooths along edge direction and differentiates across it, resulting in an image telling you how strong an edge is at that location

24
New cards

Canny Edge Detection

  1. Smooth by convolving w/ Gaussian to separate noise

  2. Compute gradient with x/y derivatives of Gaussian (precompute)

  3. Non-max suppression —> keeps local max gradients, sharp/thin edges

  4. Hysteresis thresholding with two thresholds (mark if either above high or above low + connected to high)

25
New cards

What do lines do that edges don’t?

organize edges into geometric structures

26
New cards

Hough Voting

letting Canny-decided edge pixels vote on what lines they could lie on, and lines with most votes become actual lines

27
New cards

How can a line be written in polar form?

x cos theta + y sin theta = p (perpendicular dist from origin to line)

28
New cards

What do lines correspond to in Hough Space? What do points correspond to?

all lines correspond to a point (p, theta) representing perpendicular dist from origin + the angle of the line

all points correspond to a sinusoidal curve (possible line point pairs that satisfy the polar form)

29
New cards

Line Detection algorithm

  1. Initialize 2D Hough array ‘acc’ as 0’s (axes are p and theta)

  2. For all edge pixels (x, y), loop over theta values, compute p, and increment acc[p][theta] (use gradient detection at an edge pixel to restrict which theta you vote for to perpendicular angles only)

  3. Find local maxima in acc (lines w/ more votes than its neighbors)

  4. Threshold lines with a minimum vote count

30
New cards

segmentation

grouping pixels into regions that belong together

31
New cards

K-Means

  1. Randomly pick k centers

  2. Assign each point to nearest center

  3. Recompute means as avg of assigned points

  4. Repeat until guaranteed convergence

32
New cards

How to apply K-Means to images?

represent a pixel as a point (r, g, b), set k to k colors (color segmentation), and encode position so we don’t group same color but different position (r, g, b, x, y)

33
New cards

How can we fix issues of large regions getting split up in K-Means?

use a very large k so that each small region is a superpixel, then decide later how to merge superpixels

34
New cards

Flood Fill

  1. Represent all pixels as nodes and add edges in between only if that edge doesn’t cross an actual detected edge (edges between nodes within a defined region)

  2. Find connected components using DFS, representing regions

35
New cards

What does weighted graph-based segmentation do differently?

assign weights representing how likely two pixels are to belong to the same object (high = similar color and no edge, low = different color and strong edge)

36
New cards

Min-Cut Based GBS

segmentation involves cutting weak edges to separate clusters, so find the min-cut (minimizing total weight of cut edges) with Ford-Fulkerson

37
New cards

GBS algorithms are a family of algorithms that vary these four things:

  1. Graph connectivity (which pixels get edges)

  2. Edge weight detection (colors, gradient, etc.)

  3. Per-node cost (cost for belonging to a segment)

  4. Objective function (min-cut)

38
New cards

correspondence estimation

finding matching pixels/regions across 2+ images of same scene

39
New cards

General pipeline for correspondence estimation

  1. Feature detection ~ find sparse (few high-confidence) set of distinctive points worth matching

  2. Feature description ~ for all detected points, find a compact representation of its local appearance

  3. Feature matching ~ find best matches among descriptors

  4. Downstream task ~ use matches for later task, e.g. pose estimation

40
New cards

What are two key elements of good feature points?

repeatibility ~ same point detectable between images despite changes in light/perspective

discriminability ~ point looks different from neighbors, so the matching is unambiguous

41
New cards

corner

point where shifting a small window in any direction leads to a large change in appearance

42
New cards

Why is a corner better than a flat or an edge?

flat ~ no change in any direction

edge ~ no change along edge, only across

43
New cards

What do the eigenvalues of the structure tensor tell you about intensity change?

both eigenvalues near 0 —> flat region (no gradient)

one eigenvalue high, other near 0 —> edge (change in only one dir)

both eigenvalues high —> corner

44
New cards

Harris operator

avoids computing eigenvalues by approximating cornerness score R = det(M) - k * trace(M)²

45
New cards

Harris Corner Detection

  1. Compute image gradients Ix, Iy (apply Sobel filter)

  2. Compute Ix², Iy², Ix * Iy

  3. Gaussian blur each of these new images

  4. Compute Harris response

  5. Threshold R (keep only strong response)

  6. Non-max suppression (keep only local maxima)

46
New cards

invariant

output doesn’t change when image is transformed (e.g. cornerness score)

47
New cards

equivariant

output transforms in same way as input image (e.g. corner location)

48
New cards

What are the two photometric transformations?

additive ~ I’ = I + c (represents a brightness change, shifting histogram of intensity values)

multiplicative ~ I’ = c * I (contrast changes, scales/stretches histogram)

49
New cards

Laplacian of Gaussians

applying Difference of Gaussians to find local maxima/minima (bright centers/dark centers)

50
New cards

characteristic scale

sigma value where LoG response peaks and blob size approximately matches size of the feature

51
New cards

blobs

circular regions of interest in an image

52
New cards

How do Harris and LoG behave under photometric transforms?

Harris ~ invariant to additive, not to multiplicative

LoG ~ not invariant to either

53
New cards

Scale-Invariant Feature Detection

Run detector at many sigma values and find points (x, y, sigma) that are local maxima in 3D space, allowing you to match features at different distances in images

54
New cards

Gaussian pyramid

downsampling an image at multiple scales and applying a fixed-size filter at each (similar to a scaling filter)

55
New cards

descriptor

compact representation (vector of numbers) of the local appearance around a feature point

56
New cards

Normalized Cross Correlation

use the normalized pixel patch as the descriptor to remove effect of photometric changes

d = (patch - patch_mean) / std

57
New cards

Rotation Invariance for descriptors

  1. Compute second moment matrix M (apply Sobel filter and match pixel positions)

  2. Find eigenvector x_max corresponding to biggest eigenvalue

  3. Rotate by computing theta = (x_max_x / x_max_y) and applying rotation matrix

  4. Describe rotated pattern w/ descriptor

58
New cards

Multiscale Oriented Patches Descriptor

  1. Choose the right scale (find level in Gaussian pyramid with max Harris cornerness score)

  2. Apply transformation matrix

  3. Normalize intensity

59
New cards

MOPS Transformation Matrix

  1. Apply MT1 ~ translate so feature is at origin

  2. MR ~ rotate to standard orientation

  3. MS ~ scale down 40 × 40 px —> 8 × 8 px

  4. MT2 ~ translate to output image center

60
New cards

What fix would resolve the MOPS issue of failing at sophisticated lighting changes and rotations?

describe edge orientation instead of raw pixel values

61
New cards

Quantized Orientation Histograms

instead of recording exact gradient orientation, bin them into coarse 45 degree buckets

62
New cards

Scale-Invariant Feature Transform (SIFT)

  1. Scale + rotation normalization (same as MOPS)

  2. Divide patch into grid of 4×4 cells

  3. Build orientation histogram per cell

  4. Concatenate all histograms (4×4 cells x 8 bins = 128-dim description vectors

In other words, for all cells in patch, count how many strong edges point in each of 8 dirs + stack these counts

63
New cards

What are three key improvements to SIFT?

  1. Threshold weak edges

  2. Soft voting with bilinear interpolation (vote proportionally into bins)

  3. Normalize descriptor to unit length, clamp any high values, and renormalize

64
New cards

Ratio Test

Find ||f1 - f2|| / ||f1 - f2’||, where f2 is best match and f2’ is second best match

Small ratio —> best is important, keep the feature point; vice versa for large ratio

65
New cards

What are the key aspects of the pinhole model?

origin at the pinhole, Z-axis points away from image plane (which sits at Z = -1), 3D point P = (X, Y, Z) —> 2D point p = (x, y)

66
New cards

What behavior do parallel lines have as a result of projection?

Converge at a vanishing point and share the same vanishing line as Z —> infinity

67
New cards

What extra thing is required for a camera when considering the real-world coordinate system?

needs 3×3 orthogonal rotation matrix R and 3D translation matrix t

p_cam = R * p_world + t

68
New cards

homogeneous coordinates

adding an extra coordinate to represent points so that perspective projection becomes matrix multiplication (e.g. (X, Y, Z) —> (X, Y, Z, 1)

69
New cards

What is the key equation for camera reconstruction?

p_camera = K * [R | t] * p_world
K = intrinsic camera matrix

[R | t] = rotation and translation combined into 3×4 projection matrix

p_world = 4D homogenous point

70
New cards

parallax

nearby things shift more than far things

71
New cards

calibration

finding camera parameters based on known 3D points + correspondences

72
New cards

Direct Linear Transform

you know world (X, Y, Z) —> img (x, y), i.e. x = PX

Find 6 non-coplanar correspondences, producing 12 equations, and solve for p by finding eigenvector of smallest eigenvalue in ATA (unflatten p to yield P)

73
New cards

reprojection error

project known 3D points onto image using estimated P from direct linear transform, then calculate sum of squared dists from true observations

74
New cards

panorama effect

two images taken with pure rotation and no translation of 3D scene are still related by a homography (despite rotating, you still look down the same ray at a given point)

75
New cards

triangulation

given two calibrated cameras and a correspondence, find 3D point (X, Y, Z, 1)

76
New cards

rectified cameras

both cameras parallel and separated by horizontal translation t

77
New cards

stereo

2 cameras side by side, match same point in both images —> helps you solve for Z

78
New cards

disparity

x2 - x1, difference in x coordinate between 2 images (tx)

79
New cards

How is disparity related to depth Z?

Z = tx/(x2 - x1), tx = baseline/distance between two cameras

80
New cards

Adjusted NCC for Stereo

Rectified cameras —> corr pts on same row, so only search along same horizontal line

  1. Take patch around pixel a in image 1, pixel b in image 2

  2. Subtract means of patches from each patch

  3. NCC = (a * b) / |a||b|

NCC = 1 —> perfect match, opposite for -1

81
New cards

plane sweep

reduce cost of NCC by looping over potential disparity values for a pixel, shifting the second image by d pixels, then computing NCC simultaneously

82
New cards

homography

3×3 transformation mapping pixels in one image to pixels in another (approximation unless the scene is flat)

83
New cards

How do you adjust for some rotation in real cameras?

apply homography to each image to get a rectified config, then run stereo

84
New cards

Structure from Motion

images from unknown cameras looking at unknown 3D scene —> recover both camera params and scene

85
New cards

epipolar line

line passing through a point X on camera 1 onto camera 2

86
New cards

epipole

where the other camera appears in a camera’s image

87
New cards

epipolar pencil

the idea that the baseline is fixed even if you rotate the epipolar plane

88
New cards

Essential Matrix

given a point on img 1, E tells you which line in img 2 to seearch (encodes rotation of camera)

89
New cards

Fundamental Matrix

same as essential matrix E, but works on raw pixel coordinates w/o needing to know K (can be approximated)

Explore top notes

note
Thermal Physics
Updated 437d ago
0.0(0)
note
myth & religion exam tips
Updated 225d ago
0.0(0)
note
COURT CASE: Roe v. Wade
Updated 1152d ago
0.0(0)
note
Theoretical Perspectives
Updated 1234d ago
0.0(0)
note
Terms
Updated 1061d ago
0.0(0)
note
Thermal Physics
Updated 437d ago
0.0(0)
note
myth & religion exam tips
Updated 225d ago
0.0(0)
note
COURT CASE: Roe v. Wade
Updated 1152d ago
0.0(0)
note
Theoretical Perspectives
Updated 1234d ago
0.0(0)
note
Terms
Updated 1061d ago
0.0(0)

Explore top flashcards

flashcards
AP Government Killer Vocab
100
Updated 1157d ago
0.0(0)
flashcards
Repaso de Español 1
104
Updated 206d ago
0.0(0)
flashcards
Spanish Vocab 4
136
Updated 916d ago
0.0(0)
flashcards
Palmer- Spinal- Exam 2 LO/TQs
131
Updated 241d ago
0.0(0)
flashcards
Patho Exam 4
292
Updated 606d ago
0.0(0)
flashcards
AP Government Killer Vocab
100
Updated 1157d ago
0.0(0)
flashcards
Repaso de Español 1
104
Updated 206d ago
0.0(0)
flashcards
Spanish Vocab 4
136
Updated 916d ago
0.0(0)
flashcards
Palmer- Spinal- Exam 2 LO/TQs
131
Updated 241d ago
0.0(0)
flashcards
Patho Exam 4
292
Updated 606d ago
0.0(0)