SPATIAL PROCESSING AND DEPTH PERCEPTION

SPATIAL PROCESSING AND DEPTH PERCEPTION

Problem Statement

  • Problem: How do we reconstruct a three-dimensional world from distorted and curved 2-dimensional retinal images?

  • Answer: We utilize various monocular and binocular cues to achieve this reconstruction.

Introduction to Depth Cues

  • Depth cue: Information that provides insight into the third dimension (depth) of visual space.

    • Monocular depth cue: A depth cue that is accessible even when the world is viewed with one eye alone.

    • Binocular depth cue: A depth cue that necessitates information from both eyes for perception.

Key Definitions
  • Binocular disparity: The differences observed between the two retinal images of the same scene.

    • Stereopsis: A vivid perception of the three-dimensionality of the world, which arises from binocular disparity and is absent in monocular vision.

Binocular Depth Cues

  • Disparity in the Retinal Images:

    • Human sensitivity to small binocular disparities is remarkable; some adults can detect a difference in depth as small as 4 mm at a distance of 5 m.

    • Stereopsis typically provides information about relative depth up to a distance of about 200 m. This functional capacity offers significant adaptive and evolutionary advantages to individuals.

Monocular Cues to Three-Dimensional Space

  • Occlusion: A cue representing relative depth order where one object partially obscures another object, indicating that the obscured object is farther away.

  • Relative Size: This cue involves comparing the sizes of various objects without knowledge of their absolute sizes.

    • Generally, we assume that smaller objects are located farther away from the observer than larger objects, all else being equal.

  • Relative Height: For objects that are touching the ground:

    • Objects appearing higher in the visual field are perceived to be farther away.

    • Conversely, in the sky above the horizon, objects lower in the visual field appear to be farther away.

  • Familiar Size: This cue is based on the inherent knowledge of the typical size of various objects.

    • Knowing what size an object typically is helps create estimations about its distance based on how small or large it appears in the visual field.

  • Aerial Perspective: A depth cue that leverages the understanding that light gets scattered by the atmosphere.

    • More atmosphere results in additional light scattering; thus, the more distant an object is, the fainter, bluer, and less distinct it appears.

  • Dynamic Cues:

    • Motion Parallax: A phenomenon where images closer to the observer move more rapidly across the visual field compared to images that are further away.

    • The brain utilizes this motion information to deduce distances of objects in the environment.