Vision

Learning Objectives

By the end of this section, you will be able to:
- Describe the basic anatomy of the visual system
- Discuss how rods and cones contribute to different aspects of vision
- Describe how monocular and binocular cues are used in the perception of depth

Overview of the Visual System

The visual system actively constructs a coherent mental representation of the external world around us, interpreting light information into meaningful images.
This intricate representation is fundamental to our ability to navigate complex physical spaces, identify and interact with important objects, and recognize individuals within our environment.

Anatomy of the Visual System

The Eye
- The major sensory organ uniquely structured for vision.
- Light waves are initially transmitted across the cornea and subsequently enter through the pupil.
- Cornea: This transparent, protective outer covering of the eye acts as the primary barrier against external elements and is responsible for approximately $80\%$ of the eye's focusing power, bending light rays inward.
- Pupil: A small, adjustable opening in the center of the iris that regulates the amount of light entering the eye. Its size dynamically adjusts in response to varying light levels and emotional arousal. For instance, in dim light, dilator muscles cause the pupil to dilate (expand) to allow more light in, while in bright light, sphincter muscles cause it to constrict (become smaller).
- This size regulation is controlled by involuntary muscles connected to the iris (the colored, muscular portion of the eye).

Light Pathway Through the Eye

After passing through the pupil:
- Light then crosses the lens, a curved, transparent, and flexible structure located behind the pupil, which provides additional focusing power.
- The lens changes shape—a process called accommodation—through the action of ciliary muscles, to ensure that light rays converge precisely onto the retina, regardless of whether the object is near or far.
- Accommodation is crucial for maintaining a clear image on the retina.
- Retina: The light-sensitive inner surface at the back of the eye, containing photoreceptor cells.
- Fovea: A small, central indent within the retina, specifically responsible for sharp, detailed central vision. It is densely packed exclusively with photoreceptor cells known as cones.
- Cones: These photoreceptors function optimally in bright light conditions (photopic vision), provide high spatial resolution (detail), and are primarily responsible for color vision. There are three types of cones, sensitive to short (S-cones, blue), medium (M-cones, green), and long (L-cones, red) wavelengths of light.
Rods: Numerously abundant photoreceptors (approximately $120$ million) distributed throughout the rest of the retina, outside the fovea.
- They function exceptionally well in low-light conditions (scotopic vision), are highly sensitive to movement and dim light, but lack spatial resolution and are achromatic (do not perceive color).

Sensitivity of Rods and Cones

The differing sensitivities of rods and cones are evident in phenomena like dark and light adaptation.
- Examples: When entering a dark theater from a brightly lit lobby, initial blindness occurs as cones cease to function and rods gradually adapt (dark adaptation, which can take up to $30$ minutes). Conversely, light adaptation occurs rapidly when moving from dim to bright environments.
- Difficulty seeing in dim light, often referred to as night blindness, can be a symptom of conditions where rods do not efficiently convert light into nerve impulses due potentially to vitamin A deficiency or genetic defects.

Connection to Retinal Ganglion Cells and Optic Nerve

Rods and cones transmit visual information to retinal ganglion cells via intermediary neurons, including bipolar cells, horizontal cells, and amacrine cells.
The axons of these ganglion cells converge and exit the eye at a single point, forming the optic nerve, which then transmits all visual information to the brain.
Blind Spot: This specific area in the visual field corresponds to where the optic nerve exits the retina. Because there are no photoreceptors (rods or cones) at this point, no visual information can be processed. It is not consciously perceived because the brain 'fills in' the missing information, and the overlapping visual fields of both eyes compensate for it.

Neural Pathways to the Brain

The optic chiasm: An X-shaped structure located at the base of the brain where the optic nerves from both eyes partially cross over.
- Information originating from the right visual field of both eyes (nasal retina of the right eye and temporal retina of the left eye) proceeds to the left cerebral hemisphere, and vice versa for the left visual field, ensuring contralateral processing.
From the optic chiasm, visual information is relayed through various subcortical structures, most notably the lateral geniculate nucleus (LGN) of the thalamus, which acts as a crucial relay station, before being transmitted to the primary visual cortex (V1) in the occipital lobe for further processing.

Visual Processing Pathways

Within the brain, visual information follows two main processing streams:
- What Pathway (Ventral Stream): Extends from the occipital lobe to the temporal lobe and is primarily involved in object recognition and identification (e.g., perceiving the form, color, and identity of an object).
- Where/How Pathway (Dorsal Stream): Extends from the occipital lobe to the parietal lobe and is involved in processing spatial location, motion, and guiding interaction with visual stimuli (e.g., determining an object's position, movement, and how to interact with it).
- Example: Identifying a round, red object as a 'ball' (What pathway) and perceiving its trajectory and preparing to catch it (Where/How pathway).

Research on Visual Perception

David Hubel and Torsten Wiesel: Awarded the Nobel Prize in Physiology or Medicine in $1981$ for their pioneering research on how individual neurons in the visual cortex respond to specific features of visual stimuli.
- They utilized techniques including single-unit recordings to map the receptive fields of neurons in the visual cortex of cats and monkeys, identifying specialized cells (e.g., simple cells responding to lines of specific orientations, complex cells responding to moving lines, and hypercomplex cells responding to lines of specific length).
- Their research also included studying critical periods in visual development, famously by suturing one eye of newborn kittens. This deprivation led to a significant loss of neural connections and functionality in the deprived visual cortex, demonstrating the importance of early visual experience for proper neural development.
Ethical questions frequently arise around animal research, sparking debates that center on the balance between the potential benefits for human understanding of disease and perception against concerns for animal welfare and suffering.

Color and Depth Perception

We perceive the world not just as a collection of shapes but as a rich, three-dimensional space imbued with a full spectrum of colors.

Color Vision

Normal-sighted individuals are trichromats, possessing three distinct types of cone photoreceptors, each maximally sensitive to different wavelengths of light, which collectively contribute to our experience of color vision.
- Trichromatic Theory (Young-Helmholtz Theory): Proposes that our perception of colors arises from the differential activation and combination of these three types of cone cells: one sensitive to short wavelengths (blue), one to medium wavelengths (green), and one to long wavelengths (red).
Opponent-Process Theory (Hering's Theory): Suggests that color information is processed in opposing pairs at a later stage, beyond the retina, by specialized opponent-process cells in the retinal ganglion cells and thalamus. These pairs are black-white, yellow-blue, and green-red. Activation of one color in a pair inhibits the perception of its opponent.
- This theory effectively explains phenomena like negative afterimages (perceptual experiences where one sees the opposing colors after removing a stimulus), which can be demonstrated through activities such as staring intensely at a colored image for a sustained period and then viewing a blank white page, where the complementary colors appear.

Integration of Theories

Both theories are accepted as accurate and apply to different stages in visual processing:
- The Trichromatic theory best describes the initial encoding of color information at the level of the retina within the cone photoreceptors.
- The Opponent-process theory comes into play at post-retinal processing stages, specifically within the retinal ganglion cells and further neural pathways in the brain, where color contrasts are processed.

Depth Perception

Depth perception is the remarkable ability to perceive spatial relationships, allowing us to accurately ascertain the positioning, distance, and relative layout of objects within a three-dimensional environment.
Binocular Cues: These cues for depth perception critically require information from both eyes and are particularly effective for judging distances of nearby objects.
- Binocular Disparity (Retinal Disparity): Each eye receives a slightly different two-dimensional image of the world because they are separated by a few inches ( $2.5$ inches on average). The brain actively computes the difference between these two images to construct a perception of depth. (e.g., the classic finger experiment where holding a finger close to the face and alternating eye closure shows the apparent shift).
- Convergence: As an object moves closer to the observer, the eyes must rotate inward (converge) to maintain focus on the object. The brain uses the extent of this muscular effort and eye angle as a cue for its perceived distance.
- 3D movies skillfully leverage the principle of binocular disparity by presenting slightly different images to each eye through specially designed glasses, thereby creating a convincing illusion of depth.

Monocular Cues

Cues that require only one eye to perceive depth; these are especially crucial for depth perception in two-dimensional representations such as paintings, photographs, or evaluating distant objects where binocular cues are less effective.
Table SAP.1: Lists various monocular depth cues with descriptions and examples:
- Image Position (Relative Height): Objects positioned higher in the visual field are typically perceived as farther away, while those lower appear closer, especially on the ground plane.
- Relative Size: If two objects are known to be of similar actual size, the one that projects a smaller image on the retina is perceived as farther away (e.g., distant cars appear smaller than nearby cars).
- Linear Perspective: Parallel lines appear to converge as they extend into the distance (e.g., railroad tracks appearing to meet on the horizon).
- Light and Shadow: Objects that are closer typically appear brighter and more clearly defined. Shadows provide crucial information about the shape and depth of an object, indicating its contours and relative position to a light source.
- Interposition (Overlap): When one object partially blocks the view of another, the object doing the blocking is perceived as being nearer.
- Aerial Perspective (Atmospheric Perspective): Objects that are farther away often appear hazier, less distinct, and bluer due to the scattering of light by dust and moisture particles in the atmosphere. This haziness makes them seem more distant.

Case Study: Stereoblindness

The captivating example of Bruce Bridgeman, who was stereoblind (unable to perceive depth using binocular cues) for most of his life due to an uncorrected eye problem in childhood, experienced a profound transformation in his depth perception in his old age after seeing a 3D movie. This spontaneous acquisition of binocular depth perception illustrates the remarkable plasticity and persistence of certain visual