Lecture 4: Perceiving and Recognizing Objects

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/88

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

89 Terms

1
New cards

Idealism:

  • Reality is inseparable from perception; reality is a mental construct

  • Reality as a top-down process starting from the mind

2
New cards

Materialism:

  • The mind can be explained in terms of matter and physical phenomena

  • (Outdated) Reality is a bottom-up process starting from the sensors

3
New cards

Some ventral stream areas

  • Lateral occipital complex (LOC)

  • Parahippocamp al Place Area (PPA): Scenery, locations

  • Fusiform Face Area (FFA): Faces

  • Extrastriate Body Area (EBA): Bodies

<ul><li><p>Lateral occipital complex (LOC)</p></li><li><p>Parahippocamp al Place Area (PPA): Scenery, locations</p></li><li><p>Fusiform Face Area (FFA): Faces</p></li><li><p>Extrastriate Body Area (EBA): Bodies</p></li></ul><p></p>
4
New cards

Lateral occipital complex (LOC)

  • Object perception

  • lesions in both you can see colour, texture but not shapes.

  • Can’t tell difference between cat and car

5
New cards

Parahippocamp al Place Area (PPA):

  • Scenery, locations

  • Recognizing the sight of scenes

  • Ex. I recognize a picture of campus as UTSC

  • In hippocampus, because hippocampus has to do with navigating And processing scenes helps with navigating

6
New cards

Fusiform Face Area (FFA):

Processing Faces

7
New cards

Extrastriate Body Area (EBA):

  • Responds to sight of body

  • Doesn’t care whether face is there

  • Process non verbal signals in body language like emotional state

8
New cards

Two streams for visual processing

  • Dorsal stream vs ventral stream

  • Superior temporal sulcus pathway also proposed

  • Not separated, connected to one another

  • Broad generalization: cross connections, feedback as well as feed forward.

<ul><li><p>Dorsal stream vs ventral stream</p></li><li><p>Superior temporal sulcus pathway also proposed</p></li><li><p>Not separated, connected to one another</p></li><li><p>Broad generalization: cross connections, feedback as well as feed forward.</p></li></ul><p></p>
9
New cards

Dorsal stream

  • Visual processing and location

  • focuses on spatial location and guides actions related to objects

  • processes where, how information.

  • Fast but colourblind

10
New cards

Ventral stream

  • processes what information (what is there)

  • Responsible for object recognition, form, and color perception, essentially helping us understand "what" an object is

11
New cards

Superior temporal sulcus pathway

for biological motion and social perception

12
New cards

Object perception

  • set of lines that belong together

  • Piece things together to say → thats an object

  • Middle vision combines

    • features into objects.

    • The result is object perception.

13
New cards

Object recognition

  • We say “that picture is a house”

  • Set of objects that belong together into a learned category in our memory

  • we match a perceived object representation to a representation encoded in memory.

  • These memory traces can contain information about object categories

  • the ability to identify and categorize objects visually

  • Aliens landing on earth would have to learn this category

  • Ex. “That is a house” or “thats a face”

14
New cards

Object identification

  • Special case of recognition

  • Recognizing the same object in relation to you when you interact with it more than once

  • These memory traces from object recognition can contain information about that particular object. That’ s object identification.

  • “That house is my grandmothers’” or “thats my friends face” → sub categories within categories of object recognition

15
New cards

Object naming

  • Attaching labels to object

  • involves retrieving the linguistic label associated with that object

  • Recognized objects usually have semantic labels and names assigned to them. – Language functions

16
New cards

What V1 sees

Break up things into edges and lines through simple cells

17
New cards

Challenges to what V1 sees:

  • curved lines, overlap, gaps

  • What belongs together?

  • Figure vs. ground?

18
New cards

Challenges to what V1 sees: curved lines, overlap, gaps

  • Simple cells detect edges in hyper columns if the edges respond to the same orientation in its receptive field

    • Ex. A simple cell that responds to horizontal orientation or vertical and will recognize those edges

  • Doesn’t work as well for curved edges, overlap, gaps

<ul><li><p>Simple cells detect edges in hyper columns if the edges respond to the same orientation in its receptive field</p><ul><li><p>Ex. A simple cell that responds to horizontal orientation or vertical and will recognize those edges</p></li></ul></li><li><p>Doesn’t work as well for curved edges, overlap, gaps</p></li></ul><p></p>
19
New cards

Challenges to what V1 sees: What belongs together?

When area V1 processing the picture, it get the second picture. It’s a mess of lines. So what belongs together?

<p><span>When area V1 processing the picture, it get the second picture. It’s a mess of lines. So what belongs together?</span></p>
20
New cards

Gestalt:

  • German for “whole”

  • “The whole is greater than the sum of its parts”

  • Wertheimer, Köhler, Koffka (1920s–1950s); Palmer and Rock (1990s)

  • Reaction to earlier structuralist school of psychology

21
New cards

Gestalt laws (grouping rules):

  • set of rules describing which elements in an image will appear to group together

  • Includes

    • Gestalt law of “good continuation”

    • Gestalt law of similarity

    • group parallel and symmetric elements together

    • Gestalt law of common fate

    • Gestalt law of synchrony

    • Common region

    • Connectedness

22
New cards

Gestalt law of “good continuation”:

  • two elements tend to group together if they seem to lie on the same smooth contour

  • In area V1, the neurons N1 and N2 get excited because they both see lines of the same orientations

  • They inhibit N3 because it has similiar reseptive field but not the same orientations

<ul><li><p>two elements tend to group together if they seem to lie on the same smooth contour</p></li><li><p>In area V1, the neurons N1 and N2 get excited because they both see lines of the same orientations</p></li><li><p>They inhibit N3 because it has similiar reseptive field but not the same orientations</p></li></ul><p></p>
23
New cards

Gestalt law of “good continuation”

  • what is considered smooth

    • Geisler et al. (2001): natural scene statistics explain the gestalt law of good continuation.

    • Photo of forest, do 2 lines belong to same twig was done in study

    • Lines that are more smooth will belong to same object.

<ul><li><p>what is considered smooth</p><ul><li><p>Geisler et al. (2001): natural scene statistics explain the gestalt law of good continuation.</p></li><li><p>Photo of forest, do 2 lines belong to same twig was done in study</p></li><li><p>Lines that are more smooth will belong to same object.</p></li></ul></li></ul><p></p>
24
New cards

What to do with gaps in contours?

  • A sudden stop in an edge

  • Illusory contours:

    • Kanizsa figures

  • Visual interpretation of several “aligned” end stopping

    • that’s no coincidence there must be an occluding contour!

<ul><li><p>A sudden stop in an edge</p></li><li><p>Illusory contours:</p><ul><li><p>Kanizsa figures</p></li></ul></li><li><p>Visual interpretation of several “aligned” end stopping</p><ul><li><p>that’s no coincidence there must be an occluding contour!</p></li></ul></li></ul><p></p>
25
New cards

Murray et al. (2002): late (higher level) visual processes

Different pigments = 126 ms to Process

<p><span>Different pigments = 126 ms to Process</span></p>
26
New cards

Texture segmentation:

  • carving (parsing) an image into regions of common texture properties

  • Doesn’t always work very well, unstable, dependent on image quality

<ul><li><p>carving (parsing) an image into regions of common texture properties</p></li><li><p>Doesn’t always work very well, unstable, dependent on image quality</p></li></ul><p></p>
27
New cards

Gestalt law of similarity:

elements group together if they are similar

28
New cards

Camouflage is the attempt to

  • trick texture segmentation.

  • If we are tricking Gestalt law of similarity meaning we use it to some extent

<ul><li><p>trick texture segmentation.</p></li><li><p>If we are tricking Gestalt law of similarity meaning we use it to some extent</p></li></ul><p></p>
29
New cards

Gestalt law of “proximity”

  • two elements group together if their distance is small

  • The stars form rows and lines

<ul><li><p>two elements group together if their distance is small</p></li><li><p>The stars form rows and lines</p></li></ul><p></p>
30
New cards

Somewhat weaker Gestalt grouping principles

group parallel and symmetric elements together

<p><span>group parallel and symmetric elements together</span></p>
31
New cards

Gestalt law of common fate:

groups together elements that are moving in the same direction.

<p><span>groups together elements that are moving in the same direction.</span></p>
32
New cards

Gestalt law of synchrony:

  • groups elements together that are changing at the same time.

  • Same 4 dots change colour

<ul><li><p>groups elements together that are changing at the same time.</p></li><li><p>Same 4 dots change colour</p></li></ul><p></p>
33
New cards

Common region:

  • Elements perceived to be part of a larger region group together

  • Row 2

<ul><li><p>Elements perceived to be part of a larger region group together</p></li><li><p>Row 2</p></li></ul><p></p>
34
New cards

Connectedness:

  • Elements that are connected to each other group together

  • Row 3

<ul><li><p>Elements that are connected to each other group together</p></li><li><p>Row 3</p></li></ul><p></p>
35
New cards

How do multiple gestalt principles work together

  • Parallel processing

  • Perceptual committee models: middle vision similar to a collection of “specialists” for certain features (feature values) who vote on their opinions

  • E.g., pandemonium model (Selfridge, 1959)

    • Each demon is like a simple cell or neuron

    • Feature demon → cognitive demon → decision demons

  • Letter recognition

  • “Demons” loosely represent (sets of) neurons; each level = different brain area

<ul><li><p>Parallel processing</p></li><li><p>Perceptual committee models: middle vision similar to a collection of “specialists” for certain features (feature values) who vote on their opinions</p></li><li><p>E.g., pandemonium model (Selfridge, 1959)</p><ul><li><p>Each demon is like a simple cell or neuron</p></li><li><p>Feature demon → cognitive demon → decision demons</p></li></ul></li><li><p>Letter recognition</p></li><li><p>“Demons” loosely represent (sets of) neurons; each level = different brain area</p></li></ul><p></p>
36
New cards

Committee rules

  • Honour the laws of physics (& biology)

    • Nothing is going to defy gravity on earth. The pallet has concave and convex circles, not floating circles

  • Resolve ambiguity (Necker cube)

    • Perceiving squares from below or above, different perceptions

  • Stats: reject accidental viewpoints

    • What is likely what is unlikely

    • Dismiss coincidences

<ul><li><p>Honour the laws of physics (&amp; biology)</p><ul><li><p>Nothing is going to defy gravity on earth. The pallet has concave and convex circles, not floating circles</p></li></ul></li><li><p>Resolve ambiguity (Necker cube)</p><ul><li><p>Perceiving squares from below or above, different perceptions</p></li></ul></li><li><p>Stats: reject accidental viewpoints</p><ul><li><p>What is likely what is unlikely</p></li><li><p>Dismiss coincidences</p></li></ul></li></ul><p></p>
37
New cards

Multiple gestalt principles for figure-ground segmentation

  • What is the to-be- recognized object and what is the background?

  • Foreground is more symmetrical, parallel

<ul><li><p>What is the to-be- recognized object and what is the background?</p></li><li><p>Foreground is more symmetrical, parallel</p></li></ul><p></p>
38
New cards

Figure-ground assignment:

  • determines that some image region belongs to an object in the foreground, other regions are part of the background.

  • Uses heuristics

39
New cards

Gestalt figure–ground assignment principles:

Suroundedness , size, symmetry, parallelism

40
New cards

Extremal edges:

  • horizons of self-occlusion on smooth convex surfaces

  • powerful figure-ground cue

<ul><li><p>horizons of self-occlusion on smooth convex surfaces</p></li><li><p>powerful figure-ground cue</p></li></ul><p></p>
41
New cards

Heuristics for partially occluded features

We complete edges behind occluders when the edges are relatable by an “elbow curve”

<p><span>We complete edges behind occluders when the edges are relatable by an “elbow curve”</span></p>
42
New cards

Heuristics

are mental shortcuts that work most of the time but not always.

43
New cards

Relatability

degree to which two line segments appear to be part of the same contour.

44
New cards

More heuristics that serve as cues for depth & occlusion:

  • Non-accidental features provide clues to object structure. Don’t depend on exact viewing position.

  • T junction = occlusion; not arrow/Y junctions → not coincident or depend on differing viewpoints

<ul><li><p>Non-accidental features provide clues to object structure. Don’t depend on exact viewing position.</p></li><li><p>T junction = occlusion; not arrow/Y junctions → not coincident or depend on differing viewpoints</p></li></ul><p></p>
45
New cards

An arrow junction has one line extending

"into" a "v" formed by the other two

<p><span>"into" a "v" formed by the other two</span></p>
46
New cards

Y-junction has the three lines

meeting at an angle less than 180 degrees, forming a "Y" shape

<p><span>meeting at an angle less than 180 degrees, forming a "Y" shape</span></p>
47
New cards

Gestalt laws → object recognition → object perception in middle vision through button up

knowt flashcard image
48
New cards

bottom-up processing

refers to building perceptions from basic sensory information, like individual light patterns or sound frequencies. 

49
New cards

Top-down processing

uses prior knowledge, expectations, and context to interpret those sensations

50
New cards

Rubin figure:

  • Ambiguous image

  • Has to do with gestalt law to with figure ground segmentation

  • Orange surrounded by white: vase, symmetrical, you would assume its at the front and white is background, therefore object is a vase

  • But if orange is background, the object is 2 faces looking at each other, also symmetrical

  • Is brown or white the figure here?

  • Object recognition starts before figure–ground assignment finishes!

    • If perception was only feed forward (apply rules of figure ground segmentation, then recognize things) you would say “i see vase” and never see the 2 faces because brain already dismissed it

    • Therefore bottom up and top down processing you can switch between the 2

<ul><li><p>Ambiguous image</p></li><li><p>Has to do with gestalt law to with figure ground segmentation</p></li><li><p>Orange surrounded by white: vase, symmetrical, you would assume its at the front and white is background, therefore object is a vase</p></li><li><p>But if orange is background, the object is 2 faces looking at each other, also symmetrical</p></li><li><p>Is brown or white the figure here?</p></li><li><p>Object recognition starts before figure–ground assignment finishes!</p><ul><li><p>If perception was only feed forward (apply rules of figure ground segmentation, then recognize things) you would say “i see vase” and never see the 2 faces because brain already dismissed it</p></li><li><p>Therefore bottom up and top down processing you can switch between the 2</p></li></ul></li></ul><p></p>
51
New cards

Global superiority effect

  • Children see local stuff first (details) before global (whole picture/ objects)

  • Adults are opposite

    • See forest before you see the tree, less to deal with

  • Properties of a whole object take precedence over parts of the object.

<ul><li><p>Children see local stuff first (details) before global (whole picture/ objects)</p></li><li><p>Adults are opposite</p><ul><li><p>See forest before you see the tree, less to deal with</p></li></ul></li><li><p>Properties of a whole object take precedence over parts of the object.</p></li></ul><p></p>
52
New cards

Naive Template Theory

  • says that we hold a sample template of objects in our memories, and so exemplar methods allow for matching new stimuli to existing examples, or prototypes, in their memory

  • “Lock-and- key” representations

    • Object that you perceive (key) stuff stored in memory (lock). When key goes in lock, you preform recognition

    • However only one key works for each lock (template would not work only work for different object but also different versions of one object or letter. The fonts below of would not be recognized by the pixel lock and key) sooo….

  • Problem: You would need too many templates!

    • Not practical

<ul><li><p><strong>says that we hold a sample template of objects in our memories, and so exemplar methods allow for matching new stimuli to existing examples, or prototypes, in their memory</strong></p></li><li><p>“Lock-and- key” representations</p><ul><li><p>Object that you perceive (key) stuff stored in memory (lock). When key goes in lock, you preform recognition</p></li><li><p>However only one key works for each lock (template would not work only work for different object but also different versions of one object or letter. The fonts below of would not be recognized by the pixel lock and key) sooo….</p></li></ul></li><li><p>Problem: You would need too many templates!</p><ul><li><p>Not practical</p></li></ul></li></ul><p></p>
53
New cards

solutions to Naive Template Theory

  • Structural Description Theory

  • View dependent models

54
New cards

Structural Description Theory

  • CAD-like 3D (“view-invariant”) models

  • Represent the structure of objects, not single views

<ul><li><p>CAD-like 3D (“view-invariant”) models</p></li><li><p>Represent the structure of objects, not single views</p></li></ul><p></p>
55
New cards

Marr & Nishihara (1978): cylinders

  • Theory on how Structural Description Theory occurs

  • Early processing: figure ground segregation, edge extraction

  • Part segmentation: seperate object to parts (head, arms, legs, torso)

  • Axis estimation : which body parts on which axis / orientation

  • Volumetric modelling ( same axis have same shaped cylinder, like 2 arms different then 2 legs , different from torso and head) → length of cylinder and orientation

  • See if 3D models match what you have stored in memory

<ul><li><p>Theory on how Structural Description Theory occurs</p></li><li><p>Early processing: figure ground segregation, edge extraction</p></li><li><p>Part segmentation: seperate object to parts (head, arms, legs, torso)</p></li><li><p>Axis estimation : which body parts on which axis / orientation</p></li><li><p>Volumetric modelling ( same axis have same shaped cylinder, like 2 arms different then 2 legs , different from torso and head) → length of cylinder and orientation</p></li><li><p>See if 3D models match what you have stored in memory</p></li></ul><p></p>
56
New cards

Biederman (1987): Recognition-By-Components (RBC)

  • Don’t have cylinders but geons

  • geons - generalized cylinders where the cross-section can vary over the length of the axis, which itself might not be straight

<ul><li><p>Don’t have cylinders but geons</p></li><li><p>geons - generalized cylinders where the cross-section can vary over the length of the axis, which itself might not be straight</p></li></ul><p></p>
57
New cards

Did Picasso inspire structural description theory & RBC

  • Cubism: Picasso, Braque and others

  • Attempt to depict the visual world with basic shapes (neck recognized by cylinder)

<ul><li><p>Cubism: Picasso, Braque and others</p></li><li><p>Attempt to depict the visual world with basic shapes (neck recognized by cylinder)</p></li></ul><p></p>
58
New cards

Flaw in structural description theory & RBC

  • Geons/cylinders difficult to extract from real images

  • Subordinate level recognition?

    • Different orientation as the same person

  • Natural objects with complex structures?

    • That’s a lot of cylinder

  • Total viewpoint invariance??

    • an object can be identified consistently regardless of the viewing angle or orientation. → not true we dont have Total viewpoint invariance

<ul><li><p>Geons/cylinders difficult to extract from real images</p></li><li><p>Subordinate level recognition?</p><ul><li><p>Different orientation as the same person</p></li></ul></li><li><p>Natural objects with complex structures?</p><ul><li><p>That’s a lot of cylinder</p></li></ul></li><li><p>Total viewpoint invariance??</p><ul><li><p><strong>an object can be identified consistently regardless of the viewing angle or orientation</strong>.&nbsp;→ not true we dont have Total viewpoint invariance</p></li></ul></li></ul><p></p>
59
New cards

view-dependent models

  • Tarr, Buelthoff and others assume that we store (a small set of) different views of the same image.

  • To limit the number of views that have to be stored we may interpolate between the views.→ all the bees have wings this consistent features help us interpolate

  • This requires cues that are fairly robust to the vantage point (non-accidental features).

<ul><li><p>Tarr, Buelthoff and others assume that we store (a small set of) different views of the same image.</p></li><li><p>To limit the number of views that have to be stored we may interpolate between the views.→ all the bees have wings this consistent features help us interpolate</p></li><li><p>This requires cues that are fairly robust to the vantage point (non-accidental features).</p></li></ul><p></p>
60
New cards

view-dependent models E.g., object pieced together from individual parts

knowt flashcard image
61
New cards

Optical imaging in view dependant models

  • When they showed monkey a fire extinguisher, these parts in red of IT cortex were active. The blue portion was not

  • Then they took the fire extinguisher apart and showed each part to the monkey individually to see which part correlated with each brain activity

  • shows neurons in IT are organized in feature columns that serve as basic building blocks of recognition. ➔ Perception/recognition may work through distributed processes within and across many areas

<ul><li><p>When they showed monkey a fire extinguisher, these parts in red of IT cortex were active. The blue portion was not</p></li><li><p>Then they took the fire extinguisher apart and showed each part to the monkey individually to see which part correlated with each brain activity</p></li><li><p>shows neurons in IT are organized in feature columns that serve as basic building blocks of recognition. ➔ Perception/recognition may work through distributed processes within and across many areas</p></li></ul><p></p>
62
New cards

Multiple recognition committees

  • Perhaps more than one object recognition strategy?

  • When we see a normal bird we say “thats a bird” but when we see an ostrich we don’t say that a bird even though it’s also a bird. But its not a typical representative of birds

63
New cards

Object recognition often associates a percept with a category of objects.

  • Categories are discrete, hierarchically organized.

  • Objects are usually named at entry level.

  • Atypical category members ➔ subordinate level

  • Experts ➔ subordinate level

  • This shows object recognition relies on different acts of recognition

  • E.g., recognition at subordinate level might be difficult to perform with Biederman’s geons.

  • Involve different brain areas

<ul><li><p>Categories are discrete, hierarchically organized.</p></li><li><p>Objects are usually named at entry level.</p></li><li><p>Atypical category members ➔ subordinate level</p></li><li><p>Experts ➔ subordinate level</p></li><li><p>This shows object recognition relies on different acts of recognition</p></li><li><p>E.g., recognition at subordinate level might be difficult to perform with Biederman’s geons.</p></li><li><p>Involve different brain areas</p></li></ul><p></p>
64
New cards

“Special” processes may be involved in identifying individual faces

  • Reason 1: evolutionary/learning arguments

  • Reason 2: cognitive argument

  • Reason 3: Patients with prosopagnosia.

  • Reason 4: Special face area

  • Reason 5: Jennifer Aniston cell

65
New cards

Reason 1: evolutionary/learning arguments

  • Faces are important for us and have been for long time

  • Evolutionary advantage

  • Learning argument, we encounter faces a lot in our daily lives so we have lots of experience so its not surprising our face recognition is good because we have lots of practice

66
New cards

Reason 2: cognitive argument

  • We Identify faces all the time: we don’t think “is this a face” we think “who’s face is this”

  • We could identify other objects like a car, I want to get into my car not someone else’s, but we do it more with faces

67
New cards

Reason 3: Patients with prosopagnosia.

  • In right temporal cortex Patient can’t recognize faces

  • They know its a face, not a car, but they can’t identify who’s face it is

68
New cards

Reason 4: Special face area

  • Fusiform face area

  • Special area of brain that recognizes faces, this is damaged in patients with prosopagnosia

  • No part of brain that is only for car or trees

69
New cards

Reason 5: Jennifer Aniston cell

  • Cells that identify faces

  • Showed monkey picture of faces and objects and saw the response of the neurons action potentials

  • Cells recognize parts of face like nose or eye (like the fire extinguisher)

<ul><li><p>Cells that identify faces</p></li><li><p>Showed monkey picture of faces and objects and saw the response of the neurons action potentials</p></li><li><p>Cells recognize parts of face like nose or eye (like the fire extinguisher)</p></li></ul><p></p>
70
New cards

But are faces really special?

  • Gauthier et al. (1999): expertise in recognizing novel objects. After being trained on recognizing “greebles” subjects showed increased activity in the right FFA

  • People who have never seen greebles did not get activity in FFA upon seeing them.

  • Conclusion: expertise in recognizing objects that activate FFA. Not area of face recognition but area of visual expertise

  • Is the FFA an area of visual expertise? Example: Bird experts get activity in FFA for birds. Car experts get activity in FFA for cars

<ul><li><p>Gauthier et al. (1999): expertise in recognizing novel objects. After being trained on recognizing “greebles” subjects showed increased activity in the right FFA</p></li><li><p>People who have never seen greebles did not get activity in FFA upon seeing them.</p></li><li><p>Conclusion: expertise in recognizing objects that activate FFA. Not area of face recognition but area of visual expertise</p></li><li><p>Is the FFA an area of visual expertise? Example: Bird experts get activity in FFA for birds. Car experts get activity in FFA for cars</p></li></ul><p></p>
71
New cards

Line label idea:

Neural tuning functions (orientation, motion, etc.) are fixed such that neural response “means” that a stimulus with an attribute close to the neuron’s preference is present

72
New cards

Dynamic view:

  • top-down signals cause neurons to change the “meaning” of the info they carry to carry more info about the stimulus being discriminated

  • Higher-order areas sending top-down signals then ‘interpret’ the resulting bottom-up responses

73
New cards

The bottom-up and top-down of perception

  • Increasingly complex stimuli drive neurons in different parts of the ‘What’ system.

  • (Larger receptive fields)

  • ➔ bottom-up

  • “line label idea”

<ul><li><p>Increasingly complex stimuli drive neurons in different parts of the ‘What’ system.</p></li><li><p>(Larger receptive fields)</p></li><li><p>➔ bottom-up</p></li><li><p>“line label idea”</p></li></ul><p></p>
74
New cards

Object parts => whole objects <= context.

  • What is this?

  • Easy to detect face when there is blurry and some context

  • Blurred head no body : FFA will not respond very high when blurry

  • Body with face: respond a little more not much

  • Body and face but wrong arrangement : respond a little more not much

  • Head on body with right arrangement: more signal than with just head. Body is the context and is necessary for recognition

  • degraded faces activate fusiform face area when presented in the right context using top down processes

<ul><li><p>What is this?</p></li><li><p>Easy to detect face when there is blurry and some context</p></li><li><p>Blurred head no body : FFA will not respond very high when blurry</p></li><li><p>Body with face: respond a little more not much</p></li><li><p>Body and face but wrong arrangement : respond a little more not much</p></li><li><p>Head on body with right arrangement: more signal than with just head. Body is the context and is necessary for recognition</p></li><li><p>degraded faces activate fusiform face area when presented in the right context using top down processes</p></li></ul><p></p>
75
New cards

Bayesian inference:

  • The probability of a stimulus given an image is unknown.

  • Probability of things depend on context

  • But the reverse can be learned through experience, i.e., the probability of an image, given a stimulus.

  • So does the a-priori probability of stimuli (our prior knowledge that there are dogs etc.)

<ul><li><p>The probability of a stimulus given an image is unknown.</p></li><li><p>Probability of things depend on context</p></li><li><p>But the reverse can be learned through experience, i.e., the probability of an image, given a stimulus.</p></li><li><p>So does the a-priori probability of stimuli (our prior knowledge that there are dogs etc.)</p></li></ul><p></p>
76
New cards

p(S|I): posterior probability

i.e., given what we see what is really out there in the world? Can’t know directly.

<p><span>i.e., given what we see what is really out there in the world? Can’t know directly.</span></p>
77
New cards

p(I|S):

  • given that there is a soccer ball, how likely will its projection be a Necker cube (VERY unlikely) etc.

    • models how objects create images on the retina

78
New cards

p(S)

  • a-priori probability of the things in the world, what we know exists and how likely.

    • representation of the objects that exist in the world

<ul><li><p>a-priori probability of the things in the world, what we know exists and how likely.</p><ul><li><p>representation of the objects that exist in the world</p></li></ul></li></ul><p></p>
79
New cards

p(I)

a-priori probability of stimulation; usually not so important…Our image on the retina:

80
New cards

p(S|I) ~ p(I|S)p(S)

Combining p(I|S) and p(S)

<p><span>Combining p(I|S) and p(S)</span></p>
81
New cards

The idea that perception can be described as Bayesian inference means that perception is

governed by learned expectations.

82
New cards

Bayesian inference combines:

  • Inference

  • Generation

83
New cards

Inference:

given the image, what object do I see

84
New cards

Generation:

  • given what I know about objects what should they look like?

  • Important influence on computer science…

  • Imagine letter U, I can picture it

85
New cards

Convolutional neuralnetworks (CNNs):

  • image classification through bottom-up (feed-forward) processes

  • Problem with these is it is susceptible to noise

<ul><li><p>image classification through bottom-up (feed-forward) processes</p></li><li><p>Problem with these is it is susceptible to noise</p></li></ul><p></p>
86
New cards

Deconvolutional generative

  • models (DGMs) create images from text

  • E.g., DALL-E 2

<ul><li><p>models (DGMs) create images from text</p></li><li><p>E.g., DALL-E 2</p></li></ul><p></p>
87
New cards

Neural networks with feedback (e.g., CNN-Fs)

combine inference and generation through bottom-up (feed-forward) and top-down (feedback) processes, respectively

88
New cards

Early stagepsychosis patients vs. healthy control ptcpts.

  • (1) Before: “Which half-tone image shows a person?”

  • (2) Presentation of colour images

  • (3) After: First test repeated more noticeable to notice the object because you saw the colour version

  • Patients improved more than controls!

  • Conclusion : [Perception can be considered a form of] controlled hallucination [that depends on the…] interaction between top-down, brain-based predictions and bottom-up sensory data.“

  • [This is in contrast to a] hallucination as a kind of false perception

89
New cards

In sum… perception uses generative models,

  • i.e., it generates models/representations of the world to understand how the world creates our sensations.

  • Model of (something in) the world

    • P(S)

    • P(IlS)

  • We can’t expect everything in the world just by closing our eyes and imagining it. That’s the element of surprise

<ul><li><p>i.e., it generates models/representations of the world to understand how the world creates our sensations.</p></li><li><p>Model of (something in) the world</p><ul><li><p>P(S)</p></li><li><p>P(IlS)</p></li></ul></li><li><p>We can’t expect everything in the world just by closing our eyes and imagining it. That’s the element of surprise</p></li></ul><p></p>