PSYCH1X03 - Instrumental Conditioning

0.0(0)

Studied by 0 people

0.0(0)

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/51

Earn XP

Description and Tags

Unit/Week 3

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

52 Terms

New cards

Instrumental Conditioning (IC)

Training to learn contingency between voluntary behavior and its consequence (aka Operant conditioning)

New cards

Thorndike’s Law of Effect

Behaviours with positive consequence are stamped, negative consequences are stamped out

New cards

Thorndike’s Puzzle Box

Cat was placed in puzzle box that could be opened by pulling on rope. Cat would do random behaviour until rope was pulled & door opened.

Thorndike predicted that in subsequence trials, cat would escape immediately like a human would. In reality, escape time was linear.

Animals only followed simple stimulus-response type process, lack humans’ “ah-ha!” moment.

New cards

Skinner’s Box/Operant Chamber

Apparatus to study instrumental conditioning by rewarding/punishing an animal for doing something

New cards

Skinner’s Pigeon Box

Free food is periodically provided to pigeons, pigeons would repeat whatever behaviour that was being performed prior to food in a “superstitious” manner

New cards

Reinforcer

Stimulus used after behaviour occurs to influence its frequency

Reward
Punishment
Escape
Omission

New cards

IC/Reinforcer

Reward Training

Presentation of positive reinforcer

(↑ frequency)

New cards

IC/Reinforcer

Punishment

Presentation of negative reinforcer

(↓ frequency)

Controversial, ethics of inflicting fear. Causes classical conditioning fear of authority figure

New cards

IC/Reinforcer
Escape

Removal of negative reinforcer

(↑ frequency)

New cards

IC/Reinforcer
Omission

Removal of positive reinforcer

(↓ frequency)

New cards

Reinforcer Timing

Correct timing of reinforcer is critical; more effective if minimized delay

New cards

Acquisition of Conditioning

Visualized using a cumulative recorder

Flat horizonal = no response
Increase = response

Pattern depends on the participant, behaviour complexity, and reinforcer used

<p>Visualized using a cumulative recorder</p><ul><li><p>Flat horizonal = no response</p></li><li><p>Increase = response</p></li></ul><p>Pattern depends on the participant, behaviour complexity, and reinforcer used</p>

New cards

Autoshaping

Learn contingency without guidance, can only be simple

New cards

Shaping

Learn contingency with guidance through successive approximations, reinforce smaller behaviour to eventually build to wished behaviour

New cards

Chaining

Learn to connect series of actions together, reinforced by providing opportunity to perform next sequential behaviour and given positive reinforcer after finishing

New cards

Shaping vs Chaining

Shaping reinforces for improvement

Chaining reinforced for correct order

New cards

Discriminative Stimulus

Signals validity of response-reinforcer contingency

New cards

IC/Discriminative Stimulus

SD/S+

Signals contingency is valid

i.e. being at parents house → eating vegetables = dessert

Can also be generalized

New cards

IC/Discriminative Stimulus

Sδ/S-

Signals contingency is invalid

i,e being at grandparents’ house → eating vegetables ≠ dessert

New cards

IC/Discriminative Stimulus

SD/S+ Generalization Gradient

SD/S+ can be generalized, stimuli similarity will affect rate of response. Must exist in same modality (existence)

i.e. pigeon will peck button when light is green SD/S+, but also sometimes at light with similar wavelength

New cards

IC/Discriminative Stimulus

Sδ/S- Stimulus Discrimination

Sδ/S- will constrict range of generalization gradient. Training with Sδ/S- is better for fine tuning. Must exist in same modality (existence)

i.e. pigeon will not peck button when light is red Sδ/S- but also sometimes at light with similar wavelength

New cards

CC & IC

CS+ vs SD/S+

CS+ reflexive, involuntary

SD/S+ sets occasion for voluntary

New cards

CC & IC

CS- vs Sδ/S-

CS- predicts absence of US

Sδ/S- establishes no reinforcer

New cards

Reinforcement Schedules

Rules that dictates when reinforcement will occur

Continuous reinforcement
Partial reinforcement

Partial is more robust compared to continuous, because its less obvious that reinforcement ceased

Same reason why VR-# is better than FR-#

New cards

IC/Reinforcement Schedules

Continuous Reinforcement (CRS)

Every response leads to reinforcement, very rare in real world

New cards

IC/Reinforcement Schedule

Partial Reinforcement Schedule (PRS)

Responses are only reinforced sometimes, based on

Ratio vs Interval
Fixed vs Variable

New cards