Operant Conditioning
Classical vs. Operant Conditioning
Similarities
Both involve:
Acquisition
Extinction
Spontaneous recovery
Generalization
Discrimination
Key Difference
Classical conditioning:
Passive learning
Involuntary, reflexive responses
Association between two stimuli
Example: Pavlov’s dog salivating to a bell
Operant conditioning:
Active learning
Voluntary behaviors
Association between behavior and consequence
Example: Skinner’s rat pressing a lever for food
Memory hook
Pavlov: Something happens to me
Skinner: I do something, something happens
Operant Conditioning (B.F. Skinner)
Behavior is shaped by consequences
Consequences determine whether behavior increases or decreases
Acquisition
Learning a new behavior
Occurs when a behavior is reliably performed
Gradual, not all-or-nothing
Shaping
Reinforcing successive approximations of a target behavior
Used when the behavior does not occur naturally
Example:
Rat rewarded for moving toward lever
Then touching lever
Finally pressing lever
Chaining
Reinforcement after a sequence of behaviors
Each step is learned individually, then linked
Example:
Dog jumps through hoop → rings bell → catches Frisbee
Extinction (Operant Conditioning)
Behavior decreases when reinforcement stops
Example:
Rat stops pressing lever when food stops
Child stops tantrum when ignored
Extinction Burst
Temporary increase in behavior before extinction
“Where’s my reward?”
Spontaneous Recovery
Reappearance of an extinguished behavior after reinforcement returns
Example:
One candy bar brings tantrums back
Generalization
Similar behaviors occur because they were previously reinforced
Example:
Pigeon pecks red key and also pecks pink key
Discrimination
Learning which behaviors are rewarded and which are not
Discriminative stimulus signals when reward is available
Example:
Pigeon pecks key only when light is on
Reinforcement vs. Punishment
Reinforcement
Always increases behavior
Positive Reinforcement
Adding something pleasant
Example:
Treat for cleaning room
Negative Reinforcement
Removing something unpleasant
Two types:
Avoidance: behavior prevents aversive stimulus
Escape: behavior stops aversive stimulus
Example:
Seatbelt stops buzzer
Punishment
Always decreases behavior
Involves adding an aversive stimulus
Example:
Shock for lever pressing
Effective punishment must be:
Immediate
Consistent
Linked to behavior
Paired with reinforcement of good behavior
Omission Training (Removal of Privilege)
Removing a positive reinforcer to stop behavior
Alternative to punishment
Example:
Losing phone privileges for misbehavior
Biological Constraints
Instinctual Drift
Learned behaviors revert to instinctual patterns
Breland & Breland experiment:
Raccoons hoarded coins instead of depositing them
Schedules of Reinforcement
Continuous Reinforcement
Reinforce every response
Fast learning
Fast extinction
Intermittent Reinforcement
Reinforce some responses
More resistant to extinction
Fixed-Ratio (FR)
Reward after set number of responses
Example:
Paid every 10 items produced
Fixed-Interval (FI)
Reward after set time
Example:
Weekly test → cramming
Variable-Ratio (VR)
Varying number of responses
Highest response rate
Most resistant to extinction
Example:
Slot machines
Variable-Interval (VI)
Varying time intervals
Steady behavior
Example:
Pop quizzes