Operant Conditioning (3.8)

Moving from classical conditioning, we explore operant conditioning, which focuses on consequences

Note: Established by B.F. Skinner

Definition: Associating consequences (reinforcement and punishment) with behaviors

EX: A dog sits on command and in return receives a treat

Law of Effect: Behaviors with reinforcing consequences are more likely to be repeated; those with punishing consequences are less likely
EX: A child cleans their room and receives praise, increasing the likelihood of cleaning again

Understanding operant conditioning helps in shaping behaviors through reinforcement and punishment

Reinforcement and Punishment

Positive Reinforcement: Adding a desirable stimulus to increase a behavior

EX: Giving a treat for a good behavior

Negative Reinforcement: Removing an aversive stimulus to increase behavior

EX: Turning off a loud alarm when a task is completed

Positive Punishment: Adding an aversive stimulus to decrease a behavior
EX: Receiving a speeding ticket for going over the speed limit

Negative Punishment: Removing a desirable stimulus to decrease a behavior
EX: Taking away video game privileges

Primary and Secondary Reinforcers

Primary Reinforcers: Naturally reinforcing, satisfy biological needs

EX: Food, water

Secondary Reinforcers: Learned reinforcers, associated with primary reinforcers

EX: Money, praise

Discrimination: Responding to specific stimuli that signal reinforcement

EX: A rat pressing a lever only when a green light is on

Generalization: Responding similarly to stimuli that are similar to the original conditioned stimulus

EX: A rat pressing a lever in response to both green and blue lights

Reinforcement Schedules

The timing of reinforcement delivery affects the strength of behavior conditioning

Continuous Reinforcement: Reinforcement is delivered for each correct behavior

EX: Giving a treat every time a dog sits on command

Partial Reinforcement: Reinforcement is delivered on a variable schedule

Types

Fixed Interval: Reinforcement after a set time period

EX: Paychecks bi-weekly
Variable Interval: Reinforcement after varying time intervals

EX: Checking for a response to an email

Fixed Ratio: Reinforcement after a set number of responses
EX: Commission after every tenth sale

Variable Ratio: Reinforcement after a varying number of responses

EX: Slot machine payouts

Different schedules influence the persistence and strength of conditioned behaviors

Shaping and Instinctive Drift

Behaviors can be gradually shaped through reinforcement

Shaping: Gradually reinforcing successive approximations of the desired behaviors

EX: Rewarding a pigeon as it pecks closer and closer to a button or lever

Instinctive Drift: Tendency for animals to revert back to instinctual behaviors despite reinforcement

EX: A raccoon washing food even when trained to put coins in a bank

Shaping and instinctive drift demonstrate the power and limitations of operant conditioning

Unexpected Applications

Sometimes, behaviors are reinforced or suppressed in unexpected ways

Superstitious Behavior: Occurs when consequences reinforce unrelated behaviors

EX: Fear of crossing the path of a black cat because something bad happened last time


Learned Helplessness: Occurs when organisms learn they have no control over aversive outcomes

EX: Dogs not attempting to escape shock after being restrained initially