Operant Conditioning Notes
Operant Conditioning
Operant conditioning pairs voluntary behaviors with outcomes (punishment or reward).
Classical conditioning pairs two stimuli.
Thorndike's Law of Effect
Behaviors with satisfying outcomes are more likely to be repeated.
Behaviors with unsatisfying outcomes are less likely to be repeated.
B.F. Skinner's Terminology
Reinforcement: Anything with a satisfying outcome.
Increases the likelihood of repeating a behavior.
Example: Rewarding a child for saying "please."
Punishment: Anything with an unsatisfying outcome.
Decreases the likelihood of repeating a behavior.
Example: Spraying a cat with water for jumping on counters.
Positive vs. Negative
Positive: Something is added.
Negative: Something is taken away.
Reinforcement
Positive Reinforcement: Adding something good.
Examples: Getting paid, receiving a smile.
Negative Reinforcement: Removing something aversive.
Behavior is reinforced because an aversive stimulus is taken away.
Examples: Cleaning a stinky litter box, getting a flu shot to stop a mother's nagging.
Punishment
Positive Punishment: Adding something bad.
Examples: Spanking, getting yelled at, spraying a cat with water.
Negative Punishment: Taking away something good.
Examples: Losing car keys, time-out.
Schedules of Reinforcement
Continuous Schedule:
Reinforcing a behavior every time it's performed.
Useful for initially training a new behavior.
Partial Reinforcement Schedule:
Behavior is rewarded sometimes, but not always.
Classified by:
Fixed or Variable
Ratio (number of responses) or Interval (duration of behavior)
Types of Partial Reinforcement Schedules
Fixed Ratio:
Rewarded for performing a behavior every certain number of times.
Example: Buy 9 cups of coffee, get the 10th free.
Variable Ratio:
Rewarded based on the number of times performing a behavior, but the timing is unpredictable.
Example: Slot machine.
Fixed Interval:
Rewarded based on performing a behavior for a certain length of time, with the timing known.
Example: Hourly wage.
Variable Interval:
Rewarded for performing a behavior for a certain length of time on average, but the timing is unpredictable.
Example: Phone notifications.
Effects of Reinforcement Schedules
Variable ratio schedules produce the highest amount of responses.
Fixed interval schedules produce the lowest amount of responses.
Variable schedules can lead to potentially addictive behaviors.