Operant Conditioning Notes

Operant Conditioning

  • Operant conditioning pairs voluntary behaviors with outcomes (punishment or reward).

    • Classical conditioning pairs two stimuli.

Thorndike's Law of Effect

  • Behaviors with satisfying outcomes are more likely to be repeated.

  • Behaviors with unsatisfying outcomes are less likely to be repeated.

B.F. Skinner's Terminology

  • Reinforcement: Anything with a satisfying outcome.

    • Increases the likelihood of repeating a behavior.

    • Example: Rewarding a child for saying "please."

  • Punishment: Anything with an unsatisfying outcome.

    • Decreases the likelihood of repeating a behavior.

    • Example: Spraying a cat with water for jumping on counters.

Positive vs. Negative

  • Positive: Something is added.

  • Negative: Something is taken away.

Reinforcement

  • Positive Reinforcement: Adding something good.

    • Examples: Getting paid, receiving a smile.

  • Negative Reinforcement: Removing something aversive.

    • Behavior is reinforced because an aversive stimulus is taken away.

    • Examples: Cleaning a stinky litter box, getting a flu shot to stop a mother's nagging.

Punishment

  • Positive Punishment: Adding something bad.

    • Examples: Spanking, getting yelled at, spraying a cat with water.

  • Negative Punishment: Taking away something good.

    • Examples: Losing car keys, time-out.

Schedules of Reinforcement

  • Continuous Schedule:

    • Reinforcing a behavior every time it's performed.

    • Useful for initially training a new behavior.

  • Partial Reinforcement Schedule:

    • Behavior is rewarded sometimes, but not always.

    • Classified by:

      • Fixed or Variable

      • Ratio (number of responses) or Interval (duration of behavior)

Types of Partial Reinforcement Schedules

  • Fixed Ratio:

    • Rewarded for performing a behavior every certain number of times.

    • Example: Buy 9 cups of coffee, get the 10th free.

  • Variable Ratio:

    • Rewarded based on the number of times performing a behavior, but the timing is unpredictable.

    • Example: Slot machine.

  • Fixed Interval:

    • Rewarded based on performing a behavior for a certain length of time, with the timing known.

    • Example: Hourly wage.

  • Variable Interval:

    • Rewarded for performing a behavior for a certain length of time on average, but the timing is unpredictable.

    • Example: Phone notifications.

Effects of Reinforcement Schedules

  • Variable ratio schedules produce the highest amount of responses.

  • Fixed interval schedules produce the lowest amount of responses.

  • Variable schedules can lead to potentially addictive behaviors.