Reinforcement (Chapter 4) - Key Concepts
Defining Reinforcement
- Reinforcement is a basic principle of behavior established by Skinner in laboratory research and extended to over 40 years of human research; it is a component of many (most) Behavior Modification procedures.
- A behavior occurs and is immediately followed by a consequence that strengthens the behavior, making it more likely to occur again in similar circumstances.
- Present: Behavior is followed by a consequence
- Future: Behavior is more likely to occur
Effects of Reinforcement on Behavior
- Increases in various dimensions when reinforced appropriately:
- Frequency
- Duration
- Intensity
- Quickness (decrease in latency)
- You get more of whatever the reinforcer is contingent on
Example: Grocery Store Crying
- A child cries at the grocery store and the parent gives the child candy.
- Result: Increase in frequency, intensity, and duration of crying at the store; behavior is strengthened and more likely to occur in the future.
Quick Check: Reinforcement or Not?
- The slides prompt you to decide whether a given scenario involves reinforcement.
Quiz Scenarios ( Reinforcement or not? )
- Guy taunts girl and gets slapped; guy keeps taunting girl in future → Reinforcement: Yes
- Kid answers question correctly and gets praise → Reinforcement: Not clearly indicated; we don’t know if the behavior strengthened
- Mom asks kid nicely to make bed, kid makes bed, mom continues to ask nicely to make bed → Reinforcement: Yes
- Mom nags kid to make bed, kid complies, mom less likely to nag → Reinforcement: No (the behavior was less likely to occur)
- Man chooses lottery ticket from a specific gas station, wins lottery, continues to buy lottery tickets from the same gas station → Reinforcement: Yes
- Boy sticks fork in toaster and gets shock, and is more likely to stick fork in toaster → Reinforcement: Yes
- Teenager plays music loud, mom comes into his room and sits by him and asks how his day was, kid less likely to play music loud → Reinforcement: No (behavior less likely to occur)
- Alarm goes off, you slap the off button, and aversive alarm sound is terminated → Reinforcement: Not clearly indicated (future strengthening not specified)
What is Operant Behavior?
- Operant behavior is behavior that is strengthened through the process of reinforcement and is controlled by its consequences.
- Example: Andy answers a question in class and is praised by the teacher; as a result, Andy is more likely to answer questions in class. Andy’s behavior of answering questions in class is referred to as operant behavior.
What is Operant Behavior? (Key Points)
- Operant behavior is strengthened through reinforcement.
- The consequence that strengthens an operant behavior is a reinforcer.
What is a Reinforcer?
- The consequence (stimulus or event) that follows operant behavior and strengthens operant behavior.
You Reinforce Behavior, Not People
- Reinforcement is a process; reinforcer is a thing.
- Reward vs reinforcers: terminology notes.
Two Types of Reinforcement
- Positive reinforcement
- Negative reinforcement
- Both Positive and Negative Reinforcement strengthen behavior and make the behavior more likely to occur in the future.
Positive Reinforcement
- Definition: Behavior is followed by the presentation of a stimulus (a reinforcer) and the behavior is strengthened.
Visual: Reinforcement Flow (Pleasant vs Unpleasant; Apply vs Remove)
- Positive Reinforcement: Pleasant stimulus applied after behavior (Apply reinforcer) to strengthen behavior.
- Negative Reinforcement: Unpleasant stimulus removed after behavior (Remove aversive stimulus) to strengthen behavior.
Negative Reinforcement? (Definition)
- A behavior is strengthened by the removal of an aversive stimulus.
- Example: When Matt makes a 3-point shot, his coach makes him run one less lap in practice → The removal of the run is the negative reinforcement that strengthens the shot behavior.
Escape and Avoidance Behaviors (Negative Reinforcement)
- Escape: The behavior results in the termination of an aversive stimulus and the behavior is strengthened.
- Example: Children tantrum when they don’t want to do something, and parents back off.
- Avoidance: The behavior prevents the presentation of an aversive stimulus and the behavior is strengthened.
- Example: You come home on time so that you are not grounded by your parents.
Escape and Avoidance (Further Details)
- In escape behavior, the occurrence of the behavior results in the termination of an aversive stimulus (stop speeding on the highway to escape a ticket).
- In avoidance behavior, the occurrence of the behavior prevents the presentation of an aversive stimulus (drive below the speed limit to avoid a ticket).
Your First Question (Conceptual)
- Purpose of behavior vs. behavior itself (terminology exercise from the slides)
Operant Conditioning Summary (Slide Highlight)
- Positive Reinforcement vs Negative Reinforcement
- Pleasant vs Unpleasant; Apply vs Remove; Behavior vs Purpose
- Ex: Reward; Ex: Escape-Avoid
Everyday Examples of Negative and Positive Reinforcement
- Push the power button on the TV remote to turn on the TV → Positive reinforcement
- Put on a coat on a cold day → Negative reinforcement (removal of cold)
- Use an umbrella → Negative reinforcement (removal of rain discomfort)
- Slap a mosquito → Negative reinforcement (removal of mosquito bite annoyance)
- Turn up the volume on your iPod → Positive reinforcement
- Close the window when it rains → Negative reinforcement
- Wear earplugs on a noisy job → Negative reinforcement
- Composing an e-mail to connect with someone → Positive reinforcement (social/functional reward)
- Turn on the exhaust fan → Negative reinforcement (removal of heat/odors)
- Click on the “Explorer” icon and get to the internet → Positive reinforcement (access/achievement)
- Turn off the alarm buzzer → Negative reinforcement (removal of alarm noise)
- Give in to a screaming kid → Negative reinforcement (removal of distress for the caregiver, though the kid is reinforced)
- Screaming kid gets a treat in a store → Positive reinforcement (reinforces crying)
- Tell someone to be quiet in a movie → Negative reinforcement (reduces noise/disruption)
- Screaming kid gets a treat in a store (repeat) → Positive reinforcement
- Tell someone to be quiet in a movie → Negative reinforcement
Positive and Negative Reinforcement in the Same Situation
- Tantrum in a grocery store: Child’s tantrum is strengthened by getting candy from a parent (positive reinforcement for the tantrum).
- The parent’s behavior of giving candy is strengthened by the termination of the tantrum (negative reinforcement for stopping the tantrum).
Unconditioned Reinforcers
- Biologically determined; survival value for the individual.
- Examples: Food, water, human contact (warmth), oxygen, sexual contact, escape from cold, heat, pain, extreme stimulation.
- Is money an unconditioned reinforcer? No; money is not biologically pre-wired and is a conditioned reinforcer.
Conditioned Reinforcers
- A previously neutral stimulus repeatedly paired with an established reinforcer will function as a reinforcer.
- Examples: Sight, sound, and scent of parents; Parents’ smile, tone of voice, attention, praise; Types of toys, TV shows, music, clothes, activities; Grades, positive evaluations; Accomplishments (social, physical); Money; Others
Conditioned Generalized Reinforcers
- Paired with a wide variety of other reinforcers
- Examples: Money, praise, tokens
Factors that Influence Reinforcement
- Immediacy
- Consistency (contingency)
- Individual differences
- Intensity of the stimulus
- Concurrent schedule of reinforcement or punishment
Schedules of Reinforcement
- Specifies which responses will be followed by the reinforcer
- Continuous reinforcement: Each response is followed by the reinforcer
- Intermittent reinforcement: Not every response is followed by a reinforcer
Fixed-Ratio Schedule
- Reinforcer after a fixed number of responses; the number does not change
- Produces a high rate of responding with a post-reinforcement pause
- Example: For every 10 cars you sell, you get a 500 dollar bonus
Variable-Ratio Schedule
- Reinforcer after a number of responses on average (the exact number varies around the average)
- Produces a high rate of responding with no post-reinforcement pause
- Example: Slot machine; phone solicitor does not know the number of calls required to make a sale
Fixed-Interval Schedule
- Reinforcer for the first response after a fixed amount of time
- Time interval does not change
- Produces a low rate of responding that increases toward the end of the interval
- Examples: Public bathroom cleaned every 2 hours; hourly supervisor checks on a worker
Variable-Interval Schedule
- Reinforcer for the first response after a variable amount of time on average
- Produces a low but steady rate of behavior
- Examples: Unpredictable supervisor checks; checking for phone or e-mail messages
Concurrent Schedules of Reinforcement
- Schedules that exist at the same time for two or more different behaviors
- Concurrent operants
- Factors influencing choice of concurrent operants:
- Schedule of reinforcement
- Magnitude of reinforcement
- Immediacy of reinforcement
- Response effort