Basic Principles of Reinforcement
Basic Principles of Reinforcement
Definition of Reinforcement
- Reinforcement is a crucial behavioral process for modifying behavior.
- It involves a behavior being followed by a consequence.
- This consequence strengthens the behavior, making it more likely to occur in the future.
- Example: A teen does chores and receives an allowance. If the chore-doing behavior increases because of the allowance, reinforcement has occurred.
Components of Reinforcement
- Reinforcement is a process.
- It has an effect on behavior, specifically strengthening it.
- Operant behavior is influenced by antecedents and consequences.
- Consequences either strengthen or weaken behavior.
- Reinforcement strengthens behavior.
Three-Term Contingency (A-B-C)
- A: Antecedent (what comes before the behavior).
- B: Behavior or Response.
- C: Consequence (what follows the behavior).
- Focus is on the behavior and the consequence to understand reinforcement.
Examples:
- Dog Whining:
- Behavior: Dog whines.
- Consequence: Owner provides food.
- If whining increases, reinforcement has occurred.
- Guitar Playing:
- Behavior: Bill plays guitar.
- Consequence: Audience claps and cheers.
- If guitar playing increases, reinforcement has occurred.
- Child Crying:
- Behavior: Child cries.
- Consequence: Father gives a doll.
- If crying increases, reinforcement has occurred.
Types of Reinforcement
- Two types based on adding or removing stimuli after a behavior.
- Reinforcement always strengthens behavior.
Positive Reinforcement
- A behavior is followed by the addition of a stimulus.
- This strengthens the behavior.
- Example: Child cries, father gives a doll. The doll being given is positive reinforcement if crying increases.
- Example: Student completes work quickly, teacher praises. Praise is positive reinforcement if working quickly increases.
- Pleasurable stimuli are added as a consequence.
Negative Reinforcement
- A behavior is followed by the removal of a stimulus.
- This strengthens the behavior.
- Important: Negative reinforcement is NOT punishment. Punishment weakens behavior.
- Example: Loud radio in the car. Turning the knob removes the loud music. If knob-turning increases, negative reinforcement has occurred.
- Example: Shutting a window to stop cold air. Removing the cold air increases window-shutting behavior.
- Unpleasant stimuli are removed.
Summary of Differences
- Positive Reinforcement: Strengthens behavior by adding a pleasant stimulus.
- Negative Reinforcement: Strengthens behavior by removing an unpleasant stimulus.
- The effect on behavior (strengthening) determines the type of reinforcement.
Extinction
- A reinforcer is no longer provided after a response.
- Behaviors that were previously reinforced are no longer reinforced.
- Extinction causes responses to decrease over time.
- Example: A child who received candy for greeting his teacher stops receiving candy. The greeting behavior will decrease.
- When teaching new skills, schedules are thinned gradually to maintain the skill without constant reinforcement.
Extinction Burst
- Behavior may increase temporarily before decreasing when reinforcement is removed.
- Example: Child pulls hair and is picked up. If picking up stops, hair pulling might increase before decreasing.
Types of Reinforcers
Primary Reinforcers (Unconditioned)
- Function as reinforcers without prior learning.
- Essential for biological functioning or survival.
- Examples: food, water, warmth, sexual stimulation.
Secondary Reinforcers (Conditioned)
- Acquire reinforcing properties by being paired with primary reinforcers or other secondary reinforcers.
- Examples: toys, games, praise, social interaction, money.
Token Economies
- Use tokens that are earned and exchanged for backup reinforcers.
- Backup reinforcers are primary or secondary reinforcers chosen by the client.
- Tokens themselves have no intrinsic value.
- Tokens act as positive reinforcement.
Example:
- Giving tokens for desired behaviors, like saying "juice," standing up, jumping, and sitting down. Tokens are then exchanged for shoes.
Benefits of Token Systems
- Increase time between target behavior and backup reinforcement.
- Provide reinforcement without interrupting instruction.
- Less subject to satiation.
Token Boards
- Tokens are placed on the board for target behaviors.
- When the board is full, the client receives the backup reinforcer.
Factors Influencing Reinforcement Effectiveness
- Time between behavior and reinforcer presentation.
- Short time = higher probability of strengthening behavior.
- Long time = lower probability of strengthening behavior.
- Example: Dolphin gets a fish immediately after retrieving a ring vs. 30 minutes later.
Contingency
- Rule about when a behavior will be reinforced.
- Reinforce only desired behaviors; do not reinforce incorrect behaviors.
- Example: Praise children for bringing plates to the sink but not for leaving them on the table.
Quality
- Preference for the reinforcing stimulus.
- Example: Chocolate ice cream is a better reinforcer than vanilla if the child prefers chocolate.
Magnitude
- Size or duration of the reinforcer.
- Example: 15 minutes of TV is more effective than 5 minutes.
Effort
- Response effort required to obtain reinforcement.
- Too much effort can prevent the behavior from occurring.
- Example: Choosing a restaurant that is a 5 min walk compared to 20 min subway ride and 10 min walk.
- Example: Reducing required social behaviors from 5 to 1 to get access to the iPad.
History of Reinforcement
- Consider what has worked in the past for the individual.
- Example: Salty snacks might work for some but not others. Social interaction may work for some but not others.
Motivating Operations (MOs)
- Events that temporarily alter the effectiveness of a reinforcer.
- Can make a reinforcer more or less effective.
Satiation
- Repeated use of a reinforcer reduces its effectiveness.
- Example: Eating a big meal reduces the reinforcing value of more food.
Deprivation
- Lack of access to a reinforcer increases its effectiveness.
- Example: Hunger increases the reinforcing value of food.
Schedules of Reinforcement
- Rule specifying which occurrences of a behavior will be reinforced.
- Continuum from no reinforcement (extinction) to reinforcing every response (continuous reinforcement).
- Partial or intermittent reinforcement lies in between the two extremes.
Continuous Reinforcement
- Used when teaching new skills.
- Each correct response is reinforced.
Intermittent Reinforcement
- Some, but not all, responses are reinforced.
- Used as the response becomes more regular.
- Example: Reinforcing every correct response initially, then reinforcing after a few correct responses in a row.
Thinning Schedules
- Gradually reinforcing responses less often.
- Important because:
- Reinforcement in the natural environment is less frequent.
- Intermittent schedules are more resistant to extinction.
Types of Intermittent Schedules
Ratio Schedules
- Response-based: Reinforcer is delivered after a specific number of responses.
- Example: Tom gets paid $120 for every 6 pages of translation.
Fixed Ratio (FR)
- Reinforcer occurs after a fixed number of responses.
- Example: Tom’s typing is on an FR6 schedule (paid after every 6 pages).
Variable Ratio (VR)
- Reinforcer occurs after a variable number of responses (average).
- Example: Jane's car starts after an average of 3 key turns (VR3).
Interval Schedules
- Time-based: Require a time period to pass between reinforcers.
- Reinforcer delivered for the first response after the interval ends.
- Example: Teaching a child to string beads. At a fixed interval of 20 seconds, the first bead strung is reinforced.
Fixed Interval (FI)
- Reinforcer delivered for the first response after a fixed time interval.
- Example: Bead stringing on an FI 20-second schedule.
Variable Interval (VI)
- Reinforcer delivered for the first response after a variable time interval (average).
- Example: Teacher gives reward sticker after an average of 4 minutes (VI 4 minutes).
Schedule Thinning (Detailed)
- Gradually moving from continuous to intermittent reinforcement.
Thinning Ratio Schedules
- Gradually increase the number of responses before reinforcement.
- Increase the response ratio in small increments.
- Too quick schedule thinning will result to the learner giving up.
- Example: Moving from FR1 to VR3 for saying “hi.”
Thinning Interval Schedules
- Gradually increase the time interval before reinforcement.
- Increase the time interval in small increments.
- Too quick schedule thinning will result to the learner giving up.
- Example: Moving from FI 1 minute to VI 2 minutes for toy play.
Review
- Reinforcement strengthens behavior.
- Positive and negative reinforcement.
- Conditioned and unconditioned reinforcers.
- Factors influencing reinforcement effectiveness.
- Schedules of reinforcement should be thinned gradually.