Operant Conditioning – B.F. Skinner
Introduction to Operant Conditioning
Definition: Operant Conditioning is a learning process where behavior is influenced by its consequences, formulated by B.F. Skinner who built upon Thorndike’s Law of Effect.
Core Belief: Skinner posited that only observable behavior matters; internal thoughts and motivations do not explain behavior effectively.
Key Aspects: Animals (and humans) learn to connect their actions with outcomes—either reinforcement or punishment.
Basic Concepts no
Reinforcement: Increases the likelihood of a behavior being repeated.
Positive Reinforcement: Introducing a pleasant stimulus following a desired behavior (e.g., giving a treat for completing homework).
Negative Reinforcement: Removing an unpleasant stimulus to encourage a behavior (e.g., stopping loud noise when a button is pressed).
Punishment: Decreases the probability of a behavior being repeated.
Positive Punishment: Adding an unpleasant outcome after an undesired behavior (e.g., scolding a child).
Negative Punishment: Taking away a pleasant stimulus in response to an undesired behavior (e.g., confiscating a toy).
Skinner’s Experiments
Skinner Box: An operant conditioning chamber used for experiments.
Example 1: A rat pressing a lever to receive food (demonstrating positive reinforcement).
Example 2: A pigeon pecking a disc to evade an electric shock (demonstrating negative reinforcement).
Conclusion: These experiments confirmed that behavior can be learned or altered through reinforcement and punishment.
Shaping and Chaining
Shaping: Involves reinforcing successive approximations toward a desired behavior.
Example: Rewarding a dog for progressively moving toward the action of rolling over.
Chaining: Involves teaching a sequence of behaviors where each action cues the next; beneficial for teaching complex skills.
Reinforcement Schedules
Skinner's research revealed how timing and frequency of reinforcement impact behavior learning:
Fixed Ratio (FR): Reinforcement is provided after a set number of responses (e.g., reward after every 5 actions).
Variable Ratio (VR): Reinforcement occurs after a varying number of responses (e.g., gambling on slot machines).
Fixed Interval (FI): Reinforcement is given after a specified time period (e.g., weekly quizzes).
Variable Interval (VI): Reinforcement follows unpredictable time intervals (e.g., random check-ins by a teacher).
The Variable Ratio schedule is considered the most effective as it creates a strong resistance to extinction.
Important Terms
Reinforcer: Any event that enhances the likelihood of a behavior occurring again.
Punisher: Any event that diminishes the likelihood of a behavior occurring again.
Extinction: The process through which a behavior ceases because of the absence of reinforcement.
Discriminative Stimulus: A cue indicating that a specific response will be reinforced.
Applications of Operant Conditioning
Education: Rewarding students for good behavior or correct answers fosters a positive learning environment.
Parenting: Tools like praise, time-outs, and reward charts help shape children’s behavior.
Clinical Therapy: Techniques used in behavior modification can help address phobias and addictions.
Workplace: Incentive programs and performance bonuses encourage desired employee behaviors.