Classical and Operant Conditioning
Learning Definition
Learning: A relatively permanent change in behavior brought about by experience or practice.
- Relatively permanent: You can unlearn something as well as learn it.
Classical Conditioning
Scenario:
- Instructor turns off the light before lecturing.
- Students associate the light going out with an unpleasant lecture.
- Student groans when girlfriend turns off the light outside of class.
Pavlov's Discovery (Early 1900s):
- Ivan Pavlov (1849-1936) - Physiologist measuring salivation in dogs.
- Dogs salivated before food appeared, associating footsteps with food.
- Example: Dog/cat gets excited when you open a cupboard or a can of food.
Classical Conditioning Structure
- Unconditioned Stimulus (UCS): Naturally occurring event.
- Unconditioned Response (UCR): Naturally occurring response to the UCS.
- Neutral Stimulus (NS): Does not initially elicit a response.
- Conditioned Stimulus (CS): The neutral stimulus after pairing with the UCS.
- Conditioned Response (CR): The response elicited by the conditioned stimulus (same as UCR).
- Key elements:
- Reflexes
- Associations/Pairings (e.g., bell with food)
- Contiguity (things next to each other)
Taste Aversion
- Example: Eat something, get sick, then feel nauseous even smelling/thinking about that food.
Pavlov's Experiment
- UCS: Food
- UCR: Salivation
- NS: Bell
- Pairing: Bell with food
- CS: Bell
- CR: Salivation
Watson and Rayner - Little Albert Experiment
John Watson and assistant Rosalie Rayner conditioned fear into a 9-month-old baby (Little Albert).
Unethical experiment (causing fear).
Watson and Rayner had an affair and he was booted out of academia. He then did ads in New York using classical and operant conditioning.
Procedure:
- Exposed Albert to furry animals (rat).
- Banged a hammer on a pipe when presenting the rat.
- Albert showed fear.
Analysis:
- UCS: Clanging sound
- UCR: Fear
- NS: Furry rat
- Pairing: Clanging with the rat
- CS: Rat
- CR: Fear
Generalization and Discrimination
- Generalization: Little Albert feared all white and furry things.
- Discrimination: Albert could tell the difference between a rat and a Santa Claus mask.
- Experiment deemed not credible because:
- Ethics: unethical to cause a child fear.
- Data: Watson has no real observable data that he collected.
- IRB (Institutional Review Board) ensures ethical experiments.
Acquisition
How behaviors are acquired.
Types of Acquisition Trials:
- Delay: Present the neutral stimulus, then the unconditioned stimulus (best approach).
- Simultaneous: Present food and bell at the same time.
- Backward: Ring the bell, then give food (doesn't work well).
Extinction and Spontaneous Recovery
- Extinction Trials: Ring the bell many times without food; the response diminishes.
- Spontaneous Recovery: The behavior reappears after extinction (bell rings, child gets fearful).
- Pavlov believed fear fades but doesn't disappear because there is a neural network in the brain.
Higher Order Conditioning
- Also called secondary conditioning.
- Adding another stimulus or a second conditioned stimulus.
- Example: Child watches TV, garage door opens, parents come home.
- TV show = excitement, garage door = excitement.
Limitations of Classical Conditioning
Doesn't always work; free will.
Instinctive Drift (Brelands): Instincts interfere with learned behaviors.
- Example: Raccoon trained to put a coin in a piggy bank Commercial, but instinct to dip their food in the water took over for a bank commercial.
Classical Conditioning Recap
- Identify UCS, UCR, NS, CS, CR in the high school classroom scenario.
- Any stimulus an organism can perceive is capable of eliciting any reaction the organism is capable of making.
- Classical conditioning is a reflex.
Operant Conditioning
- You act on your environment and get feedback, which determines your behavior.
Instrumental Conditioning (Edward Thorndike)
Forerunner to operant conditioning.
Probability of a response based on its consequences.
Random trial and error.
Law of Effect:
- Positive result = stamped in.
- Punishment = stamped out.
- How your brain is wiring itself.
Cats in puzzle boxes
- The cat was in the box, and it wanted to get the food. So, it had to push something, anything to make that happen.
- Trial and error, meowing a lot, pushing this or that, scratching on the side.
BF Skinner
- Everything we do is determined by our history of rewards and punishments.
- Operant conditioning: I operate or act on my environment and get feedback, which determines your future behavior.
- Skinner called it the science of behavior - behaviour is predictable.
- Walden Two (utopian community based on behavioral principles) and Beyond Freedom Dignity (theories) talk about no autonomous man, no free will.
Skinner Box (Operant Box)
- Rats or pigeons could push on things.
- Aversive types of stuff in it (shocks)
Skinner's Baby Tender
- Temperature and humidity regulated.
- Crank to change the sheets.
- Skinner raised his own child in it.
- Rumors that his child had psychological issues are urban legends.
Basics of Operant Conditioning
- Discriminative Stimulus: Signals that reinforcement is available (e.g., green light).
- Primary Reinforcers: Unlearned reinforcers needed for survival (food, water).
- Secondary Reinforcers: Conditioned reinforcers, learned (money).
- Accidental Reinforcement: Superstitions (lucky tie).
Types of Reinforcement
Enhance likelihood of repeating behavior:
- Positive Reinforcement
- Negative Reinforcement
Cause cessation of behavior:
- Punishment
Positive Reinforcement
- Getting a star, reward, trophy, good job.
Negative Reinforcement
- NOT punishment.
- Avoid/escape aversive event (unpleasant).
- Example: Fire alarm goes off, you leave the room.
- Example: Getting rid of someone who is annoying by actively ignoring them.
Punishment
- Punishment by Application: Something unpleasant happens (parent yells).
- Punishment by Removal: Taking away something liked (ball).
Examples
Positive Reinforcement: Works well with training animals.
Negative Reinforcement: Getting rid of something unpleasant.
- Wiping hands with towel
- Turning on windshield wipers
- Pushing 'stop' on alarm clock
- Spraying cat with water
Punishment by Removal: No more TV.
Punishment by Application: Soap in mouth.
Schedules of Reinforcement
Continuous: Reinforce every behavior (extinguishes quickly).
Ratio: Number of behaviors.
- Fixed Ratio: Set number of behaviors (every quiz counts double).
- Variable Ratio: Never know when you'll be reinforced (slot machines).
Interval: Time.
- Fixed Interval: Fixed set of time (paid every two weeks).
- Variable Interval: Never know when it's going to happen (fishing).
Shaping
- Successive approximations.
- Steps to get a person or animal to perform a total behavior.
- Shaping the behavior of swimming: You're gonna reinforce you when you breathe right. I'm gonna reinforce you when you do your arms right. I'm gonna reinforce you when you do your legs right.
- Once pigeon turns a little, it's reinforced. Turns a little more. Turns a little more
- Call this chaining.
Premack Principle
- Do something unpleasant to get a positive reinforcer.
- Example: Mow the lawn, and you get to use a car.
- Pigeons reading
- The task was to isolate an individual piece of behavior and see how that could be changed.
Social Media Operant Conditioning
- Advertisers spent an estimated 31,000,000,000 (31 Billion) on social media in 2016.
- Cell phones are slot machines operating on a variable schedule reward.
- Snapstreaks: Affecting kids' feelings of obligation to each other because they feel they have to respond and it maximises amount of time spent.