Psychology Notes: Learning — Classical Conditioning, Operant Conditioning, and Cognitive Approaches
Classical Conditioning
What is learning? Learning is the process by which associations between stimuli and responses are formed. Classical conditioning is a key type of learning that explains how a neutral stimulus can come to elicit a response after being paired with a stimulus that naturally brings about that response.
Habituation (brief note): A form of learning where there is a relatively permanent change in behavior due to experience. It is sometimes difficult to separate learned effects from innate (nature) versus experience (nurture). Infants show habituation: a decrease in response to a repeated stimulus. Most learning is more complex.
The Basics of Classical Conditioning
Ivan Pavlov’s classic experiments with dogs showed that a neutral stimulus (bell) could come to elicit salivation after being paired with an unconditioned stimulus (food) that naturally causes salivation.
Key terms:
Neutral stimulus (NS): a stimulus that does not naturally bring about the response of interest. Example: bell before conditioning.
Unconditioned stimulus (UCS): naturally brings about a response without learning (e.g., food).
Unconditioned response (UCR): natural response to the UCS (e.g., salivation to food).
Conditioned stimulus (CS): a previously neutral stimulus that, after pairing with the UCS, comes to induce a response.
Conditioned response (CR): the learned response to the CS.
Basic Process (Illustrated by the bell–meat example)
Before conditioning: bell (NS) does not produce salivation; meat (UCS) naturally produces salivation (UCR).
During conditioning: the bell (CS) is presented just before the meat (UCS).
After conditioning: the bell alone (CS) produces salivation (CR).
Summary:
Conditioned = learned; Unconditioned = not learned.
A CS leads to a CR after conditioning; an UCS leads to a UCR naturally.
An UCR and a CR are similar in appearance, but the UCR occurs naturally without training.
Applications and Implications
Watson and Rayner’s Little Albert study (1920) demonstrated conditioned fear using ethically questionable procedures; outcomes could not be replicated today. This work helped explain how:
Phobias can develop through conditioning.
PTSD may involve conditioned responses to stimuli.
Positive experiences can be conditioned (e.g., a song triggering memories).
Conditioning helps explain why drug addiction is difficult to treat (cue-conditioned cravings).
Extinction and Spontaneous Recovery
Extinction: a previously conditioned response decreases in frequency and can disappear when the CS is presented without the UCS repeatedly.
Spontaneous recovery: after a rest period, an extinguished conditioned response can reappear when the CS is presented again, though extinction tends to reoccur quickly.
Acquisition, Extinction, and Spontaneous Recovery (Figure 2 concept)
Acquisition: gradual strengthening of the CR during conditioning.
Extinction: CS alone leads to weakening of the CR.
Spontaneous recovery: after a break, CR may reappear, though it often diminishes again with further extinction.
Generalization and Discrimination
Stimulus generalization: after a response has been conditioned to a stimulus, similar stimuli evoke the same response. The greater the similarity, the greater the generalization.
Stimulus discrimination: when two stimuli are sufficiently different, only one evokes the CR; the organism learns to discriminate between them.
Beyond Traditional Conditioning: Preparedness and Taste Aversion
John Garcia showed that some organisms are biologically prepared to quickly learn to avoid foods that smelled or tasted like something that made them sick (learned taste aversion).
Characteristics of taste aversion:
Conditioning can occur with long delays between the conditioned stimulus and the outcome (e.g., up to 8 hours).
Conditioning can occur after just one pairing (one trial) and persist for long periods.
Operant Conditioning
Operant conditioning focuses on voluntary behaviors strengthened or weakened by consequences. The organism operates on the environment to obtain a desirable outcome, unlike the involuntary responses in classical conditioning.
Thorndike’s Law of Effect
Responses that lead to satisfying consequences are more likely to be repeated.
Over time, an organism forms a direct connection between the stimulus and the response through experience, often without conscious awareness.
Thorndike’s and Skinner’s Foundations
Puzzle boxes for cats demonstrated trial-and-error learning leading to increased probability of exiting the box when rewarded.
B. F. Skinner advanced operant conditioning with the Skinner box, where controlled environments allow the study of how organisms learn to press a lever to obtain reinforcement.
Reinforcement and Reinforcers
Reinforcement: any stimulus that increases the probability that a preceding behavior will be repeated.
Reinforcer: the stimulus that increases the likelihood of the behavior again.
Primary reinforcers: satisfy biological needs (e.g., food).
Secondary reinforcers: gain reinforcing power through association with a primary reinforcer (e.g., money).
Positive Reinforcers, Negative Reinforcers, and Punishment
Positive reinforcer: adds a stimulus to increase a behavior. Example: raise for good performance.
Negative reinforcer: removes an aversive stimulus to increase a behavior. Example: applying ointment to relieve itch increases likelihood of using it again.
Punishment: a stimulus that decreases the probability of a prior behavior.
Types of Punishment
Positive punishment: adds an unpleasant stimulus to decrease a behavior (e.g., shouting at a teenager who steals).
Negative punishment: removes a pleasant stimulus to decrease a behavior (e.g., restricting access to the car).
Reinforcement and Punishment Table (conceptual)
Increase in behavior (reinforcement)
Positive reinforcement: Example: Giving a raise for good performance → Result: Increase in good performance.
Negative reinforcement: Example: Applying ointment for itch → Result: Increase in use of ointment.
Decrease in behavior (punishment)
Positive punishment: Example: Yelling for stealing → Result: Decrease in stealing.
Negative punishment: Example: Grounding or removing privileges → Result: Decrease in breaking curfew.
Pros and Cons of Punishment
Pros: Can be appropriate for dangerous behaviors; temporary suppression can allow time to reinforce more desirable behaviors.
Cons: Often ineffective long-term; may lead to replacement with unwanted behaviors; can cause harm (fear, lowered self-esteem); does not teach replacement behaviors.
Schedules of Reinforcement: Timing Life’s Rewards
Continuous reinforcement: reinforcement after every instance of the behavior.
Partial (intermittent) reinforcement: reinforcement only some of the time; more resistant to extinction.
Fixed-ratio (FR): reinforcement after a specific number of responses.
Variable-ratio (VR): reinforcement after an average number of responses, unpredictable.
Fixed-interval (FI): reinforcement after a fixed amount of time since the last reinforcement.
Variable-interval (VI): time between reinforcements varies around an average.
Key idea: the schedule type (not just frequency) shapes response patterns and extinction rates.
Typical Outcomes by Schedule (Figure 4 concept)
FR: high, predictable rate of responding; more responses yield more reinforcement.
VR: high, steady rate of responding with high resistance to extinction.
FI: lower overall rate with a post-reinforcement pause; responding accelerates as the interval ends.
VI: steady, moderate to high rate of responding with little post-reinforcement pause.
Discrimination and Generalization in Operant Conditioning
Stimulus control training: behavior is reinforced in the presence of a specific discriminative stimulus (S^D) but not in its absence.
A discriminative stimulus signals the likelihood that reinforcement will follow a response.
Shaping: Reinforcing What Doesn’t Come Naturally
Shaping is teaching a complex behavior by rewarding closer and closer approximations of the target behavior.
Process:
Start by reinforcing any behavior similar to the target.
Then reinforce only closer approximations.
Finally reinforce only the desired response.
Shaping enables learning of complex behaviors that would not occur naturally, including those in lower animals.
Biological Constraints on Learning
Not all behaviors can be taught equally across species due to built-in limitations.
Learning is influenced by evolution and adaptive benefits that promote survival.
Predispositions: organisms may be genetically inclined to fear certain stimuli depending on their environment.
Classical vs Operant Conditioning (Overview)
Classical Conditioning
Basic principle: building associations between a conditioned stimulus (CS) and a conditioned response (CR).
Nature of behavior: involuntary, elicited by stimuli.
Order of events: UCS → UCR; CS → CR after conditioning.
Example: A child who develops fear after painful injections (UCS) when seeing the physician (CS).
Operant Conditioning
Basic principle: reinforcement increases, punishment decreases the frequency of the preceding behavior.
Nature of behavior: voluntary and operant, actions taken to obtain consequences.
Example: Studying hard and earning an A increases future studying.
Behavior Modification
A systematic technique to promote desirable behaviors and reduce unwanted ones.
Typical steps:
Identify goals and target behaviors.
Design a data-recording system and collect baseline data.
Select a behavior-change strategy.
Implement the program.
Maintain careful records after implementation.
Evaluate and adjust the program as needed.
Cognitive Approaches to Learning
Cognitive approaches study the thought processes that underlie learning, not just responses to stimuli.
Core idea: people form expectations that a reinforcer will follow a response, shaping learning beyond direct stimulus–response associations.
Cognitive Learning Theory
Emphasizes anticipation of consequences; learning involves mental representations and expectations, not just behavior.
Latent Learning
Learning that is not immediately demonstrated; a new behavior may become visible only when there is incentive to perform it.
Tolman and Honzik (1930) demonstrated cognitive maps in rats: rats developed a mental representation of the maze even without immediate reward.
Experimental insight: latent learning can occur without reinforcement; performance may reveal learning later when rewarded.
Latent Learning (Figure 1 concept)
Rats roam a maze with different reward schedules.
Rewarded groups show immediate improvement; experimental groups rewarded later show rapid improvement once rewarded, indicating latent learning.
Observational Learning: Learning Through Imitation
Albert Bandura emphasized learning by observing others (models), especially when direct trial-and-error learning is inappropriate.
Classic Bobo doll experiment showed children imitate aggressive behavior after observing models.
Social cognitive view: learning is influenced by observation, imitation, and social context.
Mirror neurons: neural basis for imitation and social learning.
Violence in Media: Do Media Messages Matter?
Most psychologists agree that high levels of media violence can increase aggression, though some studies show mixed results.
The American Academy of Pediatrics recommends limiting children’s exposure to violent media.
Mechanisms contributing to aggression:
Lowering inhibitions against aggression.
Making viewers more likely to interpret others’ actions as aggressive.
Desensitization to violence.
Real-life exposure to violence also correlates with increased aggression.
Cultural Influences on Learning
People develop learning styles influenced by cultural background and abilities.
Examples of learning styles:
Rational learners may master material by understanding the big picture.
Analytical learners may perform best when first analyzing component parts.
Summary Connections
Classical conditioning explains how associations form between stimuli and involuntary responses.
Operant conditioning explains how behaviors are shaped by consequences and reinforced over time.
Cognitive approaches emphasize internal mental processes, expectations, and social context in learning.
Real-world relevance: education, behavior modification, therapy, media effects, and cultural considerations all intersect with learning theories.
Key Formulas and Concepts (LaTeX)
Conditioning relationships:
ext{Neutral Stimulus (NS)}
ightarrow ext{UCS}
ightarrow ext{UCR}After conditioning: ext{CS}
ightarrow ext{CR}
Conditioning terminology in concise form:
ext{CS}
ightarrow ext{CR} ext{ (learned)}ext{UCS}
ightarrow ext{UCR} ext{ (unlearned)}
Delayed conditioning example (bell and meat): ext{CS (bell)}
ightarrow ext{UCS (meat)}
ightarrow ext{UCR (salivation)}Latent learning and cognitive maps (conceptual): no explicit equation, but involve internal representations of space and expectations.
8 hours interval example for taste aversion: ext{CS (tainted food)}
ightarrow 8 ext{ hours}
ightarrow ext{Sickness}
ightarrow ext{taste aversion}
Connections, Implications, and Real-World Relevance
Ethical implications: historical studies like Little Albert highlight the need for ethical standards in research with humans and animals.
Practical implications: understanding conditioning informs education, therapy (e.g., exposure therapy for phobias), behavior modification programs, and addiction treatment strategies.
The interplay of biology and learning: biological constraints and preparedness influence what kinds of associations are readily learned, guiding expectations about which teaching methods may be effective for different species or individuals.
Media literacy and cultural sensitivity: learning is shaped by media exposure and cultural backgrounds, suggesting tailored approaches in education and intervention.