Chapter 5 Notes: Learning – Comprehensive Study Guide (English)

5-1 Learning, Experience, and Change

Learning is a key concept in psychology, defined differently by major theories:
- Behaviorists: learning is a relatively permanent change in behavior that arises from practice or experience. They emphasize observable behavior and downplay cognition/choice.
- Cognitive psychologists: learning is the process by which organisms change the way they represent the environment because of experience. These changes influence behavior but do not fully determine it; learning is a mental process and may be shown by changes in behavior.
Real-world example from the chapter: Phoebe the dog and a parent learning to fetch via social/observational cues and reinforcement. The narrator learns that attention from Phoebe can be earned by running after the ball, but the dog’s behavior is not simply reflexive; cognition and values influence imitation.
Observational learning: learning from watching others’ behavior and its consequences, not just from direct reinforcement.
Direct vs. observational learning illustrated through family dynamics and dog training.
The chapter also introduces a short “What Do You Think?” activity that asks readers to judge statements about learning (e.g., taste aversion, battlefield/war-time conditioning, media effects) to engage critical thinking about learning concepts.

5-2 Classical Conditioning: Learning What Is Linked to What

Classical conditioning involves learning associations between events and is involuntary/automatic.
Core terms:
- Unconditioned Stimulus (UCS): a stimulus that naturally evokes a response.
- Unconditioned Response (UCR): an unlearned, automatic response to the UCS.
- Conditioned Stimulus (CS): a previously neutral stimulus that, after repeated pairing with the UCS, elicits a response.
- Conditioned Response (CR): a learned response to the CS.
Pavlov’s classic experiments with dogs:
- Initially, meat powder (UCS) provokes salivation (UCR).
- A neutral stimulus (tone/bell) is paired with the UCS. After several pairings, the tone becomes a CS that elicits salivation (CR).
- Pavlov called the salivation to the tone a conditioned reflex (CR); reflexes are unlearned responses that can be conditioned via association.
Cognitive perspective on classical conditioning:
- Classical conditioning reflects learning of relationships among events, enabling mental representations and predictions about the environment.
- The dog’s salivation to the tone is attributed to a mental connection between the tone and meat powder, not a direct awareness of food; organism-evoked information processing matters.
5-2a Explaining Classical Conditioning
- Behaviorists: learning is the result of publicly observable stimulus–response pairings.
- Cognitive theorists: learning involves mental representations and predictions about relationships among events.
5-2b Stimuli and Responses in Classical Conditioning
- In Pavlov’s experiments:
- UCS = meat powder (stimulus that triggers an unlearned reflex)
- UCR = salivation to meat powder
- Neutral stimulus initially elicits no CR (and may elicit an orienting response)
- After conditioning, the CS (tone) elicits the CR (salivation)
- Example mini-experiment: tone precedes meat powder by a brief interval, repeated several times; after conditioning, the tone alone elicits salivation.
- Key figure (Fig. 5.1) shows the transition from CS (neutral) to CS+UCS pairing to CR, illustrating acquisition, extinction, and recovery in a typical classical-conditioning sequence.
5-2a Extinction and Related Processes
- Extinction: the CS loses its ability to elicit the CR when it is presented repeatedly without the UCS.
- Extinction reflects adaptation to changing environments and updating mental representations (cognitive view).
- Pavlov’s data show extinction as a decline in CR across repeated CS-alone trials.
5-2c Extinction and Spontaneous Recovery
- Spontaneous recovery: after extinction, the CR can reappear when the CS is later presented again, suggesting that the learned association remains but is inhibited.
- Extinction does not erase learning; it suppresses the conditioned response. Evolutionary perspective: spontaneous recovery can be adaptive when environmental conditions recur (e.g., water holes that dry up and later fill again).
5-2d Generalization and Discrimination
- Generalization: a CR is elicited by stimuli similar to the CS; functional equivalence can be adaptive.
- Discrimination: learning to respond differently to stimuli that are not predictive of the UCS.
- Pavlov’s dog generalization: salivation to circles, and later to close geometric figures; discrimination challenged with more complex shapes (extinction experiments can influence generalization vs discrimination).
- Spontaneous recovery can occur for extinguished CRs as time passes.
5-2e Higher-Order Conditioning
- A neutral stimulus becomes a CS after being paired with a stimulus that is already a CS (first-order CS).
- Example: light (initially neutral) paired with a tone (first-order CS) eventually elicits a response similar to the tone’s CR (higher-order CS).
- Practical consequence: fear or other responses can spread from a primary trigger to related cues (e.g., stove word or room cues linked to a fear-inducing event).

5-3 Applications of Classical Conditioning

Classical conditioning applications include fear conditioning and fear reduction (counterconditioning) within behavior therapy.
5-3a Taste Aversion
- A single nauseating experience can produce a long-lasting taste aversion, illustrating the strong one-trial learning capability of taste associations.
- Evolutionary perspective: taste aversion has survival value, helping organisms avoid poisonous foods.
- Garcia & Koelling (1966) study: CS was a taste (sweetened water), a light, and a clicker presented together; nausea (UCS) vs electric shock produced different aversions depending on the UCS type:
- Nausea produced aversion to taste only (sweetened water) and not to light or clicker.
- Shock produced aversion to light and clicker but not to taste.
- Implication: organisms are biologically prepared to form particular CS–UCS associations that are ecologically relevant (not all pairings are equally learnable).
5-3b Little Albert: Classical Conditioning of Emotional Responses
- John B. Watson and Rosalie Rayner trained Little Albert to fear a white rat by pairing rat exposure with loud clanging noises; fear generalized to other furry objects (rabbit, coat collar).
- Preparedness: humans/primates are biologically prepared to fear certain stimuli (snakes, thunder, heights, etc.) due to evolutionary pressures.
- Ethical concerns: the famous Little Albert study is widely criticized for ethical reasons, and counterconditioning was not pursued in the original study.
- Preparedness vs conditioning: preparedness suggests certain fears are more easily learned due to evolutionary history.
5-3c Counterconditioning
- Counterconditioning aims to replace an undesirable conditioned response with a compatible, more desirable response (e.g., relaxing instead of fear).
- Mary Cover Jones (1924) counterconditioned fear in a child named Peter by gradually exposing him to a feared rabbit while the child consumed cookies, creating a positive association with the feared object.
- Systematic desensitization vs flooding: systematic desensitization gradually increases exposure while maintaining relaxation; flooding exposes the client to the fear stimulus all at once, often leading to rapid extinction but high discomfort.
Preparedness and multiple learning routes
- Evolutionary perspective suggests combining conditioning with conscious cognition provides greater adaptability for survival.

5-4 Operant Conditioning: Learning What Does What to What

Key idea: operant conditioning involves learning to engage in behaviors that produce desirable outcomes, emphasizing consequences rather than mere associations.
Classical conditioning vs operant conditioning:
- Classical conditioning: learning to anticipate events; reflexive/automatic responses.
- Operant conditioning: learning to perform actions based on consequences; voluntary behaviors.
5-4a Edward L. Thorndike and the Law of Effect
- Thorndike’s puzzle boxes with cats demonstrated that responses followed by satisfying consequences become more likely in that situation (reinforcement), while responses followed by unpleasant outcomes are less likely (punishment).
- Law of Effect: pleasant outcomes stamp in responses; unpleasant outcomes stamp out responses.
5-4b B. F. Skinner and Reinforcement
- Skinner extended operant conditioning using the Skinner box (operant chamber) to study how reinforcement shapes behavior.
- Projects such as Project Pigeon explored using reinforced pigeons in guided missiles, though the project did not come to fruition.
- Distinction: operant conditioning focuses on how organisms operate on their environment to produce consequences; behavior is influenced by reinforcement rather than by matching to a preexisting response.
5-4c Methods of Operant Conditioning
- Emphasis on discrete behaviors as units of analysis (e.g., lever pressing).
- Behavior is shaped by reinforcement to achieve target behaviors; knowledge of results can serve as reinforcement.
- Direct guidance (prompting) can help shape initial responses, especially in humans.
- Reinforcement rather than punishment should be used to promote desired behaviors.
5-4d Types of Reinforcers
- Primary reinforcers: biologically based (food, water, warmth, sex) that satisfy biological needs.
- Secondary (conditioned) reinforcers: acquire value through association with primary reinforcers (e.g., money).
- Positive reinforcers: increase the probability of a behavior by presenting a pleasing stimulus.
- Negative reinforcers: increase probability of a behavior by removing an aversive stimulus.
- Immediate reinforcers are typically more effective than delayed reinforcers.
- Discrimination between reinforcers arises when signals indicate when reinforcement is available (discriminative stimuli).
5-4e Extinction and Spontaneous Recovery in Operant Conditioning
- Extinction occurs when reinforcement stops; the previously reinforced behavior gradually diminishes.
- Spontaneous recovery can occur after a period without reinforcement, with the behavior temporarily reappearing.
- Self-reinforcement strategies enable individuals to reinforce desired behavior in the absence of external reinforcement.
5-4f Reinforcers vs Rewards and Punishments
- Reinforcers are defined by their effects (increase the frequency of behavior).
- Rewards/punishments refer to the subjective experience and may not accurately reflect the functional impact on behavior.
- Punishments can rapidly suppress undesirable behavior but may have adverse side effects; negative punishment examples include time-outs and removal of privileges.
5-4g Schedules of Reinforcement
- Continuous reinforcement: every correct response is reinforced.
- Partial (intermittent) reinforcement is more resistant to extinction and includes four basic schedules:
- Fixed-Interval (FI): reinforcement after a fixed amount of time; pattern shows scallops with bursts of responding as reinforcement time approaches.
- Variable-Interval (VI): time between reinforcements varies; results in steady, moderate response rate.
- Fixed-Ratio (FR): reinforcement after a fixed number of correct responses; high, steady response with a post-reinforcement pause.
- Variable-Ratio (VR): reinforcement after a variable number of responses; high, steady response with no predictable pause (common in gambling).
5-4h Discriminative Stimuli
- Stimuli that signal when reinforcement is available (e.g., a green light signaling that a pigeon’s peck will be reinforced).
- Discriminative stimuli help organisms learn which behaviors will be reinforced and which will not.
5-4i Applications and Real-World Relevance
- Partial reinforcement schedules and the persistence of gambling behaviors (slot machines).
- Real-life examples include work incentives, classroom incentives, and social behavior modulation.

5-5 Applications of Operant Conditioning

5-5a Biofeedback Training (BFT)
- Based on operant conditioning, BFT uses feedback to reinforce autonomic responses (e.g., alpha brain waves, muscle relaxation, lowered heart rate, reduced blood pressure).
5-5b Shaping
- Shaping reinforces successive approximations toward a target behavior (e.g., teaching complex skills like dancing or driving a standard-shift car by rewarding closer and closer steps).
5-5c Behavior Modification
- Uses reinforcement principles to increase desirable behavior and extinguish undesirable behavior, including strategies like time-outs and ignoring misbehavior when possible.
5-5d Programmed Learning
- B. F. Skinner’s programmed learning breaks complex tasks into small steps, reinforcing correct responses and providing immediate feedback; designed to minimize errors and enable self-paced learning.
- Controversy/Note: the notion that you must make mistakes to learn is challenged by programmed learning, which emphasizes error-minimization and structured progression.

5-6 Cognitive Factors in Learning

Classical and operant conditioning can be understood through cognitive lenses as well; human learning involves mental representations, schemas, and information processing.
5-6a Latent Learning: Forming Cognitive Maps
- E. C. Tolman showed that rats can learn maze layouts even without reinforcement and form cognitive maps; when a reward is introduced, they reach the goal quickly, indicating prior learning without explicit reinforcement.
- Tolman distinguished between learning (knowledge acquisition) and performance (expressed behavior).
5-6b Contingency Theory (Rescorla)
- Learning occurs when the CS provides information about the likelihood of the UCS; not just when they are paired in time.
- Experimental variations by Rescorla showed that a CS followed by UCS consistently yields learning, while random or unpaired pairings do not produce the same predictive learning.
- Key concept: the predictive value of the CS is critical for learning.
5-6c Observational Learning
- Albert Bandura and colleagues demonstrated that people can acquire new behaviors by observing others (models) and that witnessing models being reinforced increases the likelihood of imitation (vicarious reinforcement).
- Observational learning includes attention, retention, reproduction, and motivation; not all observed behaviors are imitated unless motivated.
- Model organisms and concepts like mirror neurons suggest an automatic mechanism for imitation and social learning.
5-6d Violence in the Media and Aggression
- The debate on whether media violence causes real-world aggression has persisted for decades.
- Research suggests media violence is a risk factor for increased emotional arousal, aggressive thoughts, and aggression, though not all viewers become aggressive.
- Mechanisms include: observational learning (models of aggression), disinhibition (learning that aggression is allowed or rewarded), increased emotional arousal, priming of aggressive thoughts, habituation (reduced sensitivity to real violence), and provision of aggressive scripts.
- Violent video games, in particular, involve viewer participation and may strengthen aggressive tendencies more than passive media.
- Quantitative findings cited include meta-analytic reviews (e.g., Anderson et al., 2017) and work on gun violence in films (Bushman et al., 2013) and child/adolescent aggression patterns.
- The literature highlights moderating variables such as age, gender, achievement, parental style, and family environment in the relationship between media violence and aggression.
- Practical guidance for parents/educators includes teaching kids that media violence is not real, that most people resolve conflicts nonviolently, and that real-life consequences of violence are harmful.
Mirror Neurons
- Mirror neurons fire both when an action is performed and when it is observed, supporting imitation and the understanding of others’ intentions; linked to early imitation in infants and contagious yawns/laughter.

Chapter Takeaways and Key Terms (Highlights)

Learning: relatively permanent change in behavior arising from experience (behaviorist view) or mental representations shaped by experience (cognitive view).
Classical Conditioning (Pavlov): associating a neutral stimulus (CS) with an unconditioned stimulus (UCS) to elicit a conditioned response (CR).
Unconditioned Stimulus/Response (UCS/UCR) vs Conditioned Stimulus/Response (CS/CR).
Extinction, Spontaneous Recovery, Generalization, Discrimination, Higher-Order Conditioning.
Taste Aversion: one-trial learning with evolutionary significance; can involve long delays between CS and UCS; ecological plausibility of prepared associations.
Little Albert and Preparedness: emotional conditioning; preparedness to fear certain stimuli; ethical issues in historical studies.
Counterconditioning, Flooding, Systematic Desensitization: fear reduction strategies.
Operant Conditioning (Skinner): learning through consequences; reinforcement vs punishment; primary vs secondary reinforcers; positive vs negative reinforcement.
Schedules of Reinforcement: fixed/variable interval/ratio; implications for persistence and behavior patterns; sheer unpredictability (variable) often sustains behavior (e.g., gambling).
Observational Learning and Bandura: learning via models; vicarious reinforcement; the role of attention, retention, and motivation; mirror neurons and social imitation.
Media Violence and Aggression: observed associations between media exposure and aggression; multiple mechanisms and moderating factors; evidence supports a link as a risk factor.

Quick Reference Terms (in-brief)

UCS, UCR, CS, CR: core classical conditioning terms.
Extinction: CS no longer elicits CR without UCS.
Spontaneous recovery: reappearance of a weakened CR after time.
Generalization/Discrimination: responding to similar vs. dissimilar stimuli;
Latent learning: learning that is hidden until there is motivation to demonstrate it.
Contingency theory: CS predicts UCS; learning depends on informative cues.
Observational learning: learning by watching others; vicarious reinforcement.
Mirror neurons: neurons that link perception and action to support imitation.
Primary/Secondary (Conditioned) Reinforcers: biologically based vs learned reinforcers.
Positive/Negative Reinforcers: frequency-increasing effects via presentation/removal of stimuli.
Schedules: FI, VI, FR, VR; implications for persistence and rates of responding.
Biofeedback, Shaping, Behavior Modification, Programmed Learning: applied learning techniques using reinforcement principles.

Chapter Review Practice prompts (concept checks)

Define learning from both behaviorist and cognitive perspectives.
Explain how Pavlov’s experiment demonstrates acquisition, extinction, and spontaneous recovery.
Distinguish between UCS/UCR and CS/CR, and provide everyday examples.
Describe taste aversion and its evolutionary significance.
Compare and contrast flooding vs systematic desensitization.
Explain Thorndike’s Law of Effect and how it differs from Skinner’s emphasis on reinforcement.
Differentiate primary vs secondary reinforcers and positive vs negative reinforcement.
Summarize four basic schedules of reinforcement and real-life examples.
Outline Bandura’s observational learning theory and the role of modeling and mirror neurons.
Discuss the evidence for and against media violence contributing to real-world aggression and list potential moderating factors.