Learning and Conditioning
Classical Conditioning
Definition: A learning process in which a previously neutral stimulus becomes associated with another stimulus through repeated pairing with that stimulus; also known as Pavlovian conditioning.
Key terms:
\text{UCS} (Unconditioned Stimulus): a stimulus that, without conditioning, will elicit a predictable response.
\text{UCR} (Unconditioned Response): a response that, without conditioning, results predictably from the UCS.
\text{CS} (Conditioned Stimulus): a stimulus that will elicit a predictable response because of its previous pairing with a UCS.
\text{CR} (Conditioned Response): a predictable response to a CS that has been learned through its pairing with a UCS.
Acquisition: the initial stage in classical conditioning; the phase during which a neutral stimulus becomes associated with a UCS and begins to elicit the CR.
Extinction: presenting the CS without following it by the UCS; extinction is not forgetting but the learning that the CS no longer predicts the UCS.
Spontaneous Recovery: after extinction, the CR can reappear when the CS is presented again following a rest period; this suggests the extinction association can be forgotten but not erased.
Generalization: a new CS similar to the original CS will elicit the CR to a degree proportional to similarity (e.g., similarities in pitch, color, shape).
Discrimination: the learned ability to distinguish between a CS and other stimuli that do not signal a UCS; exposure to different CS–UCS pairings tunes the learner to respond only to the reinforced CS.
Non-associative learning:
Habituation: reduced response probability to a non-changing, inconsequential stimulus.
Sensitization: increased response probability after a strong or salient stimulus.
Related concepts:
Associative learning: learning about the relationship between multiple stimuli (two stimuli, or a response and its consequences).
Non-associative learning vs associative learning: non-associative involves a single stimulus; associative involves relationships between stimuli.
Does classical conditioning apply to humans/animals? It does; examples include the classic Little Albert study (ethics later discussed).
Classic examples:
Pavlov’s dogs: neutral stimulus (light or bell) paired with food (UCS) leads to salivation (CR) in response to the CS alone.
Kramer Ambulance and tuna (example on slides): CS = can opener; UCS = tuna; UCR = coming running; CR = coming running to the CS after conditioning.
Little Albert: CS = white rat; UCS = loud sound; CR = fear/freaking out in response to the rat after conditioning; ethical concerns noted.
Mike and the Cocktail Shrimp: CS = cocktail shrimp; UCS = bad shrimp due to bacteria; CR = nauseous reaction; illustrates aversive conditioning.
Conceptual notes:
The conditioning process demonstrates how reflexive responses can be elicited by previously neutral stimuli when paired with biologically relevant stimuli.
CS can be any stimulus that comes to elicit a CR after conditioning (e.g., light, tone, object).
Applications include advertising (conditioning plates with brands), therapy (extinction of fear responses), and education (linking stimuli to positive outcomes).
Ethics and philosophy:
The “Twisted Story of Little Albert” is cited to discuss ethical concerns in early conditioning experiments (informed consent, potential distress, lasting harm).
Behaviorism as a framework emphasizes observable behavior and minimizes inner mental states; this view has evolved and is debated within modern psychology.
Operant Conditioning
Basic idea: operant conditioning involves behavior that operates on the environment to produce consequences; behaviors are strengthened if followed by a reinforcer and weakened if followed by a punishment.
Thorndike’s Law of Effect (historical foundation):
Quote (paraphrased): Of several responses in a given situation, those followed by satisfaction tend to recur; those followed by discomfort tend to be weakened; the greater the satisfaction or discomfort, the stronger the change in the bond between response and situation.
Source: Edward L. Thorndike (1911).
Skinner’s contribution: elaborated Thorndike’s Law of Effect and developed the behavioral technology of operant conditioning, including the Skinner box as a controlled environment for studying reinforcement and punishment.
The Skinner Box (operant chamber): typical components include
Pellet dispenser, lever, food cup, signal lights, dispenser tube, and an electrocution grid as needed for punishment experiments.
Core terms:
Reinforcer: any event that strengthens the behavior it follows.
Punisher: any event that weakens or decreases the likelihood of the behavior.
Shaping: an operant conditioning procedure in which reinforcers guide behavior toward closer and closer approximations of a desired goal (often used in animal training and human education).
Reinforcement and Punishment in detail:
Reinforcement increases the likelihood of the preceding behavior.
Punishment decreases the likelihood of the preceding behavior.
Positive reinforcement: presenting a desirable stimulus to increase a behavior.
Negative reinforcement: removing an aversive stimulus to increase a behavior.
Positive punishment: presenting an aversive stimulus to decrease a behavior.
Negative punishment: removing a desirable stimulus to decrease a behavior.
Notation and quick synthesis:
Positive Reinforcement (PR): add something good to increase a behavior.
Negative Reinforcement (NR): remove something aversive to increase a behavior.
Positive Punishment (PP): add something aversive to decrease a behavior.
Negative Punishment (NP): remove something desirable to decrease a behavior.
Schedule of reinforcement (how often reinforcement/punishment is delivered):
Continuous reinforcement: reinforce the desired response every time it occurs.
Partial (intermittent) reinforcement: reinforce the response only part of the time; leads to slower acquisition but greater resistance to extinction.
Ratio vs Interval schedules:
Ratio schedules depend on the number of responses.
Fixed Ratio (FR): reinforcement after a fixed number of responses; higher response rates with greater output (e.g., piecework pay).
Variable Ratio (VR): reinforcement after an unpredictable number of responses; high and steady response rates; very resistant to extinction (e.g., gambling, fishing).
Interval schedules depend on the time elapsed between reinforcements.
Fixed Interval (FI): reinforcement after a fixed amount of time; responses tend to increase as the time for reward nears (scalloped pattern).
Variable Interval (VI): reinforcement at unpredictable time intervals; produces slow, steady responding (e.g., occasional pop quizzes).
General implications of schedules:
Partial reinforcement tends to produce greater resistance to extinction than continuous reinforcement.
The type of schedule interacts with how quickly a behavior is learned and how robust it is to extinction.
Shaping and real-world applications:
Shaping is widely used in animal training and human behavior modification therapies; repeatedly reinforcing closer approximations to a target behavior.
Conditioned reinforcers:
Primary reinforcers: biologically based rewards (e.g., food, water).
Conditioned (secondary) reinforcers: learned through association with primary reinforcers (e.g., money, good grades, praise).
Conditioned reinforcers show how abstract behaviors can be learned through association with primary rewards.
Generalization and Discrimination in operant contexts:
Generalization: a behavior occurs in similar contexts or in response to similar cues that resemble the trained stimulus.
Discrimination: learning to respond differently to different, but similar, stimuli based on differential reinforcement.
Practical and ethical considerations:
Behavioral modification therapies and education use shaping, reinforcement schedules, and deliberate discrimination to influence behavior.
Historical notes include Skinner’s wartime Project Orcon (pigeon-guided missiles) and broader discussions about the role of behaviorist approaches in society (e.g., educational and social design, “Walden Two” discussions).
Deep Dive into Core Concepts and Examples
Non-associative vs associative learning:
Non-associative learning involves a single stimulus (habituation, sensitization).
Associative learning involves the relationship between stimuli and/or responses and their consequences (classical/operant conditioning).
Connections to foundational principles:
Classic conditioning demonstrates how reflexive responses can be elicited by neutral stimuli when paired with biologically relevant stimuli.
Operant conditioning extends learning to voluntary behaviors that affect the environment and become more or less likely based on consequences.
Real-world relevance and applications:
Advertising uses classical conditioning to pair brand cues (CS) with positive affect or desired outcomes (UCS) to elicit favorable responses (CR).
Education and workplace training use operant conditioning concepts (reinforcement schedules, shaping) to encourage desirable behaviors and discourage undesired ones.
Therapy uses behavioral principles to modify maladaptive behaviors (e.g., shaping new coping responses, using reinforcement schedules to promote desirable habits).
Ethical considerations:
Early conditioning experiments (e.g., Little Albert) raised concerns about informed consent and harm to participants.
The behaviorist emphasis on observable behavior has shaped debates about the role of internal mental states, autonomy, and freedom in shaping human behavior.
Key figures and their contributions:
Ivan Pavlov: classical conditioning; foundational concepts; methodology with digestive secretions in dogs.
John B. Watson: originator of behaviorism; advocated studying observable behavior and minimizing reference to internal mental states.
Edward Thorndike: Law of Effect; laid groundwork for operant conditioning and behavioral psychology.
B.F. Skinner: elaborated operant conditioning; developed the Skinner box; emphasized reinforcement/punishment and applied behavioral analysis.
Notable demonstrations and anecdotes:
Pavlov’s dog paradigm; response to CS after pairing with UCS (food).
Kramer Ambulance and tuna: CS (can opener) leads to CR (come running) after UCS (tuna).
Little Albert: CS (white rat) paired with loud noise (UCS) leading to CR (fear); highlighted ethical concerns.
Mike and cocktail shrimp: CS (cocktail shrimp) paired with aversive UCS (bad shrimp) leading to barfy response (CR).
Advertising case (Meow #3): exercise in applying classical conditioning to marketing strategies; analyze CS, UCS, UCR, CR in real ads.
Equations and concise formulas to remember:
Classical conditioning framework:
UCS → UCR
CS + UCS → CR
After conditioning: CS → CR
Extinction concept:
CS presented without UCS → CR diminishes over time
Spontaneous recovery:
After extinction and a rest period, CS → CR may reappear
Schedules of reinforcement (conceptual):
Continuous reinforcement: reinforcement after every correct response.
Partial reinforcement: reinforcement after some responses; slower acquisition but greater resistance to extinction.
Schedule definitions (summary):
Fixed Ratio (FR): after a fixed number of responses: reinforcement after every N responses.
Variable Ratio (VR): after an unpredictable number of responses: high, steady responding.
Fixed Interval (FI): after a fixed amount of time: response increases as time for reward nears.
Variable Interval (VI): after unpredictable time intervals: slow, steady responding.
Summary takeaways:
Classical conditioning is about linking stimuli to evoke automatic responses; acquisition occurs via repeated pairings; extinction and spontaneous recovery illustrate that learning can decay and reemerge.
Operant conditioning focuses on voluntary behavior shaped by consequences; reinforcement strengthens behavior, punishment weakens it; shaping guides behavior toward desired goals.
Schedules of reinforcement critically shape how quickly learning occurs and how resistant it is to extinction; partial reinforcement often yields more durable behavior than continuous reinforcement.
Real-world applications span education, parenting, advertising, therapy, and organizational behavior, but ethical considerations must govern experiments and interventions.
Quick Reference: Key Terms List
UCS: Unconditioned Stimulus
UCR: Unconditioned Response
CS: Conditioned Stimulus
CR: Conditioned Response
Acquisition, Extinction, Spontaneous Recovery, Generalization, Discrimination (Classical Conditioning)
Reinforcer, Punishment, Shaping (Operant Conditioning)
Positive/Negative Reinforcement, Positive/Negative Punishment
Primary vs Conditioned (Secondary) Reinforcers
FR, VR, FI, VI (Reinforcement Schedules)
Partial Reinforcement, Continuous Reinforcement
Skinner Box, Thorndike’s Puzzle Box
Measurable examples: Pavlovian conditioning, Little Albert, Mike and cocktail shrimp, Kramer Ambulance story, Meow #3 advertisement exercise
Ethical implications: early experiments vs modern standards; behaviorism’s stance on mental processes; real-world implications for manipulation and education