PSYC2081 Notes: Introduction, Pavlov & Skinner
Course context and structure
- PSYC2081: Learning and Physiological Psychology
- Topic: Introduction, Pavlovian (classical) conditioning and Skinnerian (instrumental/operant) conditioning
- Course logistics (from the transcript):
- Assessments: mid-term exam 15\%, weekly activities 5\%, research report 40\%, final exam 40\%
- Read the course outline on Moodle for all required information
- Core text reference: Pearce, J. M. (Animal Learning and Cognition), Chapters 1–4
- Lecturers include Dr. Nathan Holmes and a team of four other lecturers/tutors
- Foundational idea: learning as a process by which organisms adapt to their environments by using the past to predict the future and act in the present
Todays lecture overview
- Learning as adaptation: organisms use past experiences to predict future events and guide present behavior
- Introduction to Pavlovian (classical) conditioning
- Introduction to instrumental (operant) conditioning
- Emphasis on two types of learning:
- How we learn about relationships between stimuli or events in the world
- How we learn about the relationship between our actions and stimuli/events
Two types of learning (foundational distinction)
- Type 1: How we learn about relationships between stimuli/events in the world
- Why we are attracted to some stimuli/objects/events and repelled by others
- Type 2: How we learn about the relationship between our actions and stimuli/objects/events
- How actions influence subsequent stimuli or events
Pavlovian/classical conditioning: core ideas
- Classical conditioning involves learning that a neutral stimulus signals the arrival of a biologically relevant event (the US)
- In Pavlov’s terms:
- Unconditioned Stimulus (US): elicits a natural, unlearned response (UR)
- Unconditioned Response (UR): natural response to the US
- Conditioned Stimulus (CS): originally neutral stimulus that acquires the ability to signal the US
- Conditioned Response (CR): learned response to the CS
- Pavlov’s discovery: dogs salivated not only to food (US) but also to signals associated with food (e.g., bell, attendant) that predicted food
- Anticipatory salivation vs. salivation to food in the mouth: anticipatory salivation is a learned response to cues predicting food, whereas salivation to the food itself is a physiological reflex
- Key principle: the CS must be paired with the US for learning to occur; the dog does not need to salivate to obtain the food (the US occurs irrespective of the CR)
- Crucial generalization: a CS from any sensory modality can become associated with a US (visual, tactile, olfactory, interoceptive cues)
- Temporal relation: CS precedes US (often briefly); the association strengthens as the CS reliably predicts the US
Pavlov’s apparatus and paradigm (historical details)
- Apparatus: fistula implanted in the dog’s mouth to collect salivation; dog’s salivation recorded while food is delivered
- Setup allowed manipulation of stimuli (e.g., a bell or other signal) preceding the food delivery
- Central finding: learning occurs when a neutral cue signals food, leading to salivation in anticipation of the food
- Interpretation: anticipatory salivation reflects a mental state of affairs (the thought or expectation of food), distinct from reflexive saliva to actual food
- Importance: demonstrated that learning transfers control over a physiological reflex from the US to a predictive cue (the CS)
Pavlov’s learning phenomena (classical conditioning dynamics)
- External inhibition: salivation decreases when the CS is paired with a novel stimulus (e.g., light) that diverts attention away from the CS; strong CR when CS alone, weaker when CS is paired with a novel stimulus
- Extinction: gradual reduction and eventual elimination of the CR when the CS is repeatedly presented without the US
- Disinhibition: the extinguished CR reappears when the extinguished CS is presented together with a novel stimulus; attention to the novel stimulus temporarily reduces the inhibition of the CS-CR link
- Spontaneous recovery: after extinction, the CR can reappear when the CS is presented after a delay (days or weeks)
- Summary conceptualization: a CS can become associated with a US, producing a CR; several dynamics (extinction, disinhibition, spontaneous recovery, external inhibition) modulate the strength and expression of the CR
Pavlovian conditioning: cross-modal generality and implications
- Stimuli from all five sensory modalities can serve as CSs
- The CRs can be externally inhibited or extinguished depending on CS context and US availability
- Extinction does not erase the original learning; rather, it involves new learning that inhibits the CR, which can re-emerge under certain conditions (disinhibition or spontaneous recovery)
- Practical takeaway: classical conditioning explains how environmental cues become predictive of important events and how context and novelty can shape responses
Pavlovian conditioning: key statements and quotes from the lecture materials
- “The originally neutral conditioned stimulus (CS), through repeated pairing with the unconditioned stimulus (US), acquires the response originally given to the unconditioned stimulus.”
- “Stimuli from all five sensory modalities can be conditioned.”
- “External inhibition, extinction, disinhibition, and spontaneous recovery” are core phenomena observed in classical conditioning
- “Anticipatory salivation is a response to a mental state of affairs (the thought of food)”
Foundational individuals and studies in classical conditioning
- Watson & Raynor (1920): The case of Little Albert (early human conditioning example)
- Ivan Pavlov (1849–1936): Russian physiologist; Nobel Prize in Medicine for digestion studies; introduced conditioned reflexes; published Conditioned Reflexes (1927)
- Key idea: conditioning shifts control of reflexive responses from the US to CS via association learning
Skinner and instrumental conditioning (operant conditioning)
- Burrhus Frederic Skinner (1904–1990): influential figure; Harvard professor; studied instrumental (operant) conditioning using the “Skinner Box” to automate scheduling of events and responses
- Experimental setup (illustrated in the lecture): a rat in a Skinner box could press a lever to obtain food; the action (lever press) is instrumental in obtaining the reward
- Core distinction from Pavlov: in operant conditioning, the organism’s behavior (e.g., lever pressing) is emitted to obtain a consequence; the response is not elicited automatically by a CS but strengthened or weakened by consequences
- Crucial terminology: reinforcement and punishment govern the likelihood of the preceding behavior
Reinforcement and punishment: basic concepts (Skinner)
- Reinforcer definition: anything that increases the likelihood of the immediately preceding behavior (i.e., strengthens the contingent response)
- Schedules of reinforcement: the relationship between reinforcement and behavior over time
- Partial (intermittent) reinforcement is more effective for maintaining behavior than continuous reinforcement
Schedules of reinforcement: key types
- Fixed Ratio (FR_n): reinforcement after n responses
- Example: a worker paid for every n units produced
- Characteristic: rapid response with a post-reinforcement pause; the pause length is proportional to the number of responses required
- Variable Ratio (VR_n): reinforcement after an average of n responses
- Example: fishing (reward after a variable number of casts); leads to high and steady response rates with little post-reinforcement pause
- Fixed Interval (FI_t): reinforcement after a fixed period of time
- Example: exams or other timely contingencies; response rate tends to rise near the time of reinforcement
- Variable Interval (VI_t): reinforcement at variable time intervals
- Example: checking emails; responses tend to be steady and moderate
Characteristics and practical illustrations
- Ratio schedules tend to produce rapid responding
- FRn leads to post-reinforcement pauses; the pause length increases with n
- VRn yields high, steady rates of responding with little to no post-reinforcement pause
- VI and FI schedules produce more steady behavior than fixed-ratio schedules but with different temporal dynamics
- Real-world analogies:
- Fruit picker paid by number of baskets filled (FR-like)
- Fishing: rewards occur after variable numbers of casts (VR-like)
- Checking email: variable-interval-like reinforcement
Instrumental conditioning procedures and contingencies
- Reinforcement vs punishment framework (operant contingencies)
- Positive Reinforcement: a behavior produces a desirable outcome
- Negative Reinforcement: a behavior prevents an aversive outcome
- Punishment: a behavior produces an aversive outcome
- Omission training (negative punishment): a behavior prevents a desirable outcome
- These contingencies shape the likelihood of the target behavior across time
Comparison: Pavlov vs. Skinner (the interpretive contrast)
- Pavlovian conditioning (Classical): CS signaling the US elicits a reflexive CR; learning is about forming associations between stimuli
- Instrumental conditioning (Operant): the organism’s response leads to consequences; learning is about the relationship between actions and outcomes; behavior is emitted and shaped by consequences
- In Pavlovian conditioning, the CR is elicited by the CS; in operant conditioning, the behavior is emitted to obtain reinforcement
Ethical, philosophical, and practical implications
- Real-world and clinical contexts raise ethical considerations:
- Classic human conditioning experiments (e.g., Little Albert) highlighted ethical concerns about harm and consent
- Modern research emphasizes welfare and ethical guidelines for animal and human subjects
- The Ludovico Technique (fictional but discussed in lecture materials): a memory- and behavior-control procedure that subjects individuals to violent stimuli under drug-induced sickness to condition aversions; raises questions about autonomy, consent, and the potential for abuse in behavioral modification
- Hijacking (memory alteration for torture): Tracker jacker venom used to associate fear with memories and to alter those memories with hallucinations; highlights risks of memory manipulation and the ethics of coercive applications
- Real-world relevance: conditioning principles underpin advertising, education, habit formation, and behavior modification programs; ethical use requires careful consideration of autonomy, welfare, and potential harms
- Foundational principles connect to broader debates in philosophy of mind and ethics about mental states, agency, and the limits of behavioral control
Key definitions and shorthand references (summary)
- US: Unconditioned Stimulus — naturally elicits a response (e.g., food)
- UR: Unconditioned Response — natural, reflexive response to the US
- CS: Conditioned Stimulus — originally neutral, becomes predictive of the US
- CR: Conditioned Response — learned response to the CS
- FI_t: Fixed Interval schedule (time-based reinforcement at fixed intervals)
- VI_t: Variable Interval schedule (time-based reinforcement at variable intervals)
- FR_n: Fixed Ratio schedule (reinforcement after a fixed number n of responses)
- VR_n: Variable Ratio schedule (reinforcement after an average of n responses)
- Positive Reinforcement: R -> addition of a desirable outcome
- Negative Reinforcement: R -> removal of an aversive outcome
- Punishment: R -> introduction of an aversive outcome
- Omission training: R -> removal of a desirable outcome
- Extinction: CS on its own presented without US leads to decrease in CR
- Spontaneous recovery: CR reappears after time without exposure to CS
- Disinhibition: CR reappears when a novel stimulus is introduced during extinction
- External inhibition: a novel stimulus reduces CR expression when paired with CS
References and context from the lecture slides
- Pavlov (1849–1936): classical conditioning pioneer; Nobel laureate in physiology; early work on digestion and conditioned reflexes; Conditioned Reflexes (1927)
- Watson & Raynor (1920): Little Albert study (human conditioning example)
- Skinner (1904–1990): operant conditioning pioneer; Skinner box; reinforcement schedules; focus on action-outcome contingencies
- Pearce, J. M. (Year not specified in notes): Animal Learning and Cognition, Chapters 1–4
- Notable additional examples in lecture: The Ludovico Technique and Tracker jacker memory manipulation as ethical/practical concerns in learning and memory research
End-of-lecture takeaway
- The course integrates two complementary learning theories: classical conditioning (Pavlov) and operant conditioning (Skinner)
- Both frameworks explain how organisms adapt their behavior based on environmental contingencies, with broad implications for psychology, education, marketing, and ethics