EXTINCTION, RECOVERY AND AVOIDANCE - VOCABULARY FLASHCARDS
EXTINCTION
Reducing Responding – Everyday Examples
- Toddler crying at night ⟶ parent arrives (attention = reinforcer)
- Dog jumps up on owner ⟶ gets a treat
- Child mispronounces a word ⟶ receives praise
- Core issue: reinforcement is maintaining an undesirable operant response; goal of extinction is to eliminate that reinforcement contingency.
Basic Definitions
- Classical Conditioning (CC): Extinction = repeated presentations of the conditioned stimulus (CS) without the unconditioned stimulus (US).
- Instrumental / Operant Conditioning (IC): Extinction = performing the conditioned response (CR) in presence of the stimulus no longer yields the reinforcing outcome.
- Conceptually the opposite of acquisition; fits Rescorla-Wagner framework.
Extinction Dynamics in CC & IC
- CC example: Pavlov’s salivation – CS (bell) no longer followed by food; salivation diminishes.
- IC example: Light-on → lever-press → food dispenser blocked ✗; lever pressing eventually declines.
- Extinction burst: immediate, temporary increase in response rate when reinforcement is first removed.
- Response variability increases initially (Neuringer, Kornell & Olufs): subjects try novel topographies to regain reinforcement.
- Sustained non-reinforcement ⟶ gradual decrease in both rate and variability.
- Emotional by-products: frustration and possible aggression.
Rate of Extinction & Reinforcement Schedule
- Continuous Reinforcement (CRF) → rapid extinction once reinforcement halted.
- Partial Reinforcement → Partial-Reinforcement Extinction Effect (PREE): responding persists longer, extinction is slower.
Explanatory Theories for PREE
- Discrimination‐hypothesis:
• Organism fails to detect the shift from partial to zero reinforcement; change from CRF → extinction is obvious, change from partial → extinction is subtle. - Generalisation-decrement hypothesis:
• Extinction context resembles past partially reinforced context → response generalises. - Frustration theory:
• Organism learns to respond through frustration; frustration itself becomes a cue that signals reinforcement. - Key experimental comparison (Discrimination vs Generalisation):
• Group 1 – intermittent reinforcement → extinction versus Group 2 – CRF → CRF → extinction; persistent responding in Group 1 supports discrimination difficulty.
What Happens During Extinction? Unlearning vs Forgetting vs Interference
- Unlearning: active erasure of CS–US association (Rescorla-Wagner sets \lambda = 0 during extinction, yielding negative \Delta V).
- Forgetting: passive time-based decay; occurs without explicit non-reinforcement; assessed with no-extinction “forgetting controls”.
- Interference: new inhibitory learning masks but does not erase prior excitatory learning; depends on similarity between phases.
Interference Types
- Proactive: Phase 1 learning impairs recall of Phase 2.
- Retroactive: Phase 2 learning impairs recall of Phase 1.
- Cue vs Outcome interference:
• Cue: different cues compete for same outcome (A→B, C→B).
• Outcome: same cue paired with different outcomes (A→B, A→C). - Extinction = Retroactive Outcome interference: Phase 1 X+ (X→US), Phase 2 X− (X→no-US), Test X⟶CR? Retrieval disrupted by second‐learned association.
- More similarity (context, modality, timing) between the two phases → stronger interference.
RECOVERY FROM EXTINCTION
Key Message
- Extinction does not erase original learning; it installs an inhibitory S–R (or S–US) trace that is context-specific and fragile.
Forms of Recovery
- Spontaneous Recovery
- Response returns after mere passage of time.
- Attributed to instability of inhibitory or second-learned associations.
- Renewal
- Change of physical context reinstates responding.
- Variants:
• ABA: Acquisition in A, extinction in B, test in A.
• ABC: Acquisition A, extinction B, test C.
• AAB: Acquisition & extinction in A, test in B.
- Reinstatement
- Non-signalled presentations of the US alone restore responding when CS is next encountered.
- Context-specific: US must appear in, or generalise to, test context.
- Facilitated Reacquisition
- After extinction, re-pairings of CS–US are learned faster than initial acquisition (savings effect).
Bouton’s (1993) Retrieval Theory
- Frames extinction as retroactive interference; context (space & time) functions as a retrieval cue.
- If test context resembles extinction context ⟶ retrieve X–noUS (inhibition).
- If test context differs ⟶ proactive interference: earlier X–US resurfaces.
- Same mechanism explains spontaneous recovery (time = context change), renewal (space change), reinstatement (US presence alters contextual meaning).
Clinical Relevance
- Exposure therapy = extinction; relapse corresponds to spontaneous recovery, renewal, reinstatement.
- Research guides protocols to reduce relapse (Anderson & Insel 2006; Craske & Mystkowski 2006; Boschen, Neumann & Waters 2009).
ENHANCING EXTINCTION
Methods that Strengthen Extinction Learning
- Trial & session spacing: spaced > massed.
- Massive extinction: sheer number of CS-alone trials.
- Deepened extinction: extinguish CS in compound with a second excitor → larger prediction error \Delta V = \left(0 - [1.0 + 1.0]\right) = -2.0 ⟶ stronger inhibitory learning.
Methods that Improve Retrieval of Extinction Memory
- Multiple extinction contexts (Gunther, Denniston & Miller 1998): extinction in B,C,D produced less renewal vs single-context B.
- Increasing similarity between acquisition & extinction contexts: gradated context shifts reduce AAB renewal (Laborda et al., 2011).
- Reminder (extinction) cues during testing (Brooks & Bouton 1994) – external or internal signals present during extinction are re-presented at test to facilitate retrieval of inhibition.
PUNISHMENT
Definition & Forms
- Positive punishment: add aversive event after response (e.g. shock, scolding).
- Negative punishment: remove positive event (e.g. time-out, point loss).
- Laboratory practice: first establish high response rate with positive reinforcement, then pair response with punisher.
Key Variables Determining Effectiveness
- Intensity
- Higher intensity → greater suppression up to a limit.
- Graduated (escalating) punishers < single severe punisher (habituation to mild shocks).
- Schedule (Proportion of responses punished)
- Punishment need not be continuous; however, lower probability ⟶ weaker suppression.
- On FR reinforcement schedules, punishment lengthens post-reinforcement pause but little effect on ratio run.
- Contingency (response–punisher correlation)
- Greater contingency → stronger suppression (Goodall: instrumental‐contingent vs yoked CS).
- Contiguity (delay)
- Longer R–punisher delays dramatically reduce effectiveness (“Just wait till your father gets home…”).
- Competing Positive Reinforcement
- Behaviour may persist if still reinforced (drug self-administration models: cocaine or sucrose maintained lever-pressing despite shock).
- Punisher as Discriminative Stimulus for Reinforcer
- If reinforcement occurs only in presence of punishment, response increases (Holz & Azrin pigeons: pecking higher during punishment because food delivered only then).
- Availability of Alternative Reinforcement
- Providing other ways to obtain positive reinforcement (Thompson et al.) enhances suppression of the punished behaviour.
- Presence of explicit discriminative stimulus (S+)
- Instrumental response may return when S+ absent; punishment effects are stimulus-specific.
Theoretical Accounts
- Conditioned Emotional Response (CER) theory (Estes):
• Stimuli preceding response become CS; punishment = US; fear (CER) interferes with behaviour. - Negative Law of Effect (Thorndike extension):
• Punishment weakens S–R association; empirical work shows punishment can be ≈3× more potent than reinforcement.
Side-Effects & Ethical Considerations
- Aggression, escape/avoidance of punishing agent.
- Behaviour may only suppress in presence of punisher.
- Moral, welfare, legal constraints; reinforcement preferred where possible.
AVOIDANCE
Distinction from Punishment
- Punishment: positive contingency between R and shock → suppresses behaviour; “passive avoidance.”
- Avoidance: negative contingency – performing R prevents shock; increases behaviour; “active avoidance.”
Discriminated (Signaled) Shuttle Avoidance
- Trial initiated by warning CS.
• If animal responds before US → avoidance trial (no shock delivered).
• If fails → escape trial (shock occurs until R emitted). - Variants: two-way shuttle, one-way shuttle.
- Free-operant avoidance: no explicit CS; shocks at regular intervals unless response emitted.
Extinguishing Avoidance
- Response blocking / flooding: prevent avoidance response, forcing exposure to CS without US; extinction of avoidance.
Motivation Across Training
- Early: escape from actual shocks creates fear ➜ motivates responding.
- Later: fear decreases, but responding persists (fear & avoidance dissociate).
- Two-process theory: (1) Classical conditioning of fear to CS/context; (2) Instrumental negative reinforcement – response terminates fear CS.
- Safety-Signal Hypothesis: Feedback cues following avoidance act as conditioned inhibitors of fear and become secondary reinforcers sustaining behaviour (proprioceptive, spatial, tactile cues).
OVERALL SUMMARY
Extinction
- Negative CS–US contingency ⟶ response decline.
- Not unlearning; subject to spontaneous recovery, renewal, reinstatement.
- Inhibitory associations are context-specific; extinction strengthened via spacing, massed trials, deepened, multi-context procedures, reminder cues.
Punishment
- Positive response–US contingency; suppression depends on intensity, timing, contingency, concurrent reinforcement, alternative reinforcers.
- Explained by CER theory & negative law of effect; ethical & side-effect concerns.
Avoidance
- Active response to prevent US following CS.
- Initially driven by fear (escape); maintained by removal of fear or presence of safety signals.
- Extinguished through response blocking/flooding.
Mathematical / Formal Points Included
- Rescorla-Wagner error term: \Delta V = \alpha\beta ( \lambda - \sum V )
- Extinction sets \lambda = 0 → negative \Delta V (inhibitory learning).
- Deepened extinction predictive error example: (0 - [1.0 + 1.0]) = -2.0
- Outcome interference phase diagram: Phase 1 X–US, Phase 2 X–noUS.
Ethical & Practical Implications Spelled Out
- Clinical exposure therapy mirrors extinction; relapse resembles recovery phenomena.
- Punishment protocols require balancing efficacy and welfare; alternative reinforcement enhances outcomes.
- Flooding techniques derived from response blocking in avoidance research.