AR

yas- CM2

Lecture 2: Blocking and Extinction (beyond prediction error!!!)

Contact Information

  • Email: Christopher.mitchell@plymouth.ac.uk

  • Office: PSQ B229

  • Office Hours: Wednesdays 2-3pm, Fridays 2-3pm (subject to change)

  • Check-in code: Prof Chris Mitchell

Overview

  • First Half: Blocking

    • Reminder of blocking

    • Prediction error: Rescorla Wagner

    • An alternative ‘attentional’ account of blocking

  • Second Half: Extinction

    • Protection from extinction

    • Context specificity of extinction

  • Reading: Pearce (2008) Animal Learning and Cognition, Chs 3 and 5

Experiment: Conditioned Response to Colored Squares

  • Procedure:

    • Participants are presented with one or two colored squares.

    • Sometimes the presentation is followed by an electric shock.

    • Task: determine which colors lead to shock.

  • Shock Intensity: Double shock if both colors present lead to shock individually.

Standard Blocking Result

  • Presentation of colors (e.g., Brown, White, Orange, Purple) followed by shock.

  • Blocking Occurs: Prior training with one stimulus (A) blocks learning about a second stimulus (B) when they are presented together (AB).

    • Example: Brown > Purple

Associative Links

  • Blocking is usually explained as a failure to form a link between cue B and the US.

    • Example: Orange -> Shock!, Purple (no link formed during blocking).

Prediction Error in Blocking

  • Design: A+ then AB+ ('+' indicates presence of the US, A & B are Conditioned Stimuli)

  • By the end of A+ trials, the US is fully predicted by A; therefore, there is no prediction error.

  • During AB+ trials, the US is not surprising because A already predicts it, so B isn't processed.

  • No B-US association is learned because AB+ trials are the only time B appears.

  • Learning about B is 'blocked' by A.

Importance of Blocking

  • Before blocking, it was thought that associations form simply by pairing two stimuli (Hebbian learning).

  • In blocking, Cue B -> US on AB+ trials, but little learning occurs.

  • Demonstrates that learning isn't just about pairing stimuli; prediction error matters.

  • Central to contemporary learning theory.

Main Point: Prediction Error

  • No prediction error on AB+ trials after A+ trials because the US (+) is not surprising.

US Potency

  • The US becomes unsurprising and less potent over repeated A+ trials.

  • Comparison: Blocking (A+ then AB+) vs. Overshadowing (CD+).

    • In overshadowing, the US is surprising on initial CD+ trials.

Formalizing Prediction Error

  • Surprise: The difference between what you predict will happen and what actually happens.

Rescorla-Wagner Model (1972)

  • \Delta V = \alpha(\lambda - \Sigma V)

    • \Delta V = learning: change in associative strength on this trial (blocked cue B on AB+ trials)

    • \lambda = total associative strength supported by the outcome (US)

    • \Sigma V = the expected outcome (given all cues present A & B)

    • \alpha = attention to cues (salience of the CS); this doesn’t change with learning according to R&W

  • On AB+ trials, \lambda = \Sigma V because of the pretrained 'blocking' cue A. Thus, no learning occurs for B.

Main Point of Rescorla-Wagner Model

  • Learning occurs only when there is prediction error (i.e., when you are surprised by the outcome).

Alternative to R&W: Attention to CSs

  • Signals of important events are attended to especially well.

  • The salience of the CS “\alpha” is not fixed (contrary to R&W’s claim); it can change with experience/learning.

  • Blocking might occur because attention to B decreases due to the presence of the better predictor A.

Example of Blocking: Plymouth Argyle

  • Plymouth Argyle loses every week.

  • Buy Bellingham -> win every time he plays.

  • Buy another player (Palmer) -> win again.

  • How good is Palmer? Evaluated under conditions of blocking.

Blocking and Divided Attention

  • Will Plymouth win? Look for Bellingham (attend to him).

  • Palmer processed under divided attention.

  • At test, Bellingham's contribution is clear, but Palmer's is less so due to blocked learning.

Mackintosh (1975): Blocking Due to Changes in \alpha

  • A+ then AB+

  • The most predictive cues will increase in salience/\alpha (e.g., cue A).

  • Relatively less predictive cues will decrease in \alpha (e.g., cue B on AB+ trials).

  • Not much learning about cue B.

Measuring Attention: Eyetracking

Eye Gaze in a Learning Task (Beesley and Le Pelley, 2011)

  • No shocks.

  • Present stimuli on the eyetracker screen.

  • Cues: chemicals (Addexium, Rezaline).

  • Outcomes: symptoms in Mr. X (itchiness, nausea).

  • Do participants look at the word 'Addexium' (the CS)?

Experimental Design

  • Stages

    • Stage 1: A – o1; C – o1; E – o2; G – o2; I – o1; L – o1; O – o2; R – o2

    • Stage 2: AB – o1; CD – o1; EF – o2; GH – o2; JK – o1; MN – o1; PQ – o2; ST – o2

    • Stage 3: BK – o3; DN – o4; FQ – o3; HT – o4; BN?; DK?; FT?; HQ?

    • Test: BT?; DQ?; FN?; HK?

Results: Eye Gaze in Stage 2 AB+/JK+ Trials

  • Attention to cue A is high (good predictor of the outcome).

  • Attention (eye gaze) to 'blocked cue' B is reduced compared to overshadowing controls J and K.

  • Less attention/processing of Cue B would explain why there is less learning about that cue on AB-O1 trials.

Blocking Summary and Conclusions

  • Blocking is seen in humans (e.g., Beesley & Le Pelley, 2011) and in non-humans (e.g., Kamin, 1969).

  • Two possible mechanisms for blocking:

    • Prediction error: No learning because US isn’t surprising on AB+ trials.

    • Reduced attention to cue B on AB+ trials due to higher salience/better predictor cue A.

  • Evidence for the salience mechanism in humans (Beesley & Le Pelley, 2011).

  • Blocking is not all about prediction error.

Part 2: Extinction

Why is Extinction Important?

  • Some aspects of extinction don’t fit with Rescorla Wagner.

  • Clinical relevance: phobias and PTSD.

Extinction Experiment: Thug Recruitment

  • Assessing which candidates (Zack, Jeb, Paul) are evil enough to join a gang.

Prediction Error in Extinction

  • Early in training: All evil deeds are a surprise.

  • As you hear more about them, you begin to expect evil from Jeb and Paul.

  • Later in training: Expect evil from Paul but experience no evil: Negative prediction error.

  • Association between Paul and Evil reduces later in training?

Extinction: The Design

  • Phase 1: Paul-evil (A+)

    • Come to expect the US

  • Phase 2: Paul-no evil (A-)

    • Surprising absence of the US -> Prediction error on first trials

Extinction Curves

  • Initial learning curve: US becomes less surprising over trials; increase in associative strength slows down.

  • Extinction curve: absence of US becomes less surprising over trials; decrease in associative strength slows down.

Rescorla-Wagner and Extinction

Renewal

  • The CR returns in a different context from extinction.

Spontaneous Recovery

  • The CR returns with a delay between extinction and test.

Example of Renewal

  • Same task: which ones are evil enough to be let in?

Renewal: Extinction is Context Specific

  • Learnt that Paul does evil stuff in Plymouth.

  • Extinguished in Lamonic Bibber.

  • Plymouth and Lamonic Bibber are different 'contexts'.

  • Paul -> Evil association returns when back in Plymouth.

A Model of Renewal (Bouton)

  • Paul -> Evil, Lamonic Bibber

  • Reminder: Plymouth Paul-Evil

  • Lamonic Bibber Paul-Neutral

  • Plymouth Paul? -> New learning

Renewal Language

  • Context A: CS-US

  • Context B: CS-

  • Test: Context A CS? CS US

  • Context B: New learning

Most Important Thing

  • Extinction is specific to the context in which it was learnt.

Bouton on Extinction

  • Extinction is new learning of an inhibitory link.

  • That inhibitory link only operates in the context in which it was learnt.

  • Rescorla Wagner is wrong: The original CS-US link does not weaken.

Exposure Therapy and Extinction

  • Exposure therapy seems to be the same as extinction.

  • Both show renewal and spontaneous recovery.

  • How to avoid context specificity of exposure therapy (Laborda et al, 2011):

    • Have therapy in lots of different contexts.

    • Spread exposure treatments over a long period of time.

Important Things

  • Prediction error (e.g., Rescorla Wagner model) explains:

    • Learning curves (A+)

    • Conditioned inhibition (A+/AX-)

    • Blocking (A+ then AB+)

    • Extinction (A+ then A-)

  • Need additional explanations for:

    • Second-order conditioning (within-compound associations)

    • Attentional effects in blocking (e.g., Mackintosh, 1975)

    • Context specificity of extinction (inhibitory link that is activated by context)

Readings

  • Pearce (2008) Animal Learning and Cognition: An introduction, Chapter 3 (particularly pages 64-71; 83-86) and Chapter 5 (particular pages: 125-128; 134-138)

  • Additional: Beesley, T., & Le Pelley, M. (2011). The influence of blocking on overt attention and associability in human learning. Journal of Experimental Psychology: Animal Behavior Processes, 37(1), 114. Laborda, M. A., McConnell, B. L., & Miller, R. R. (2011). Behavioral techniques to reduce relapse after exposure therapy. Associative learning and conditioning theory, 1, 79-104.

Spontaneous Recovery

Bouton’s Model of Spontaneous Recovery

  • Time A CS-US

  • Time B CS-

  • Time C CS? CS US

  • Time A New learning

Bouton and Spontaneous Recovery

  • CS-US conditioning at Time/context A

  • Extinction at Time/context B

  • Test is later at Time/context C

  • Initial conditioning transfers from Time A to Time C

  • Extinction does not transfer from Time B to Time C

  • Spontaneous recovery is a form of renewal (ABC renewal).