Email: Christopher.mitchell@plymouth.ac.uk
Office: PSQ B229
Office Hours: Wednesdays 2-3pm, Fridays 2-3pm (subject to change)
Check-in code: Prof Chris Mitchell
First Half: Blocking
Reminder of blocking
Prediction error: Rescorla Wagner
An alternative ‘attentional’ account of blocking
Second Half: Extinction
Protection from extinction
Context specificity of extinction
Reading: Pearce (2008) Animal Learning and Cognition, Chs 3 and 5
Procedure:
Participants are presented with one or two colored squares.
Sometimes the presentation is followed by an electric shock.
Task: determine which colors lead to shock.
Shock Intensity: Double shock if both colors present lead to shock individually.
Presentation of colors (e.g., Brown, White, Orange, Purple) followed by shock.
Blocking Occurs: Prior training with one stimulus (A) blocks learning about a second stimulus (B) when they are presented together (AB).
Example: Brown > Purple
Blocking is usually explained as a failure to form a link between cue B and the US.
Example: Orange -> Shock!, Purple (no link formed during blocking).
Design: A+ then AB+ ('+' indicates presence of the US, A & B are Conditioned Stimuli)
By the end of A+ trials, the US is fully predicted by A; therefore, there is no prediction error.
During AB+ trials, the US is not surprising because A already predicts it, so B isn't processed.
No B-US association is learned because AB+ trials are the only time B appears.
Learning about B is 'blocked' by A.
Before blocking, it was thought that associations form simply by pairing two stimuli (Hebbian learning).
In blocking, Cue B -> US on AB+ trials, but little learning occurs.
Demonstrates that learning isn't just about pairing stimuli; prediction error matters.
Central to contemporary learning theory.
No prediction error on AB+ trials after A+ trials because the US (+) is not surprising.
The US becomes unsurprising and less potent over repeated A+ trials.
Comparison: Blocking (A+ then AB+) vs. Overshadowing (CD+).
In overshadowing, the US is surprising on initial CD+ trials.
Surprise: The difference between what you predict will happen and what actually happens.
\Delta V = \alpha(\lambda - \Sigma V)
\Delta V = learning: change in associative strength on this trial (blocked cue B on AB+ trials)
\lambda = total associative strength supported by the outcome (US)
\Sigma V = the expected outcome (given all cues present A & B)
\alpha = attention to cues (salience of the CS); this doesn’t change with learning according to R&W
On AB+ trials, \lambda = \Sigma V because of the pretrained 'blocking' cue A. Thus, no learning occurs for B.
Learning occurs only when there is prediction error (i.e., when you are surprised by the outcome).
Signals of important events are attended to especially well.
The salience of the CS “\alpha” is not fixed (contrary to R&W’s claim); it can change with experience/learning.
Blocking might occur because attention to B decreases due to the presence of the better predictor A.
Plymouth Argyle loses every week.
Buy Bellingham -> win every time he plays.
Buy another player (Palmer) -> win again.
How good is Palmer? Evaluated under conditions of blocking.
Will Plymouth win? Look for Bellingham (attend to him).
Palmer processed under divided attention.
At test, Bellingham's contribution is clear, but Palmer's is less so due to blocked learning.
A+ then AB+
The most predictive cues will increase in salience/\alpha (e.g., cue A).
Relatively less predictive cues will decrease in \alpha (e.g., cue B on AB+ trials).
Not much learning about cue B.
No shocks.
Present stimuli on the eyetracker screen.
Cues: chemicals (Addexium, Rezaline).
Outcomes: symptoms in Mr. X (itchiness, nausea).
Do participants look at the word 'Addexium' (the CS)?
Stages
Stage 1: A – o1; C – o1; E – o2; G – o2; I – o1; L – o1; O – o2; R – o2
Stage 2: AB – o1; CD – o1; EF – o2; GH – o2; JK – o1; MN – o1; PQ – o2; ST – o2
Stage 3: BK – o3; DN – o4; FQ – o3; HT – o4; BN?; DK?; FT?; HQ?
Test: BT?; DQ?; FN?; HK?
Attention to cue A is high (good predictor of the outcome).
Attention (eye gaze) to 'blocked cue' B is reduced compared to overshadowing controls J and K.
Less attention/processing of Cue B would explain why there is less learning about that cue on AB-O1 trials.
Blocking is seen in humans (e.g., Beesley & Le Pelley, 2011) and in non-humans (e.g., Kamin, 1969).
Two possible mechanisms for blocking:
Prediction error: No learning because US isn’t surprising on AB+ trials.
Reduced attention to cue B on AB+ trials due to higher salience/better predictor cue A.
Evidence for the salience mechanism in humans (Beesley & Le Pelley, 2011).
Blocking is not all about prediction error.
Some aspects of extinction don’t fit with Rescorla Wagner.
Clinical relevance: phobias and PTSD.
Assessing which candidates (Zack, Jeb, Paul) are evil enough to join a gang.
Early in training: All evil deeds are a surprise.
As you hear more about them, you begin to expect evil from Jeb and Paul.
Later in training: Expect evil from Paul but experience no evil: Negative prediction error.
Association between Paul and Evil reduces later in training?
Phase 1: Paul-evil (A+)
Come to expect the US
Phase 2: Paul-no evil (A-)
Surprising absence of the US -> Prediction error on first trials
Initial learning curve: US becomes less surprising over trials; increase in associative strength slows down.
Extinction curve: absence of US becomes less surprising over trials; decrease in associative strength slows down.
The CR returns in a different context from extinction.
The CR returns with a delay between extinction and test.
Same task: which ones are evil enough to be let in?
Learnt that Paul does evil stuff in Plymouth.
Extinguished in Lamonic Bibber.
Plymouth and Lamonic Bibber are different 'contexts'.
Paul -> Evil association returns when back in Plymouth.
Paul -> Evil, Lamonic Bibber
Reminder: Plymouth Paul-Evil
Lamonic Bibber Paul-Neutral
Plymouth Paul? -> New learning
Context A: CS-US
Context B: CS-
Test: Context A CS? CS US
Context B: New learning
Extinction is specific to the context in which it was learnt.
Extinction is new learning of an inhibitory link.
That inhibitory link only operates in the context in which it was learnt.
Rescorla Wagner is wrong: The original CS-US link does not weaken.
Exposure therapy seems to be the same as extinction.
Both show renewal and spontaneous recovery.
How to avoid context specificity of exposure therapy (Laborda et al, 2011):
Have therapy in lots of different contexts.
Spread exposure treatments over a long period of time.
Prediction error (e.g., Rescorla Wagner model) explains:
Learning curves (A+)
Conditioned inhibition (A+/AX-)
Blocking (A+ then AB+)
Extinction (A+ then A-)
Need additional explanations for:
Second-order conditioning (within-compound associations)
Attentional effects in blocking (e.g., Mackintosh, 1975)
Context specificity of extinction (inhibitory link that is activated by context)
Pearce (2008) Animal Learning and Cognition: An introduction, Chapter 3 (particularly pages 64-71; 83-86) and Chapter 5 (particular pages: 125-128; 134-138)
Additional: Beesley, T., & Le Pelley, M. (2011). The influence of blocking on overt attention and associability in human learning. Journal of Experimental Psychology: Animal Behavior Processes, 37(1), 114. Laborda, M. A., McConnell, B. L., & Miller, R. R. (2011). Behavioral techniques to reduce relapse after exposure therapy. Associative learning and conditioning theory, 1, 79-104.
Time A CS-US
Time B CS-
Time C CS? CS US
Time A New learning
CS-US conditioning at Time/context A
Extinction at Time/context B
Test is later at Time/context C
Initial conditioning transfers from Time A to Time C
Extinction does not transfer from Time B to Time C
Spontaneous recovery is a form of renewal (ABC renewal).