Lecture Notes: Stimulus Control and Respondent Conditioning

Class Rep Announcements

Discord Server:
- A Discord server has been set up for students to contact the class representatives (specifically). One of the class reps is Natalie.
- The link will be shared soon, possibly on Friday or whenever Michelle has time.
- Purpose: To address course-related issues and communicate concerns to course coordinators.
- Not for special conditions, which are handled separately.
Memory Lab:
- The memory lab is due on Sunday. Students who haven't started should begin working on it.
Lecture Attendance:
- This cohort has one of the highest lecture attendance rates in a long time.
- Students are encouraged to take breaks and treat themselves.
Lab Access Over Holidays:
- Psychology students can access labs during scheduled lab times over the holidays, even if there are no scheduled lab sessions.
Study Groups:
- There is interest in study groups for upcoming tests. Cohort forward will start those, potentially in week seven.
- A form will be sent out to determine availability.
- AUPSA will support the study groups with snacks, drinks, and potentially senior students.

Addressing the Class

Gratitude for Attendance:
- Appreciation for students attending the lecture, especially given the test tomorrow.
- Attendance makes interactive lectures easier.

Recap of Previous Lecture: Intermittent vs. Continuous Reinforcement

Continuous Reinforcement: Reinforcing every response.
Intermittent Reinforcement: Reinforcing some responses according to a rule (schedule of reinforcement).
Resistance to Extinction:
- Intermittent reinforcement produces behavior more resistant to extinction.
- Example: Rats intermittently reinforced (30% of the time) took longer to extinguish behavior compared to those continuously reinforced.
- Explanation: Transition from reinforcement to extinction is harder to detect with intermittent reinforcement.

Schedules of Reinforcement

Rules to specify how intermittent reinforcement is administered.
Ratio Schedule: Based on the number of responses.
- Fixed Ratio: Reinforce every nth response exactly.
- Variable Ratio: Reinforce on average every nth response.
Interval Schedules:
- Fixed Interval (FI): A response is reinforced when a fixed amount of time has elapsed since the last reinforcer; a response must be made after the interval.
  - Example: FI-15 second schedule: After 15 seconds, the first response is reinforced.
  - Pause After Reinforcement: Animals pause after reinforcement, then accelerate responding until the next reinforcement.
  - Explanation: Not due to consuming the reinforcer (as variable interval schedules don't show pauses).
  - The pause is due to the predictability of the next reinforcer.
    - Animals learn they have to wait before responding.
    - Variable interval schedules don't have this predictability, so responding starts immediately. This is an example of using a scientific experiment to tease apart two possible explanations for a phenomenon.
DRO Schedules (Differential Reinforcement of Other Behavior):
- Distinction from fixed interval and fixed time schedules is essential.
- Fixed Interval (FI): Response is reinforced when time has passed (must respond).
- DRO Schedule: Every response restarts the timer (must not respond).
- Fixed Time (FT): Reinforcer occurs after a time interval, regardless of response (doesn't matter whether you respond or not).
- DRO schedules are effective for eliminating unwanted behavior: both negative punishment of unwanted behavior and positive reinforcement of all other behaviors.
- Important to offer an alternative behavior to earn reinforcements.
Fixed Time Schedules (FT):
- Can be used to study superstition experimentally.
- Skinner's Pigeon Experiment: Pigeons exhibited idiosyncratic behaviors before the reinforcer arrived.
- Law of Effect: Behaviors followed by reinforcement are repeated, even if they don't cause the reinforcement.
- Temporal Association: Close timing between response and reinforcer is key.
- Human Superstitions:
  - Similar experiments work with humans (Wagner and Morris's clown experiment).
  - Most superstitions are about negative reinforcement (avoiding bad outcomes).
  - Irrational beliefs can be interpreted as accidental reinforcement of behaviors.

Stimulus Control of Operant Behavior

Focus shift: From consequences to stimuli that precede behavior.
Operant behavior is behavior controlled by its consequences. Stimulus control of operant behavior refers to how stimuli that precede the behavior affect the probability of that behavior.
We don't always behave the same in different circumstances.
Stimulus context indicates the consequences for responding.
The law of effect can be refined: When a response is reinforced, it becomes more likely in the same stimulus context.
Changing the stimulus context changes the probability of the response because different stimulus contexts signal different consequences for responding.

Examples of the Stimulus Context

Example (Petting a Kitten): Approaching and petting a cute kitten is reinforced (pleasant experience).
- Generalization: This behavior generalizes to other animals based on similarity.
- Probability of petting depends on the similarity to the kitten.
  - High probability for similar cats.
  - Less probability for a possum.
  - Even less for a ferret or wolf.
- Positive Punishment: If the cat bites or scratches, it becomes less likely I will approach it again. This would also tend to generalize.

Studying Stimulus Control

Training a Pigeon:
- Train a pigeon to peck a yellow key for food reinforcement (VI schedule).
- VI schedules maintain a high, constant rate of responding.
- Change the stimulus to an orange-colored key.
  - Prediction: Generalization will occur because orange is similar to yellow, but with less behavior.
  - If changed to a blue key the behavior would most likely drop further.

Generalization Gradients

Graphs showing the amount of responding as a function of a stimulus dimension (e.g., color).
- Training Stimulus: S+ (positive discriminative stimulus), signals reinforcement. Pecking is reinforced.
Decremental Gradient:
- Shape: Peaked, with most responding at the training value.
- As you move away from the original trained stimulus context, the less responding you should see. The less it should generalize.
Incremental Gradient:
- If pigeons received electric shocks for pecking the yellow key they would no longer pick it.
- This tendency not to pick it would generalize, leading it to peck an orange key more than yellow, and so forth.

Terminology Review

Decremental Gradient: Most responses at the original training stimulus (S+).
Responding generalizes to similar stimuli.
Amount of generalization depends on similarity.
Stimulus Control: The extent to which stimuli that precede or accompany operant behavior affect the rate or probability of that behavior.
- Less generalization = more stimulus control.
Discrimination: Behaving differently in the presence of different stimuli, change the context and observe whether you get different behavior.
Generalization and discrimination are opposites.
Stimulus control means lots of discrimination, not much generalization.

Real Data: Guttman and Kalish (1956)

First to demonstrate generalization gradients empirically.
Technological Reason: No reinforcement during testing with new colors.
If reinforces are given during testing, the animal will ignore the colors because they will get reinforced regardless of what the color is.
Tests must be in extinction to see the effect of previous training without further reinforcement.
Partial Reinforcement Effect: Use intermittent reinforcement to extend behavior during extinction.
- Trained picking yellow key on a variable interval schedule, then tested with other colors.
Four Groups of Pigeons: Trained with different S+ (different colors of key).
Trained at four different training values.
If they had only trained one group, it could be explained why they preferred a color from previous training. It was important they train different groups at different training values.
When they were trained at certain nanometers and certain colors, that is what they picked at most. When the stimulus changed, responding decreased.
Large stimulus control (steep gradients): Relatively small color changes produced big decreases in behavior. Pigeons are pretty good at this.
Pigeons have better color vision so they have good at determining if the colors are different.

Summary of Stimulus Control of Operant Behaviour

Nothing is learned in isolation. We always learn things within a particular stimulus context.
Change the context, change the behavior in a systematic way.
Stimulus Control: Less generalization, more discrimination.
ABC of Behavior:
- Antecedents (stimuli before behavior) set the occasion for responding.
- Behavior (responses) followed by consequences (reinforcers).
Example: Actively train discrimination where responses to a red key are reinforced, but not to a green key.
- Stimulus signals the consequence for responding.

Interesting implication of Stimulus Control

Learning is likely to improve by recalling material within the same stimulus context.
Experiment: Godin and Badley (underwater learning).
- Tested whether learning a list of words underwater or on land affected recall.
- Words were recalled better if they were being tested in the same context that they were learned.
- Experiment failed to demonstrate this.
Training generalisation: It is important to train for generalisation in applied behaviour analysis.
The way different contexts are distinguished needs to be trained.

Respondent Conditioning (Classical Conditioning) : Another way Stimuli Affect Behaviour that isn't Signaling the Consequences of Responding.

Stimulus control of operant behaviour: consequences are signalled by stimuli.
Respondent conditioning: based on the association between two stimuli.
Pavlov’s dogs

Definitions for Pavlov's Dog Experiment

Unconditioned Stimulus: A stimulus that automatically elicits an unconditioned response.
- Example: Meat for a dog elicits salivation.
Unconditioned Response: Example: salivation
Previously Neutral Stimulus: Might be paired with the unconditioned stimulus to reliably predict it.
- Pavlov would ring a bell before giving the dog food.
- Ring the bell before the food.
- The Bell did not elicit salivation but elicited salivation once they realised food was coming.
Conditioned Stimulus: Neutral stimulus becomes a Conditioned Stimulus

Details About Pavlov's Dog Experiment

US (Unconditioned Stimulus): food, which reliably elicits an Unconditioned Response of salivation.
Ring a bell just before presenting the food.
- Initially, bell does nothing.
Bell had become a Conditioned Stimulus(CS) that elicits salivation (Conditioned Response = CR).
Conditioned stimuli elicit conditioned responses, while unconditioned stimuli elicit unconditioned responses.
The Bell is NOT signaling. It is a different mechanism here.
Learned behaviour is when an animals learn to predict future events.
- In Pavlovian conditioning, classical conditioning, the bell predicts that food is coming, the dog can use the bell to predict the future.
- Pause after reinforcement is something animals uses to predict the future for something as simple as food.

Short Video Demonstrations from other Modules

Classical conditioning can be strong enough to make you sick by influencing your immune system.
Classical conditioning role in phobias.

Video Demonstration 1: Immunity Suppression Example

Philip Zimbardo
People can unintentionally condition the rat's immune response.