Learning Lecture Notes

Learning Lecture 6

Is This Learning?

Situational Analysis:
- Cessation of thumb sucking by an infant: Yes
- Acquisition of language in children: Yes
- A computer program that generates random chess moves and learns from outcomes: Yes
- A worm in a T maze showing behavioral changes in trials: Yes
- MYCIN, a computer diagnosing infections with additional rules: No (Learning is typically attributed to adaptive behavior rather than mere data processing.)

What Is Learning?

Definition: Learning is the process of acquiring new and relatively enduring information or behaviors.

How Do We Learn?

Mechanism of Learning:
- We learn by association.
- Our minds naturally connect events that occur in sequence.

Associative Learning

Definition: Learning that involves associating one stimulus with another.

Example 1: Sea Snail

Event 1: Sea snail associates splash with tail shock.

Example 2: Lightning and Thunder

Stimulus 1: Lightning;
Stimulus 2: Thunder;
Response: We wince anticipating thunder upon seeing lightning.

Example 3: Responding with Consequence

Response: Balancing a ball.
Consequence: Receiving food strengthens the behavior.

Example 4: Candy Vending Machine

Stimulus: Pushing vending machine button results in receiving a candy bar, reinforcing the behavior.

Types of Learning

Classical Conditioning
Operant Conditioning
Observational Learning

Classical Conditioning

Pavlov’s Dog Experiments:
- Before Conditioning:
- Food (Unconditioned Stimulus, US) produces salivation (Unconditioned Response, UR).
- The tone (neutral stimulus) does not produce salivation.
- During Conditioning:
- The neutral stimulus (tone) is paired with the US (food), resulting in salivation (UR).
- After Conditioning:
- The neutral stimulus (now Conditioned Stimulus, CS) elicits salivation (now Conditioned Response, CR).

Multiple Choice Question

In classical conditioning, which is an originally irrelevant stimulus that triggers a learned response?
- Unconditioned Response (UR);
- Unconditioned Stimulus (US);
- Conditioned Response (CR);
- Conditioned Stimulus (CS).

Pavlov’s Process

Before Conditioning:
- Food (US) → Salivation (UR)
- Tone (Neutral Stimulus) → No Salivation
During Conditioning:
- Tone + Food → Salivation
After Conditioning:
- Tone (CS) → Salivation (CR)

Classical Conditioning: Concepts

Extinction:
- Gradual weakening of CR when CS is presented without US.
- Graphical Representation:
- Response (CR) decreases over trials when US is absent after conditioning.
Discrimination:
- Differential responding between classes of stimuli.
Generalization:
- Responding to stimuli similar to the CS; Pavlov conditioned dogs to salivate with similar stimuli such as vibrators on the thigh.

Applications of Classical Conditioning

Fear Conditioning:
- Example: Watson and Raynor’s Little Albert Experiment
- Unconditioned Stimulus (US): Loud noise
- Conditioned Stimulus (CS): White rat
- Unconditioned Response (UR) → Fear;
- Conditioned Response (CR) → Fear.
Health Applications:
- Drug cravings and treatment for substance-use disorders.
- Food cravings and difficulty avoiding sweets.
- Classical conditioning effects on immune responses.

Operant Conditioning

Definition: Learning associations between behavior and consequences.
Distinction from Classical Conditioning:
- Classical conditioning involves respondent behavior (automatic response to a stimulus).
- Operant conditioning involves operant behavior (behavior that operates on the environment to produce rewards or punishments).

B.F. Skinner's Methods

Skinner Box (operant chamber) comprises various components such as levers, pellets, and signals for reinforcement.

Shaping Behavior

Definition: Reinforcing closer approximations to a desired behavior.
Example: Shaping animals to discriminate between stimuli based on shape, color, and size.

Types of Reinforcers

Positive Reinforcer: Introduction of a positive stimulus (e.g., rewards).
Negative Reinforcer: Removal of a negative stimulus (relief from pain).
Primary vs. Conditioned (Secondary) Reinforcers

Behavior and Reinforcement Example

Scenario: Cleaning
- Positive Stimulus Introduced: Candy reinforces cleaning behavior.
- Negative Stimulus Removed: Headaches relieved through taking aspirin.

Understanding Punishment

Distinction: Punishment is not the same as negative reinforcement.
Types of Punishment:
- Presentation of negative stimuli (positive punishment) or removal of positive stimuli (negative punishment).

Multiple Choice Regarding Consequences

Example: A basketball player receiving a penalty for a foul illustrates a type of punishment.
Distinctions between PR (positive reinforcement), NR (negative reinforcement), PP (positive punishment), and NP (negative punishment) are necessary for proper classification.

Alternative to Punishment

Using Positive Reinforcement to discourage behaviors by reinforcing incompatible responses rather than applying punishments.

Schedules of Reinforcement

Continuous vs. Partial Reinforcement
Fixed Ratio Schedule: Reinforcement after a fixed number of responses; high rate of responding with pause after reinforcement.
Variable Ratio Schedule: Reinforcement occurs after a varying number of responses; higher rate of responding with no pause.
Fixed Interval Schedule: Reinforcement after a fixed time period; lower rates with absences between reinforcement.
Variable Interval Schedule: Reinforcement occurs after varying time periods; constant behavior with no pauses.

Multiple Choice: Examples of Reinforcement Schedules

Scenario: Coffee shop rewarding every 10 drinks purchased is an example of fixed-ratio reinforcement schedule.

Applications of Operant Conditioning

In schools, immediate feedback through quizzing and behavior plans assist children with disabilities.
In sports, reinforcement for minor milestones enhances performance.
In social media, likes and notifications serve as positive reinforcers through variable ratio reinforcement.

Contrasting Classical and Operant Conditioning

Classical Conditioning: Learning associations between involuntary events, automatic responses; extinction occurs when CS presented alone.
Operant Conditioning: Learning associations between voluntary behavior and consequences; extinction occurs when reinforcement stops.

Observational Learning

Definition: Higher animals, especially humans, learn through observing and imitating others.
Key Concepts:
- Modeling: Observing and imitating behaviors.
- Mirror Neurons: Serve as the neural basis for observational learning.

Bandura’s Bobo Doll Studies

Findings: Observing aggressive behavior impact on imitation by children; vicarious reinforcement affects behavior tendencies.

Modeling Violence

Research indicates media violence may increase aggressive expression but does not directly lead to real-world violence according to studies.

Video Resources

Integrate visual demonstrations through links to relevant video examples regarding operant conditioning and observational learning.