Conditioning & Reinforcement: Unlearned vs. Learned Stimuli
Context & Lecture Flow
- Lecturer is covering advanced reinforcement concepts within a broader motivation unit.
- Content order differs slightly from the textbook; lecture follows instructor’s logical flow.
- Students encouraged to consult textbook chapters or attend office hours if confused.
Key Terminology & High-Level Concepts
- Reinforcer: Any stimulus that increases the future probability of the behavior it follows.
- Aversive Stimulus: Any stimulus an organism will act to escape/avoid; decreases behavior when presented contingent on that behavior.
- Unlearned = Unconditioned = Primary (no prior pairing needed).
- Learned = Conditioned = Secondary (acquire value only after pairing with other stimuli).
- Pairing: Simultaneous or sequential presentation of a neutral stimulus with a reinforcer or punisher so the neutral stimulus acquires similar properties.
- Value-Altering Principle: Pairing changes the reinforcing or punishing value of a previously neutral stimulus.
Unlearned (Unconditioned / Primary) Reinforcers
- Require no prior learning or pairing to function.
- Universally (or near-universally) reinforcing across individuals of a species, though magnitudes vary.
- Typical examples:
- Food
- Water/fluids
- Sleep/rest
- Sexual stimulation
- Relief from discomfort (e.g., removal of a tight restraint)
- Key attributes:
- Directly tied to survival or reproduction.
- Organism will work to obtain them without prior training.
- Can transfer their value to neutral stimuli through pairing → creation of conditioned reinforcers.
Unlearned (Unconditioned) Aversive Stimuli
- Instinctively avoided; no learning required.
- Examples provided:
- Pain (cuts, burns, pressure)
- Loud or startling noises
- Restricted movement or physical restraint
- Characteristics:
- Typically signal bodily harm or threat to survival.
- Pairing them with neutral stimuli will create conditioned aversive stimuli.
Learned (Conditioned / Secondary) Reinforcers
- Initially neutral; become reinforcing only after consistent pairing with primary or already-conditioned reinforcers.
- Prime classroom example: Attention
- At birth, caregiver presence provides food & comfort (primary reinforcers).
- Attention [ initially neutral ] + Food/comfort → attention gains reinforcing value.
- Absence of a learned reinforcer can signal absence of other reinforcers → becomes motivating in itself (e.g., no caregiver attention = no food, no diaper change, sustained discomfort).
- Classic economic example: Money
- \text{Paper Currency} + \text{Access to Goods/Services} \rightarrow \text{Money becomes reinforcing}
- Context-specific (Serbian dinars in the U.S. lose reinforcing power).
Learned (Conditioned) Aversive Stimuli
- Neutral stimuli converted to aversives through pairing with unconditioned aversives or other conditioned aversives.
- Classroom scenario:
- Neutral: Teacher standing at the front.
- Paired with: Directions that lead to loss of free-time or access to preferred activities.
- Result: Teacher’s presence becomes something the student attempts to escape/avoid.
- Operates on same pairing mechanism as conditioned reinforcers but drives escape/avoidance rather than approach.
Pairing Process & Value-Altering Principle
- Pairing definition: Present neutral stimulus immediately before a reinforcer or punisher.
- Temporal contiguity critical; closer presentation → stronger conditioning.
- If paired with a reinforcer → stimulus becomes a conditioned reinforcer.
- If paired with a punisher → stimulus becomes a conditioned aversive stimulus.
- Value-Altering Principle formalized:
- \text{Neutral Stimulus} + \text{Reinforcer} \rightarrow \text{Conditioned Reinforcer (CS+)}
- \text{Neutral Stimulus} + \text{Punisher} \rightarrow \text{Conditioned Aversive Stimulus (CS−)}
Behavior-Change Mechanisms (Reinforcement vs. Punishment)
- Reinforcement: \text{Behavior}_{t} \xrightarrow{\text{Reinforcer}} \uparrow P(\text{Behavior}_{t+1})
- Punishment: \text{Behavior}_{t} \xrightarrow{\text{Punisher}} \downarrow P(\text{Behavior}_{t+1})
- Organism need not be consciously aware of contingencies; effects are automatic.
Concurrent Contingencies & Practical Implications
- Multiple contingencies can operate simultaneously on the same behavior (e.g., attention + food + escape from discomfort).
- Understanding which stimuli are primary vs. conditioned informs effective behavior-change strategies (e.g., clinical interventions, classroom management, training animals).
- Real-world relevance:
- Marketing: Pairing products with positive imagery.
- Therapy: Systematic desensitization pairs feared stimuli with relaxation (reverse conditioning of aversives).
Study Prompt from Instructor
- Create a 4-column note card:
- Unlearned Reinforcer (primary)
- Learned Reinforcer (secondary)
- Unlearned Aversive Stimulus
- Learned Aversive Stimulus
- Generate at least two personal examples for each category without using the internet; rely on textbook and notes.
Administrative & Pedagogical Notes
- Lecture may jump around relative to textbook sequence.
- Students invited to office hours for clarification.
- If confusion arises, re-read corresponding textbook chapters first.