Conditioning & Reinforcement: Unlearned vs. Learned Stimuli

Context & Lecture Flow

Lecturer is covering advanced reinforcement concepts within a broader motivation unit.
Content order differs slightly from the textbook; lecture follows instructor’s logical flow.
Students encouraged to consult textbook chapters or attend office hours if confused.

Reinforcer: Any stimulus that increases the future probability of the behavior it follows.
Aversive Stimulus: Any stimulus an organism will act to escape/avoid; decreases behavior when presented contingent on that behavior.
Unlearned = Unconditioned = Primary (no prior pairing needed).
Learned = Conditioned = Secondary (acquire value only after pairing with other stimuli).
Pairing: Simultaneous or sequential presentation of a neutral stimulus with a reinforcer or punisher so the neutral stimulus acquires similar properties.
Value-Altering Principle: Pairing changes the reinforcing or punishing value of a previously neutral stimulus.

Require no prior learning or pairing to function.
Universally (or near-universally) reinforcing across individuals of a species, though magnitudes vary.
Typical examples:
- Food
- Water/fluids
- Sleep/rest
- Sexual stimulation
- Relief from discomfort (e.g., removal of a tight restraint)
Key attributes:
- Directly tied to survival or reproduction.
- Organism will work to obtain them without prior training.
- Can transfer their value to neutral stimuli through pairing → creation of conditioned reinforcers.

Instinctively avoided; no learning required.
Examples provided:
- Pain (cuts, burns, pressure)
- Loud or startling noises
- Restricted movement or physical restraint
Characteristics:
- Typically signal bodily harm or threat to survival.
- Pairing them with neutral stimuli will create conditioned aversive stimuli.

Initially neutral; become reinforcing only after consistent pairing with primary or already-conditioned reinforcers.
Prime classroom example: Attention
- At birth, caregiver presence provides food & comfort (primary reinforcers).
- Attention [ initially neutral ] + Food/comfort → attention gains reinforcing value.
Absence of a learned reinforcer can signal absence of other reinforcers → becomes motivating in itself (e.g., no caregiver attention = no food, no diaper change, sustained discomfort).
Classic economic example: Money
- \text{Paper Currency} + \text{Access to Goods/Services} \rightarrow \text{Money becomes reinforcing}
- Context-specific (Serbian dinars in the U.S. lose reinforcing power).

Neutral stimuli converted to aversives through pairing with unconditioned aversives or other conditioned aversives.
Classroom scenario:
- Neutral: Teacher standing at the front.
- Paired with: Directions that lead to loss of free-time or access to preferred activities.
- Result: Teacher’s presence becomes something the student attempts to escape/avoid.
Operates on same pairing mechanism as conditioned reinforcers but drives escape/avoidance rather than approach.

Pairing definition: Present neutral stimulus immediately before a reinforcer or punisher.
- Temporal contiguity critical; closer presentation → stronger conditioning.
If paired with a reinforcer → stimulus becomes a conditioned reinforcer.
If paired with a punisher → stimulus becomes a conditioned aversive stimulus.
Value-Altering Principle formalized:
- \text{Neutral Stimulus} + \text{Reinforcer} \rightarrow \text{Conditioned Reinforcer (CS+)}
- \text{Neutral Stimulus} + \text{Punisher} \rightarrow \text{Conditioned Aversive Stimulus (CS−)}

Reinforcement: \text{Behavior}_{t} \xrightarrow{\text{Reinforcer}} \uparrow P(\text{Behavior}_{t+1})
Punishment: \text{Behavior}_{t} \xrightarrow{\text{Punisher}} \downarrow P(\text{Behavior}_{t+1})
Organism need not be consciously aware of contingencies; effects are automatic.

Multiple contingencies can operate simultaneously on the same behavior (e.g., attention + food + escape from discomfort).
Understanding which stimuli are primary vs. conditioned informs effective behavior-change strategies (e.g., clinical interventions, classroom management, training animals).
Real-world relevance:
- Marketing: Pairing products with positive imagery.
- Therapy: Systematic desensitization pairs feared stimuli with relaxation (reverse conditioning of aversives).

Create a 4-column note card:
1. Unlearned Reinforcer (primary)
2. Learned Reinforcer (secondary)
3. Unlearned Aversive Stimulus
4. Learned Aversive Stimulus
Generate at least two personal examples for each category without using the internet; rely on textbook and notes.