Conditioning & Reinforcement: Unlearned vs. Learned Stimuli

Context & Lecture Flow

  • Lecturer is covering advanced reinforcement concepts within a broader motivation unit.
  • Content order differs slightly from the textbook; lecture follows instructor’s logical flow.
  • Students encouraged to consult textbook chapters or attend office hours if confused.

Key Terminology & High-Level Concepts

  • Reinforcer: Any stimulus that increases the future probability of the behavior it follows.
  • Aversive Stimulus: Any stimulus an organism will act to escape/avoid; decreases behavior when presented contingent on that behavior.
  • Unlearned = Unconditioned = Primary (no prior pairing needed).
  • Learned = Conditioned = Secondary (acquire value only after pairing with other stimuli).
  • Pairing: Simultaneous or sequential presentation of a neutral stimulus with a reinforcer or punisher so the neutral stimulus acquires similar properties.
  • Value-Altering Principle: Pairing changes the reinforcing or punishing value of a previously neutral stimulus.

Unlearned (Unconditioned / Primary) Reinforcers

  • Require no prior learning or pairing to function.
  • Universally (or near-universally) reinforcing across individuals of a species, though magnitudes vary.
  • Typical examples:
    • Food
    • Water/fluids
    • Sleep/rest
    • Sexual stimulation
    • Relief from discomfort (e.g., removal of a tight restraint)
  • Key attributes:
    • Directly tied to survival or reproduction.
    • Organism will work to obtain them without prior training.
    • Can transfer their value to neutral stimuli through pairing → creation of conditioned reinforcers.

Unlearned (Unconditioned) Aversive Stimuli

  • Instinctively avoided; no learning required.
  • Examples provided:
    • Pain (cuts, burns, pressure)
    • Loud or startling noises
    • Restricted movement or physical restraint
  • Characteristics:
    • Typically signal bodily harm or threat to survival.
    • Pairing them with neutral stimuli will create conditioned aversive stimuli.

Learned (Conditioned / Secondary) Reinforcers

  • Initially neutral; become reinforcing only after consistent pairing with primary or already-conditioned reinforcers.
  • Prime classroom example: Attention
    • At birth, caregiver presence provides food & comfort (primary reinforcers).
    • Attention [ initially neutral ] + Food/comfort → attention gains reinforcing value.
  • Absence of a learned reinforcer can signal absence of other reinforcers → becomes motivating in itself (e.g., no caregiver attention = no food, no diaper change, sustained discomfort).
  • Classic economic example: Money
    • \text{Paper Currency} + \text{Access to Goods/Services} \rightarrow \text{Money becomes reinforcing}
    • Context-specific (Serbian dinars in the U.S. lose reinforcing power).

Learned (Conditioned) Aversive Stimuli

  • Neutral stimuli converted to aversives through pairing with unconditioned aversives or other conditioned aversives.
  • Classroom scenario:
    • Neutral: Teacher standing at the front.
    • Paired with: Directions that lead to loss of free-time or access to preferred activities.
    • Result: Teacher’s presence becomes something the student attempts to escape/avoid.
  • Operates on same pairing mechanism as conditioned reinforcers but drives escape/avoidance rather than approach.

Pairing Process & Value-Altering Principle

  • Pairing definition: Present neutral stimulus immediately before a reinforcer or punisher.
    • Temporal contiguity critical; closer presentation → stronger conditioning.
  • If paired with a reinforcer → stimulus becomes a conditioned reinforcer.
  • If paired with a punisher → stimulus becomes a conditioned aversive stimulus.
  • Value-Altering Principle formalized:
    • \text{Neutral Stimulus} + \text{Reinforcer} \rightarrow \text{Conditioned Reinforcer (CS+)}
    • \text{Neutral Stimulus} + \text{Punisher} \rightarrow \text{Conditioned Aversive Stimulus (CS−)}

Behavior-Change Mechanisms (Reinforcement vs. Punishment)

  • Reinforcement: \text{Behavior}_{t} \xrightarrow{\text{Reinforcer}} \uparrow P(\text{Behavior}_{t+1})
  • Punishment: \text{Behavior}_{t} \xrightarrow{\text{Punisher}} \downarrow P(\text{Behavior}_{t+1})
  • Organism need not be consciously aware of contingencies; effects are automatic.

Concurrent Contingencies & Practical Implications

  • Multiple contingencies can operate simultaneously on the same behavior (e.g., attention + food + escape from discomfort).
  • Understanding which stimuli are primary vs. conditioned informs effective behavior-change strategies (e.g., clinical interventions, classroom management, training animals).
  • Real-world relevance:
    • Marketing: Pairing products with positive imagery.
    • Therapy: Systematic desensitization pairs feared stimuli with relaxation (reverse conditioning of aversives).

Study Prompt from Instructor

  • Create a 4-column note card:
    1. Unlearned Reinforcer (primary)
    2. Learned Reinforcer (secondary)
    3. Unlearned Aversive Stimulus
    4. Learned Aversive Stimulus
  • Generate at least two personal examples for each category without using the internet; rely on textbook and notes.

Administrative & Pedagogical Notes

  • Lecture may jump around relative to textbook sequence.
  • Students invited to office hours for clarification.
  • If confusion arises, re-read corresponding textbook chapters first.