Classical Conditioning Theory Part 2

Rescorla–Wagner Core Principle

  • Classical-conditioning learning rule formalised in the early 1970s.
  • Canonical equation: ΔV=αβ(λV)\Delta V = \alpha \beta (\lambda - V)
    • VV = current associative strength.
    • λ\lambda = maximum associative strength the US can support.
    • α,β\alpha,\,\beta = salience / learning-rate parameters for CS and US.
  • Essence: use feedback error ("what happened minus what was expected") to update associations.
  • Idea has been broadened far beyond animal conditioning into modern machine-learning, neural-prosthetics and artificial intelligence.

Brain–Computer Interface (BCI) Application (Tetraplegia Study)

  • Participant: individual with high-level spinal-cord injury (no voluntary hand/leg movement).
  • Implant: “Utah” micro-electrode array (square, < penny-sized) inserted into arm/hand region of sensorimotor cortex.
    • Dozens of needle-like electrodes record population spiking patterns.
  • Task progression
    1. Cursor control phase (first ~70 days):
    • Cursor starts at screen centre; participant tries to move it to illuminated goal targets.
    • Machine-learning algorithm (foundation = Rescorla–Wagner-style error correction) maps cortical patterns → 2-D cursor velocity.
    • Continuous feedback: If generated pattern matches stored template for “up”, cursor moves up etc.
    • Performance visibly improves across training; occasional mistrials illustrate online model updating.
    1. Prosthetic-hand phase:
    • Same decoded signals routed to a robotic gripper.
    • Participant learns to open/close hand, grasp objects ⇒ tangible improvement in activities of daily living.
  • Significance
    • Demonstrates feedback-driven plasticity at cortical level.
    • Provides real-world validation of error-based learning rules in neuro-prosthetics.
    • Opens pathway toward combining recording + eventual stimulus-based rehabilitation (discussed later).

Artificial-Intelligence Example – IBM Watson & “Smartest Machine on Earth” Documentary

  • Watson trains via massive corpus of historical Jeopardy! Q&A pairs ("training trials").
  • Straight keyword search failed (many absurd answers) → needed weight-adjustment algorithm based on expected vs obtained outcome.
  • Rescorla–Wagner analogy:
    • Each candidate answer has weight V<em>iV<em>i; feedback from correct/incorrect updates V</em>iV</em>i.
  • Applications: meteorology (Weather Channel forecasting), customer-service chat-bots, medical diagnosis.
  • Pedagogical note: Documentary + guided questions are examinable content.

Historical Roots of Eyeblink Conditioning

  • First human lab (1920s, pre-IRB): Clark Hull (in visor) slaps grad-student Ernest Hilgard’s face after a tone to induce blink.
  • Modern humane protocol (rabbits, humans):
    • CS = pure tone.
    • US = mild corneal air-puff.
    • CR = anticipatory eyeblink.
  • Widely adopted because:
    • Simple, quantifiable, high trial-throughput.
    • Cerebellar circuitry well mapped, enabling fine neurobiological analysis.
    • Diagnostic probe for clinical disorders affecting cerebellum, brainstem, or learning processes.

Cerebellar Circuitry: Three Converging Pathways

1 – Conditioned-Stimulus (CS) Pathway
  • Tone activates auditory relay → pontine nuclei.
  • Pontine sends:
    • Excitatory collaterals to interpositus nucleus (deep cerebellar nucleus).
    • Mossy-fibre projections to cerebellar cortex → granule cellsPurkinje cells.
  • Purkinje cells exert inhibitory (GABAergic\text{GABAergic}) influence on interpositus.
2 – Unconditioned-Stimulus (US) Pathway
  • Air-puff triggers trigeminal reflex AND ascends via brainstem → inferior olive.
  • Climbing-fibre outputs excite both Purkinje cells and interpositus nucleus.
3 – Conditioned-Response (CR) Output
  • Once interpositus activity surpasses threshold, projects to red nucleus / cranial-facial motor nuclei → eyelid muscles.
Sites of Convergence / Candidate Memory Loci
  • Purkinje cells (receive CS via mossy + US via climbing fibres).
  • Interpositus nucleus (receives direct excitatory input from both pathways + inhibitory gating from Purkinje).

Lesion Evidence

  • Focal electrolytic or pharmacological lesion of interpositus:
    • Retrograde amnesia: previously learned CR abolished although reflex blink remains intact.
    • Anterograde amnesia: post-lesion animals cannot acquire new CS–US association.
  • Purkinje cells distributed across cortex → global lesion impossible; later genetic knock-out / mutation models show parallel learning deficits, confirming their necessity.

Electrophysiological Evidence

Interpositus Recording
  • Pre-training: CS evokes little firing; US gives small burst; blink occurs after US (pure reflex).
  • Post-training: CS alone produces gradual ramp-up of spikes peaking just before predicted US; blink now anticipatory.
Purkinje Recording
  • Baseline: high tonic firing (inhibitory).
  • During CS after training: marked pause in firing ("Purkinje pause") coincident with interpositus ramp → removes inhibition → disinhibits interpositus → CR triggered.
  • After US time-point: Purkinje firing gradually resumes, reinstating inhibition.
Functional Interpretation
  • Learning corresponds to CS-induced, time-specific decrease in Purkinje activity + complementary increase in interpositus firing.
  • Temporal precision encodes inter-stimulus interval.

Electrical-Stimulation (“Matrix Upload”) Experiments

Goal
  • Test whether mere activation of neural pathways (no external tone or air-puff) is sufficient for acquisition, extinction, inhibition, spontaneous recovery.
Key Experiments & Results
  1. US Replacement (Mach 1986)

    • Group A: real air-puff (US).
    • Group B: electrical stimulation (ES) of inferior olive (start of US pathway).
    • Acquisition curves identical → ES-US is functionally equivalent.
  2. CS Replacement

    • ES-pontine nuclei used as CS; real air-puff as US.
    • Rabbits acquired CR; showed normal extinction when ES-CS presented alone; could undergo inhibitory conditioning (CS− trials reduce responding).
  3. Full Internal Pairing

    • ES-pontine (CS) + ES-inferior-olive (US) paired with appropriate temporal offset.
    • Animals never experience tone or puff, yet develop robust CR to pontine-stimulation alone.
    • Exhibit standard phenomena: extinction, spontaneous recovery, reacquisition, delayed re-learning after explicit CS− training.
Implications
  • Learning rule operates on neural-activity patterns irrespective of sensory modality.
  • Supports feasibility of therapeutic “electrical rehabilitation” paradigms (e.g., spinal-cord plasticity, BCI closed-loop stimulation).

Excitatory vs Inhibitory Neuron Dynamics (Important Background)

  • Excitatory cells (e.g., mossy-fibre targets, interpositus neurons): low baseline; fire bursts only when driven.
  • Inhibitory cells (Purkinje): high tonic rate; learning often manifests as decrease in their activity.
  • Net effect during CS after conditioning = shift of excitation/inhibition balance favouring motor output.

Broader Relevance & Ethical / Practical Considerations

  • Eye-blink paradigm foundational for dissecting memory engrams; parallels seen in fear conditioning, habit learning, addiction models.
  • Ethical transition from face-slap (1920s) → humane air-puff underscores evolution of research oversight (IRB protocols).
  • Electrical-pathway conditioning foreshadows questions about autonomy, consent and potential misuse of neuro-stimulation technologies ("Matrix" analogy).

Connections to Previous & Future Lectures

  • Builds upon earlier classical-conditioning concepts (CS, US, CR, extinction, spontaneous recovery, inhibitory conditioning).
  • Next lecture will extend conditioning framework to drug tolerance, dependence, and relapse.