Biopsychology Lecture Notes: Decision Making, Reward, and Plasticity
Choice, Reward, and Decision Making
- Event context (Week-long neuroscience fair talks): Speakers from the Poe Fly Lab, Memory and Eye Movements Lab, Fullstack Affective Neuroscience Lab, and others; food provided. Date/time: Wed, Sep 17, 2025, 05:00–06:30 pm; location: JBHT 0144.
- Schedule highlights: This week topics include Choice & Plasticity (Tue) and Learning & Memory (Thu); Next week includes Review 1 (Tue) and Exam 1 (Thu). Required readings: week4spike, week4neuro2064. No assignment; timeline mentioned.
- Quick theme: Applying neuroscience methods to study how you “choose” and how brain plasticity underpins learning from experience. Mention of designer drug / designer receptor (likely a nod to modern neuroscience tools such as DREADDs).
- Video resource: vid: https://youtu.be/de_b7k9kQp0?t=2m10s
- Quiz and practice: Quiz (graded for completion) on “choice” at https://fullstackneuro.io/biopsyc/ or via QR code.
Core concepts: What drives human choice
- Decision variables often considered in models of choice:
- Reward magnitude (amount of gain or loss)
- Probability of receiving the reward
- Time of reward (now vs. later)
- Risk (potential gain and potential loss)
- These together enable deconstruction of human choice behavior. extChoice=f(Rm,P,T,extrisk)
- Context slide emphasizes a framework for linking subjective value to neural signals.
Neuroanatomy: brain regions implicated in choice and cognition
- Frontal Lobe
- Motor control (motor cortex)
- Cognitive activities (planning, decision making, problem solving)
- Language (Broca’s area)
- Parietal Lobe
- Sensation (somatosensory cortex: touch, pressure, temperature)
- Fine sensation (texture, size, shape)
- Spatial awareness
- Temporal Lobe
- Hearing (auditory cortex)
- Memory (hippocampus)
- Language (Wernicke’s area)
- Occipital Lobe
- Neuroanatomy: Midsagittal plane (location of major structures in a sagittal view)
The Triune brain and the dopaminergic reward system
- Paul MacLean’s Triune Brain framework:
- Neomammalian (human cortex) – symbolic processing
- Paleomammalian (limbic system) – emotions and social processing
- Reptilian (brainstem) – basic survival functions; the 4 F's: Feeding, Fighting, Fleeing, Reproduction (context note: “N, P, R” in slide shorthand)
- Mesolimbic dopamine system:
- Ventral tegmental area (VTA) produces dopamine
- Dopamine projection targets include nucleus accumbens (NAcc) and medial prefrontal cortex (MPFC)
- Core idea: dopamine signaling links anticipated rewards to motivational drive and action selection.
Imaging methods and experimental tasks
- Functional Magnetic Resonance Imaging (FMRI)
- Temporal resolution ~2 seconds
- Spatial resolution ~3 mm
- Setup: projector, computer, scanner, response box, etc.
- Monetary Incentive Delay (MID) task (Knutson et al., 2000):
- Conditions during trials: Anticipation of gain (+$5), Anticipation of loss (-$5), Anticipation of no money ($0)
- Trial structure: cue → delay → target → feedback
- Purpose: dissociate neural signals during anticipation of gain vs. loss vs. neutral outcomes
- Outcome summary from MID research:
- Gain anticipation activates NAcc (nucleus accumbens) – the same region implicated in rodent self-stimulation.
- Loss anticipation activates Anterior Insula (AIns).
- MPFC (medial prefrontal cortex) is involved in coding reward probability and value signals during anticipation (as seen in condition-specific responses).
- The endowment effect and related valuation processes link to AIns activity during loss/gain framing.
Key studies and findings in reward processing
- Knutson et al., 2000 (MID task):
- Gain anticipation → NAcc activation
- Loss anticipation → Anterior Insula activation
- Data illustrate how different brain regions encode aspects of motivational value before outcomes.
- Knutson et al., 2005 (Probability and MPFC):
- Probability of reward and magnitude interact to shape MPFC activity during anticipation
- Example conditions: +$5 with 80% probability, +$5 with 50% probability, +$5 with 20% probability
- MPFC shows coding related to the expected value across varying probabilities of gain
- Representative data point: A = 14, PRB (probability) cue → anticipation response in MPFC
- Knutson et al., 2005 (quantitative symbols):
- Under +$5/80%, +$5/50%, +$5/20% conditions, MPFC response tracks the probabilistic value of the anticipated reward
- Knutson et al., 2000/2005 synthesis takeaways:
- Gain magnitude is coded in NAcc
- Loss magnitude is coded in AIns
- Probability of obtaining reward is coded in MPFC
- Time (now vs. later) involves LPFC (for delayed rewards and executive control)
Endowment effect and anterior insula
- Endowment effect: owning an object increases its subjective value; “it hurts to lose something you love.”
- Data example: endowment of a favorite Razorbacks cup used to demonstrate the effect with personal data.
- Anterior Insula (AIns) activity correlates with larger endowment effects (i.e., stronger loss aversion when owning an item).
- Additional figure (Knutson et al., 2008) supports the link between AIns activation and endowment-related valuation changes.
Valuing gains, losses, and decision signals across networks
- Gaining vs losing reward:
- Gain magnitude coding in NAcc
- Loss coding in AIns
- Probability of getting reward coded in MPFC
- Time of reward (now vs. later) coded in LPFC for delayed decisions
- Structural connectivity and gambling behavior (AIns–NAcc):
- Stronger white-matter connections between AIns and NAcc associated with not gambling as a behavior pattern (Knutson et al. lineages and follow-ups, Leong et al.).
- Note of reported correlations: c = -0.40, c’ = -0.24, a = -0.35, b = 0.46** (illustrative path coefficients for a mediation-style interpretation; p-values indicated by * / **).
- Practical question raised: How can real-time brain signals be used?
- Every second, one could in principle gauge how much someone likes/dislikes a stimulus or experiences loss aversion
- Potential applications: marketing, product design, behavioral interventions, and neuromarketing research (with ethical considerations).
Applied examples: Predicting consumer choices
- Predict buying (GODIVA chocolate; Knutson et al., 2005):
- Task: product shown with price; subjects decide Yes/No while NAcc (and related regions) are measured
- Data patterns show NAcc activity tracks anticipated reward value and can predict purchase choices
- Example figures show sequential trials with fixations and choices; NAcc signal correlates with eventual decision
- Predict buying: additional variables in datasets include fixations, choice timing (4 s windows), and subjective value estimates.
- Predict not buying: similar MID-like patterns show Insular regions modulate decisions to refrain from purchases; data suggest insula activity relates to avoidance of costly or less desirable options.
Cross-domain demonstrations: music and streaming sales
- Predict music sales (Berns & Moore, 2012):
- Experimental paradigm: listen to songs; collect ratings (popularity, familiarity) and actual sales data (albums sold)
- Finding: NAcc activation during listening correlates with subsequent sales performance (R ≈ 0.32)
- Predict streaming/view trials (Tong et al., 2020):
- Design: participants view trial blocks of videos, rate engagement/valence/arousal; measure NAcc activity during viewing
- Result: NAcc activity correlates with view frequency and engagement metrics (approximate r ≈ 0.46, p < .01 in reported data)
- Takeaway: reward-related neural signals can predict real-world consumer behaviors across products and media.
Time, delay, and future reward valuation
- Time-based decision tasks:
- Classic questions: Would you rather have $10 today or $15 in 30 days?
- Neural coding patterns:
- Immediate rewards vs. delayed rewards invoke different prefrontal circuits
- Dorsolateral Prefrontal Cortex (DLPFC) and lateral prefrontal cortex (LPFC) implicated in computing long-term value and control in timed choices
- Ballard et al. (2009) findings on brain regions during time-based value tasks:
- Dorsolateral prefrontal cortex (DLPFC) and inferior frontal gyrus (IFG) involvement in temporal valuation and cognitive control
- Summary of time/value coding:
- Reward magnitude: NAcc
- Loss magnitude: AIns
- Probability: MPFC
- Time/Delay: LPFC and DLPFC involvement in delayed rewards
Integrating time, risk, and value: Ballard and others
- Time-focused valuation analyses (Ballard et al., 2009):
- Brain coordinates show activity in dorsolateral and lateral prefrontal regions during timing and value integration
- Equations/symbols from figures show intent to map temporal aspects to specific cortical loci (e.g., y = 55 in one schematic for dorsolateral PFC activation; general idea of spatial mapping of time/value signals)
- In short: the brain integrates magnitude, probability, and delay across distributed circuits to compute net value and guide choice.
Risk and skew in real-world decisions
- Real-world risk preferences:
- Humans tend to like positively-skewed risks (e.g., lotteries, casino games) more than symmetric or negatively skewed risks
- Traditional finance models often do not account for skewness in risk preference
- Question: can neuroscience illuminate why people prefer asymmetric risks?
- Kraus & Litzenberger (1976) reference to risk in finance as a backdrop for neuroeconomic inquiry
Risky gambling task and neural correlates
- Gambling paradigm (Wu, Bossaerts, Knutson et al. context):
- Conditions comparing Positive-skew vs Symmetric vs Negative-skew gambles and a No Gamble option
- Measured outcomes include gamble rate and NAcc activation patterns over time (seconds)
- Key findings:
- Positive-skew gambles elicit stronger NAcc activity compared to symmetric/negative-skew gambles
- Neural responses align with behavioral tendency to engage more with positive-skew risks
- Graph interpretation notes:
- Time-series of Signal Change in NAcc shows peak responses following presentation of skewed gambles
- Statistical markers indicate significance for the positive-skew condition in relation to risk-taking behavior
Neuroanatomical and functional connections in gambling behavior
- Anterior Insula–NAcc structural connectivity:
- Structural connections between AIns and NAcc relate to gambling propensity
- Stronger AIns–NAcc white-matter connectivity associates with not engaging in gambling
- Regression-style coefficients (illustrative from slides):
- c = -0.40, c’ = -0.24, a = -0.35, b = 0.46**
- Interpretation: certain pathways moderate the relationship between affective signals (insula) and reward processing (NAcc) in predicting gambling behavior
Takeaway points: Integrated view of choice
- Core coding in different regions:
- Reward magnitude (gain) → NAcc
- Loss magnitude → AIns
- Reward probability → MPFC
- Time/delay → LPFC
- Risk processing involves interactions between NAcc and AIns, modulated by structural connectivity
- The brain’s choice architecture is distributed and interdependent; simple one-region explanations are insufficient.
- Structural connections (white matter) between key regions are an emerging frontier for linking brain networks to complex behavior (e.g., gambling risk-taking).
- While powerful, these neural signals are only part of the story; context, modelling assumptions, and ethical considerations matter in applying neuroscience to real-world decisions.
- Recap: Recap assignment 3 discusses NeuroSynth as a platform for linking FMRI activity to keywords across thousands of studies; dynamic meta-analyses can search by term or brain area to map terms to brain activity and vice versa.
- Week 3 readings (reverse inference) explain why we cannot infer a specific cognitive process solely from brain area activation; multiple processes can co-activate the same region.
- Week 3 readings warn against neuro hype and discuss what neuroimaging actually measures (and when it should be used), with examples where neuroimaging may be unnecessary or misleading.
Plasticity: the brain’s capacity to change across scales
- Core idea: Plasticity refers to brain changes due to experiences with the environment across spatial and temporal scales.
- Types of plasticity:
- Grey matter plasticity
- White matter plasticity
- Synaptic plasticity
- Grey matter plasticity:
- Involves growth and organization of neurons in cortical columns and layers
- Key historical examples: monocular deprivation effects on primary visual cortex (Hubel & Wiesel, 1964); structural changes observed by Maguire et al. (2000); Draganski (2004)
- White matter plasticity:
- Changes in myelination, axon diameter, and number of axons within a tract to speed and synchronize communication between brain areas
- White matter changes observed massively after birth; pruning during development (roughly ages 3–19, maybe until 25); ongoing debate about adult plasticity (Anguera et al., 2013; Sagi et al., 2012)
- Synaptic plasticity:
- Strengthening or creation of synapses between neurons (Hebbian-like principles: “cells that fire together wire together”)
- Core mechanisms involve NMDA receptors as coincidence detectors, calcium influx, and downstream signaling that strengthens synapses
Action potentials and synaptic mechanisms: foundations recap
- Action potential basics (review slides):
- Glutamate binds to liganded Na+ channels → excitatory postsynaptic potential (EPSP)
- EPSP leads to depolarization; reaches threshold to open voltage-gated Na+ and K+ channels, triggering an action potential
- Classic scale: resting potential ≈ $V{rest} \approx -70\,\text{mV}$; threshold ≈ $V{th} \approx -55\,\text{mV}$
- Presynaptic events:
- Action potential arrives at axon terminal; voltage-gated Ca^{2+} channels open
- Vesicles release neurotransmitter into synaptic cleft
- Neurotransmitter binds to postsynaptic receptors to propagate the signal
- NMDA receptor and synaptic plasticity:
- NMDA receptors act as a coincidence detector: require both glutamate binding and postsynaptic depolarization (to relieve Mg^{2+} block) and Ca^{2+} influx
- When Glutamate and Ca^{2+} coincide, NMDA receptors activate and trigger signaling cascades that induce synaptic changes
- Result: LTP (Long-Term Potentiation) – increases the efficacy of future synaptic transmission; foundational mechanism for learning, memory, and addictions (to be expanded next week)
- Mechanistic summary for LTP (condensed):
- If Glutamate and Ca^{2+} are present together at the synapse → NMDA receptor activation → intracellular signaling → structural and functional synaptic strengthening → greater future postsynaptic responses to the same presynaptic input
Spatial scales: from nanometers to meters
- Spatial scale sense:
- Grey matter and white matter operate at micrometer-to-millimeter scales (roughly 10^-6 to 10^-3 meters)
- Synaptic changes occur at nanometer scales (roughly 10^-9 meters)
- Visualizing scale progression:
- Classic scale demonstrations (e.g., scale models and animations) help contextualize how micro-level synaptic changes underpin macro-level cognitive functions
- Practical thought: many brain changes underlying learning are multiplexed across these scales and over different timescales (development, adulthood, aging)
Quiz and practice resources
- Quiz 2 (graded for completion) on plasticity: https://fullstackneuro.io/biopsyc or QR code provided in slides
LaTeX and key numerical details (quick reference)
- Resting potential: Vrest≈−70 mV
- Action potential threshold: Vth≈−55 mV
- MID task conditions (gain/loss/neutral): gains of +5.00, losses of −5.00, neutral +0.00; probabilities often set as P \in \{0.80, 0.50, 0.20} for different cue conditions
- Reward magnitude coding locations: nucleus accumbens (NAcc) / gain; anterior insula (AIns) / loss; probability coding in MPFC; delayed reward coding in LPFC
- Example correlation: Berns & Moore (2012) reported a correlation of approximately R=0.32 between NAcc activation and quantity of albums sold
- Positive-skew gambling results: indicates greater NAcc activation for positive-skew gambles relative to other skew types; structural connectivity (AIns–NAcc) relates to gambling behavior
- NAcc activation effects in streaming/music studies: r ≈ 0.46 in reported data (Tong et al., 2020) for the relationship between NAcc signal and view frequency
Connections to theory and real-world relevance
- The MID task and related fMRI findings anchor neuroeconomic theories about how people compute expected value and risk during decision making
- The endowment effect links affective neural signals (AIns) to valuation shifts when ownership changes
- Structural connectivity between affective (AIns) and reward (NAcc) regions helps explain individual differences in risk-taking, gambling, and addiction vulnerability
- Cross-domain studies (music, movies, goods) illustrate the generalizability of reward system signals for predicting real-world choices
- Critical caution: neuroimaging results should be interpreted within the limits of reverse inference and methodological constraints; meta-analytic tools (e.g., NeuroSynth) enable more robust connections between brain activity and cognitive terms, but do not provide clean one-to-one mappings
Notes on practical and ethical implications
- Real-time brain signals offer potential applications in marketing, clinical interventions, education, and personalized decision support
- Ethical considerations include privacy, autonomy, and potential manipulation concerns when decoding or influencing preferences
- Neuroscience should complement, not replace, behavioural data and theoretical models when informing policy or consumer design