Comprehensive Study Notes: Classical Conditioning, Operant Conditioning, and Cognitive Social Theory (with Applications and Key Terms)
Learning and Overview
- Learning defined as an enduring change in the way an organism responds based on experience. It lasts over time and is necessary for survival in a changing environment.
- Learning is inferred from behavior because it cannot be observed directly; we infer learning from subsequent behavior and its observable consequences.
- Behavior after learning may not occur immediately; there can be latent potential that appears later.
- A reflex is an automatically elicited behavior by an environmental stimulus (e.g., blinking when something approaches).
- Habituation is a reduction in response strength of a reflex with repeated presentation of the stimulus.
- Learning involves experience and results in an altered behavior; strict behaviorism emphasizes conditioning as the mechanism, with learning occurring under classical or operant conditioning.
- Important cautions: multiple influences on behavior (e.g., motivation) can affect whether a learned behavior is expressed.
- The focus of this module: classical conditioning, operant conditioning, and cognitive social theory.
Classical Conditioning
- Core idea: learning occurs from a new association between two previously unrelated stimuli; behavior is predicted by a preceding stimulus (cue/antecedent).
- Responses in classical conditioning are reflexive or autonomic, elicited by environmental stimuli.
- Key terms:
- Unconditioned stimulus (UCS): naturally elicits a reflexive response. Example: food.
- Unconditioned response (UCR): natural response to the UCS. Example: salivation to food.
- Neutral stimulus (NS): does not normally elicit the UCR. Example: bell/tonal sound.
- Conditioned stimulus (CS): originally neutral stimulus that, after pairing with the UCS, elicits a conditioned response. Example: bell after conditioning.
- Conditioned response (CR): learned response to the CS. Example: salivation to the bell.
- Acquisition: process of forming the new association between UCS and NS; after multiple pairings, the NS becomes the CS and elicits the CR.
- Example: With dogs, after repeated pairings of bell (CS) with food (UCS), the bell alone produces salivation (CR).
- Typical progression: initially no salivation to bell; after several pairings, the bell evokes salivation; by trial ext{5}^{ ext{th}} or ext{6}^{ ext{th}} trial, CR is firmly established.
- Example of conditioning: daily life scenario where a neutral stimulus (e.g., a fast-food logo) becomes associated with a response (salivation/anticipation of food).
- Taste aversion: a learned aversion to a taste associated with illness; can occur after a single trial (one-trial learning).
- Example: bad prawns → nausea after red wine; later, red wine alone (CS) evokes nausea (CR).
- In chemotherapy patients, nausea from a treatment can become associated with foods eaten nearby, leading to weight loss; hospitals may use a novel pretreatment-taste (e.g., peculiar ice cream) to create a new CS.
- Emotional conditioning: conditioned emotional responses (e.g., fear, happiness) to initially neutral stimuli paired with emotional events.
- Little Albert (Watson): classical conditioning leading to fear of white rat after pairing with a loud noise; illustrates conditioned fear and potential basis for simple phobias. Note: ethically problematic; used to illustrate foundational ideas.
- Applications and consequences:
- Phobias, conditioned fears (e.g., needles, injections) can be explained via classical conditioning.
- Conditioning of nausea during chemotherapy can lead to aversions to foods associated with treatment.
- Trauma can produce conditioned fear responses to stimuli associated with the trauma (e.g., aftershave, smells).
- Generalization and discrimination:
- Stimulus generalization: after learning, similar stimuli elicit a similar response (e.g., Albert’s fear spreading to other white objects).
- Stimulus discrimination: learning to respond only to the specific conditioned stimulus used during training (e.g., dogs respond to a bell but not other sounds).
- Extinction: conditioned response weakens when the CS is presented without the UCS over time; not unlearning, but inhibition of responding.
- Spontaneous recovery: after a rest period, the conditioned response may briefly return.
- Renewal effect: extinction in a different environment may be reversed if the organism returns to the original acquisition context.
- Factors affecting classical conditioning:
- Interstimulus interval (time between CS and UCS): optimal conditioning typically requires a brief interval (usually a few seconds or less).
- Temporal order: forward conditioning (CS precedes UCS) yields stronger learning than simultaneous or backward conditioning.
- Preparedness: some associations are easier to learn due to biological constraints; some species are biologically prepared to associate certain stimuli with specific responses (e.g., taste with nausea, shock with visual/auditory cues).
- Prepared learning: biological readiness to learn certain associations more readily due to evolution and natural selection; this eases conditioning for survival-related associations.
- Classical conditioning in practice: therapies to modify behavior, such as aversion therapy, flooding, systematic desensitization.
- Aversion therapy: pair an unpleasant stimulus with an undesirable behavior (e.g., smoking, overeating).
- Flooding: rapid, intense exposure to fear-provoking stimuli with no escape; used for phobias, PTSD, OCD.
- Systematic desensitization: pair CS with relaxation while gradually increasing exposure to the phobic CS.
- Antabuse (disulfiram): medication for alcohol dependence that produces sickness if alcohol is consumed, conditioning the aversive response to alcohol.
- Notable missteps and caveats:
- Conditioning does not occur for every CS-UCS pairing; timing, readiness, and prior learning history matter.
- Extinction is context-sensitive; recovery can occur when context/situation changes.
- Summary questions (practice):
- Q1: Behaviors elicited automatically by a stimulus are referred to as reflexes. Answer: ext{b}
- Q2: In classical conditioning, unconditioned refers to something reflexive; conditioned refers to something learned. Answer: ext{c}
- Q3: Extinction in classical conditioning results from the omission of the UCS. Answer: ext{c}
Operant Conditioning
- Distinction from classical conditioning: classical conditioning involves environment triggering a response; operant conditioning involves a behavior/operant producing consequences.
- Operants are behaviors emitted (not elicited by a stimulus) and are strengthened or weakened by consequences.
- Thorndike and the Law of Effect:
- An animal’s tendency to reproduce a behavior depends on the behavior’s effect on the environment and its consequences.
- Behavior is controlled by its consequences.
- Skinner’s contribution: operant conditioning using the Skinner box; reinforcement and punishment as central concepts.
- Reinforcement and punishment:
- Reinforcement increases the likelihood of a behavior occurring again.
- Punishment decreases the likelihood of a behavior occurring again.
- Consequences must be contingent (timely) to the behavior for conditioning to occur.
- Types of reinforcement:
- Positive reinforcement: addition of a pleasant stimulus after a behavior to increase its likelihood. Examples: pay, praise, rewards, good grades.
- Negative reinforcement: removal of an aversive stimulus after a behavior to increase its likelihood. Examples: removing cold to wear gloves, removing pain via analgesia.
- Negative reinforcement includes escape learning (removing an aversive state that already exists) and avoidance learning (preemptively avoiding an aversive event).
- Punishment types:
- Positive punishment: administration of an aversive stimulus after a behavior reduces its likelihood. Example: scolding.
- Negative punishment: removal of a pleasant stimulus after a behavior reduces its likelihood. Example: losing Internet privileges after curfew.
- Applications in behavior therapy:
- Use the least aversive means possible; positive reinforcement is often the most powerful change agent.
- Punishment tends to suppress undesired responses but does not teach desirable alternatives and may have negative side effects (e.g., fear, aggression, misuse).
- Problems with punishment:
- Learner may learn what not to do but not what to do; punishment can create fear of the punisher (conditioning other associations).
- Punitive methods can be misapplied or abusive; may not override existing rewards for undesired behaviors.
- How to improve punishment effectiveness:
- Apply swiftly after the undesired behavior; ensure the punishment is appropriate in magnitude.
- Pair punishment with training in alternative acceptable behaviors.
- Use non-corporal approaches where possible (e.g., time-out, removal of privileges).
- Extinction in operant conditioning: withholding reinforcement for a previously reinforced behavior leads to a decrease in that behavior.
- Factors affecting ease of extinction: strength of original learning, variety of settings, and reinforcement schedules used.
- Schedules of reinforcement and their effects:
- Continuous reinforcement: reinforcement after every correct response; quick learning, but rapid extinction when reinforcement stops.
- Ratio schedules:
- Fixed ratio (FR): reinforcement after a fixed number of responses (e.g., every 5th response). Pattern: high response rate with abrupt pauses after reinforcement.
- Variable ratio (VR): reinforcement after an unpredictable number of responses; produces high, steady responding; most resistant to extinction and common in daily life. Example: VR-5 averages one reward every 5 responses, but the exact number varies.
- Interval schedules:
- Fixed interval (FI): reinforcement after a fixed time interval, regardless of responses; leads to a scalloped response pattern with a pause after reinforcement.
- Variable interval (VI): reinforcement after varying time intervals; produces steady, moderate responding with high consistency.
- Applications and therapy:
- Operant conditioning principles underpin behavior modification therapies, such as shaping, token economies, remedial education, and autism therapies.
- Example: thumb-sucking, tantrums, quitting smoking addressed via reinforcement/punishment.
- Multi-choice practice recap:
- Highest and most consistent rate of response: Variable ratio schedule. Answer: ext{b}
- Extinction in operant conditioning occurs when the operant is not followed by the consequence previously associated with it. Answer: ext{b}
- When using punishment, it is most effective to reinforce alternative acceptable behaviors. Answer: ext{d}
Cognitive and Social Learning Theories
- Social learning (observational learning): learning occurs via social interaction and observation of others (models); imitation and vicarious learning can occur for both classical and operant conditioning.
- Bandura’s observational learning: learning can occur by watching a model’s behavior and its consequences (vicarious conditioning).
- Factors influencing imitation: prestige/likability of the model, and whether the model was rewarded or punished for the observed behavior (vicarious reinforcement/punishment).
- Bobo doll experiments (Bandura et al.): children exposed to aggressive models were more likely to display aggression toward the doll, especially if the model was rewarded for the aggression.
- Observational learning beyond imitation: learners can acquire knowledge by observing the outcomes and strategies, not only the actions.
- Interaction with cognitive processes:
- Cognitive social theory emphasizes internal mental processes: expected outcomes, mental images, and expectancies influence behavior alongside environmental contingencies.
- Tolman introduced latent learning: learning can occur without reinforcement and become evident when reinforcement is later introduced.
- Maze experiments with rats showed that rats without reinforcement still formed cognitive maps and navigated efficiently once reinforced, indicating learning prior to reinforcement.
- A cognitive map is a mental representation of the environment.
- Rotter’s locus of control: expectancy about whether outcomes are determined by internal actions or external forces.
- Internal locus of control: belief that one’s own actions determine outcomes (e.g., promotions earned through effort).
- External locus of control: belief that outcomes are due to luck or fate (external forces).
- Explanatory style and coping: how people explain bad events (internal vs external) affects depression risk; some individuals maintain positive coping styles and are less prone to learned helplessness.
- Seligman and learned helplessness:
- In experiments, dogs exposed to uncontrollable shocks failed to escape later even when escape was possible, showing learned helplessness and depressive-like states.
- Learned helplessness has been linked to human depression; some people display positive coping and resilience.
- Ethical note: Seligman’s dog studies are ethically controversial by today’s standards.
- Applications and implications:
- Observational learning underlies a wide range of social behaviors, attitudes, and skills, including phobic treatment (watching a therapist handle a snake calmly).
- Latent learning and cognitive maps explain how people/animals navigate environments and solve problems even without immediate reinforcement.
- Locus of control and explanatory style influence motivation, persistence, academic achievement, and mental health.
- Summary questions (practice):
- Q1 (vicarious conditioning): learning consequences of actions by watching others’ results. Answer: ext{a}
- Q2 (latent learning): learning tasks without reinforcement is latent learning. Answer: ext{c}
- Q3 (internal locus of control): a raise attributed to own effort indicates an internal locus of control. Answer: ext{b}
Connections, Implications, and Real-World Relevance
- Ethical, philosophical, and practical implications:
- Classical conditioning theories illuminate how emotions and fears can be conditioned, leading to phobias or taste aversions; caution about using such approaches in real life and considering patients’ rights and wellbeing.
- Conditioning-based therapies (aversion therapy, flooding, systematic desensitization) require careful ethical oversight and individualized planning.
- Operant conditioning highlights the power of reinforcement in shaping behavior but warns against over-reliance on punishment; emphasizes consistency and clarity in teaching new behaviors.
- Observational learning explains the broad influence of social context, role models, and media on behavior, attitudes, and skills.
- Real-world relevance:
- Education: reinforcement schedules can optimize study habits and behavior; token economies in classrooms.
- Workplace: reinforcement/punishment schedules influence productivity and compliance; avoidance of over-reliance on punishment.
- Therapy: behavioral therapies, cognitive behavioral approaches, and exposure-based treatments integrate conditioning principles with cognitive strategies.
- Public health: understanding conditioned taste aversions can inform management of chemotherapy side effects and nutritional interventions.
Key Terms and Concepts (Glossary)
- Learning, conditioning, reflex, habituation, UCS, UCR, NS, CS, CR, acquisition, extinction, spontaneous recovery, renewal, generalization, discrimination, preparedness, prepared learning, aversion therapy, flooding, systematic desensitization, Antabuse, operant conditioning, reinforcement, punishment, positive/negative reinforcement, positive/negative punishment, contingency, extinction in operant conditioning, partial reinforcement, fixed/variable ratio, fixed/variable interval, acquisition curves, shaping, token economies, observational learning, modeling, vicarious conditioning, Bobo doll, latent learning, cognitive map, locus of control, internal vs external, explanatory style, learned helplessness, Seligman.
Practice recap (quick questions)
- In vicarious conditioning, learning occurs by watching the consequences of someone else’s actions. Answer: ext{a}
- Latent learning describes learning that is not immediately expressed in behavior. Answer: ext{c}
- An internal locus of control means you believe your actions determine outcomes. Answer: ext{b}