Operant Conditioning

In previous lectures, two types of associative learning were introduced: classical conditioning and operant conditioning. This lecture focuses on the second type — operant conditioning.

Operant conditioning involves associating a behavior with a reward or punishment.
The consequences of actions determine the likelihood of them being repeated.

Classical conditioning is a passive and automatic process where individuals observe their environment to develop predictive relationships between events.
- Example: Pavlov's dogs learned to associate the sound of a bell with the presentation of food, leading to a reflexive response of salivation. Salivation is reflexive and outside the control of the dog.
Operant conditioning, in contrast, involves more willful behavior with choices.
- Motivation is crucial, as individuals learn that their behavior has consequences (both positive and negative).
- Operant conditioning can give learners the ability to alter their behavior to pursue positive consequences and avoid negative ones.

Proposed by Edward Thorndike, the Law of Effect states:
- "Responses that produce a satisfying state of affairs become more likely to be repeated."
- Learners tend to repeat behaviors that result in pleasure and avoid behaviors that result in pain.

Reinforcements and punishments interact differently with behavior.
- Reinforcement strengthens behavior (increases frequency), while punishment weakens behavior (decreases frequency).
- Example: Receiving a paycheck reinforces attendance at work, while a speeding ticket serves as punishment to deter future speeding.

Four types:
- Positive Reinforcement: Presentation of a pleasant stimulus to increase behavior.
- Negative Reinforcement: Removal of an unpleasant stimulus to increase behavior.
- Positive Punishment: Presentation of an unpleasant stimulus to decrease behavior.
- Negative Punishment: Removal of a pleasant stimulus to decrease behavior.

In operant conditioning:
- A behavior is followed by a consequence.
- The goal is to reinforce or punish that behavior to alter its frequency.

Shaping is a method used to teach complex behaviors through successive approximations.
Reinforce closer and closer steps toward the ultimate desired behavior.
- Commonly used in training scenarios (e.g., dog training).
- Example: Training a dog to close a door involves breaking the desired action into small, manageable steps.

Extinction occurs in operant conditioning when a previously reinforced behavior is no longer followed by its consequence.
Causes frustration in the learner due to unmet expectations of rewards.
An extinction burst may occur, which is a temporary increase in the frequency or intensity of the behavior when reinforcement stops.
Over time, if the behavior continues without reinforcement, it will gradually diminish.

Varying reinforcement schedules impact patterns of learning.
- Reinforcement can be fixed or variable; and delivered based on intervals or ratios.

Fixed Interval (FI): Reward given after a fixed amount of time.
- Example: Checking the mail.
Variable Interval (VI): Reward given after varying time intervals.
- Example: Checking email.
Fixed Ratio (FR): Reward after a set number of responses.
- Example: Gaining extra lives in video games after collecting 100 coins.
Variable Ratio (VR): Reward after a varying number of responses.
- Example: Playing slot machines.

Variable ratio schedules are the most effective due to the partial reinforcement effect, where behaviors persist longer without reinforcement.

ABA is a therapeutic strategy using learning principles to amend behavioral problems.
- Often used with patients exhibiting behavioral excesses or absences, particularly in treatment for Autism Spectrum Disorders (ASD).
Example: Reinforcing eye contact in individuals with ASD to facilitate social interactions.

The role of conditioning in the development and maintenance of substance use disorders is crucial.
- Positive reinforcement: Using drugs for pleasurable effects.
- Negative reinforcement: Using drugs to alleviate discomfort from anxiety or peer pressure.
- Withdrawal symptoms often compel continued use due to negative reinforcement dynamics.
Treatment strategies often involve breaking conditioned associations related to drug use through exposure therapies.

Addiction treatment requires ongoing support, addressing both classical and operant conditioning principles.
Recognizing that addiction issues are not limited to weakness can shift societal perceptions and encourage proper long-term support for those struggling with substance abuse disorders.