Learning

How do we learn?

Learning - the process of acquiring new and relatively enduring information or behaviors.

Information must stick around (no cramming for test)

We expect and prepare for certain events (arrival of food) - classical conditioning

Learn to repeat actions that bring good outcomes - Operant Conditioning

Learn new behaviors by observing people and events - Cognitive Learning

But how do we learn?

Humans learn from experience
Nature’s most important gift is humans ability to adapt

Association we learn through habits. Habits form when we repeat behavior in a given context. As we repeat this behavior, the next time we encounter that context, we will evoke our habit. Behavior turns into habits after 66 days.

Seals will repeat behavior (clapping) once it learns that doing so will get it treats

Associative Learning - learning that certain events occur together. The events may be two stimuli (as in classical conditioning) or a response and its consequence (as in operant conditioning).

since we know after lightning comes the boom of thunder, we start to brace ourselves just from the lightning, not the boom.
- classical conditioning

Respondent Behavior - behavior that occurs as an automatic response to some stimulus. (bracing ourselves from the lightning)

Operant Behavior - behavior that operates on the environment, producing a consequence.

Associate our action with a consequence. We then repeat the action in order to achieve good results (or avoid bad ones)
Kid learns that being polite means you are more likely to get what you want

Cognitive Learning - the acquisition of mental information, whether by observing events, by watching others, or through language.

Lets humans learn though other’s experience

Classical Conditioning

Classical Conditioning - a type of learning in which we link two or more stimuli and anticipate events.

Explored by Ivan Pavlov

Said psychology was a science and should be based in experiments

Pavlov’s Experiments

Studied Dog’s digestive system and then later on, learning

Noticed that when he put food on the dog’s tray, their mouths would begin to salivate

Not at the taste of the food, but at the sight of it

Neutral Stimulus - in classical conditioning, a stimulus that evokes no response before conditioning.

When you first see lightning, you don’t have a reaction (its neutral)

If these stimulus happened before the dog was given food, would the dog connect the stimulus and the food? Would the dog start to salivate?

Yes and yes!
A dog doesn’t learn to salivate in response to food in its mouth; food in the mouth automatically, unconditionally, triggers this response.
The dog salivating in response to a bell is a learned response

Unconditioned Response - in classical conditioning, an unlearned, naturally occurring response (such as salivation) to an unconditioned stimulus (US) (such as food in the mouth).

Natural response to a unconditioned stimulus

Unconditioned Stimulus - in classical conditioning, a stimulus that unconditionally — naturally and automatically — triggers a response (UR). The food for the dogs

When you first hear a boom of thunder, you automatically respond

Conditioned Response - in classical conditioning, a learned response to a previously neutral (but now conditioned) stimulus (CS).

The dog salivating once it heard the bell
You connect the lightning and the boom of thunder, and you prepare for the thunder

Conditioned Stimulus - In classical conditioning, an originally neutral stimulus that, after association with an unconditioned stimulus (US), comes to trigger a conditioned response (CR).

You see lightning and know thunder will happen
Tuning Fork/bell with Pavlov’s Dogs

If Pavlov’s research was so simply, why did he continue to study it for 3 decades?

Five Stages of Conditioning Processes:

Acquisition
Extinction
Spontaneous Recovery
Generalization
Discrimination

Acquisition

Acquisition - in classical conditioning, the initial stage — when we link a neutral stimulus and an unconditioned stimulus so that the neutral stimulus begins triggering the conditioned response. (In operant conditioning, the strengthening of a reinforced response.)

The point when the dogs linked the bell and the salivating

how much time passes between the bell and the salivation (half a second)

What would happen if the food was first, then the bell?

the conditioning would not happen. The order must go stimulus then response

Present the neutral stimulus and unconditioned stimulus in order to connect them together

Strengthen the response

Extinction and Spontaneous Recovery

What would happen if the bell rang, but no food showed up?

CS occurred without no US

Extinction - in classical conditioning, the weakening of a conditioned response — when an unconditioned stimulus does not follow a conditioned stimulus. (In operant conditioning, the weakening of a response when it is no longer reinforced.)

Forgetting the connection between the two stimulus

Spontaneous Recovery - the reappearance, after a pause, of a weakened conditioned response.

The dog will learn faster the second time around

Generalization

A dog conditioned to the sound of a bell would also respond to the sound of another different bell

Called stimulus generalization

Generalization - in classical conditioning, the tendency, after conditioning, to respond similarly to stimuli that resemble the conditioned stimulus (CS). (In operant conditioning, when responses learned in one situation occur in other, similar situations.)

Can be adaptive:

Toddlers who fear cars can learn to fear trucks/motorcycles

Can Linger:

A journalist who was tortured in prison flinches whenever he sees black shoes (the first sight of his torturers in prison)

Discrimination

Dog’s learned to respond to the sound of one bell and not the sound of another

Discrimination - knowing the differences between one stimulus and another

Your iPhone knows what your face looks like from multiple different angles (generalization)
Your iPhone is able to differentiate your face from others (discrimination)

Pavlov’s Legacy

Most psychologists agree that classical conditioning is the primary form of learning

Many other responses to many other stimuli can be classically conditioned in many other creatures
Pavlov showed us how a process such as learning can be studied objectively
1. Used to show how psychology might progress when experimenting

Modern neuroscience has identified neural circuits that link a conditioned stimulus (warning signal) with an upcoming unconditioned stimulus (threat)

Responses to stimuli can be classically conditioned in many other species

Classical Conditioning in Everyday Life

Medical Treatments - cancer treatments can cause vomiting. Patients develop classically conditioned nausea to sights, sounds, and smells of cancer clinics

Drug Cravings - People who formerly misused drugs often feel a craving when they are again in the drug-using context, because their brain has become conditioned to associate that context with a drug’s reward. Drug counselors tell their patients to stay away of any people/places associated with drug usage

Food Cravings - Sweet foods create cravings

John B Watson said that Pavlov’s research could help us understand our emotions

Showed an infant a white rat (the kid was not afraid). The next times they showed the rat, a loud noise was made behind the infant’s head. This was repeated and eventually the infant started crying at just seeing the rat)
- Super unethical

Operant Conditioning

Operant Conditioning - a type of learning in which a behavior becomes more probable if followed by a reinforcer or less probable if followed by a punisher

Behavior that operates on the environment to produce rewarding or punishing events is called operant behavior.

Skinners Experiments

B.F Skinner was behaviorism’s influential and controversial person

His work built upon the research of Edward L. Thorndike’s Law of Effect

Law of Effect - Thorndike’s principle that behaviors followed by favorable consequences become more likely, and that behaviors followed by unfavorable consequences become less likely.

Skinner wanted to experiment that would reveal behavior control

Can we teach a pigeon how to bowl?
He shaped pigeons natural walking and pecking habits
Taught pigeons how to play ping-pong, and keep a missile on track

Operant Chamber - in operant conditioning research, a chamber (also known as a Skinner box) containing a bar or key that an animal can manipulate to obtain a food or water reinforcer; attached devices record the animal’s rate of bar pressing or key pecking

Reinforcement - in operant conditioning, any event that strengthens the behavior it follows.

Depends on the animal

Shaping Behavior

Shaping - an operant conditioning procedure in which reinforcers guide behavior closer and closer toward a desired behavior.

First observe the animals behavior in their natural habitat
Give rat the food whenever it gets close to the bar
Then give rat food the closer and closer it gets to the bar
finally require the rat to touch the bar to get food

Shaping allows for researchers to understand what nonverbal organisms can perceive

If we can shape them to respond to one stimulus and not to another, then we know they can perceive the difference.

Types of Reinforcers

Positive Reinforcement - increasing behaviors by presenting a pleasurable stimulus, such as food. A positive reinforcer is anything that, when presented after a response, strengthens the response.

Negative Reinforcement - Increasing behaviors by stopping or reducing an aversive stimulus, such as an electric shock. A negative reinforcer is anything that, when removed after a response, strengthens the response.

Negative reinforcement could lead to substance abuse (drinking to avoid stress)
Is not punishment, rather it removes a punishing stimulus

Operant Conditioning Term	Description	Examples
Positive Reinforcement	Add a desirable stimulus	Pet a dog that comes when you call it; pay someone for work done
Negative Reinforcement	Remove an adverse stimulus	Take painkillers to end pain; fasten seat belt to end loud beeping.

Primary and Conditioned Reinforcers

Primary Reinforcers - an event that is innately reinforcing, often by satisfying a biological need

Unlearned

Conditioned Reinforcers - an event that gains its reinforcing power through its link with a primary reinforcer.

If a rat learned that a light turned on means there is food, the rat would work to turn on the light
- The light became a conditioned reinforcer that is linked with food

Immediate and Delayed Reinforcers

Immediate reinforcers helps link the behavior with the reward

Giving the rat food as soon as it touches the bar
If you delay giving the rat food, it won’t connect the behavior with the reward
Immediate feedback means immediate learning

But unlike rats, Humans have the ability to respond to delayed reinforcers

learning to control our impulses in order to achieve more valued rewards is a big step toward maturity

Sometimes, small immediate pleasures are more attractive than big but delayed rewards

Scrolling through Instagram at night vs getting a good nights sleep

Reinforcement Schedules

Reinforcement Schedule - a pattern that defines how often a desired response will be reinforced

Continuous Reinforcement - reinforcing a desired response every time it occurs.

Learning occurs rapidly
Extinction happens rapidly as soon as the reinforcement stops

Partial (intermittent) Reinforcement - reinforcing a response only part of the time; results in slower acquisition but much greater resistance to extinction than does continuous reinforcement.

Slot machines only reward players occasionally and unpredictably, but they still play them

Fixed-Ratio Schedule - a reinforcement schedule that reinforces a response only after a specified number of responses

Cafes giving us a free coffee after 10 purchases

Variable-Ratio Schedule - a reinforcement schedule that reinforces a response after an unpredictable number of responses

You never know when you will win at gambling, so you keep doing it

Fixed-Interval Schedule - a reinforcement schedule that reinforces a response only after a specified time has elapsed.

People waiting for a letter check more often as the delivery date approaches
Creates a “stop-start” pattern rather than a steady response pattern

Variable-Interval Schedules - a reinforcement schedule that reinforces a response at unpredictable time intervals

At unpredictable times, a food pellet rewarded Skinner’s pigeons for persistence in pecking a key
Produces steady response patterns

	Fixed	Variable
Ratio	Every so many: reinforcement after every nth behavior, such as a “buy 10 coffees, get 1 free” offer, or paying workers per product units produced	After an unpredictable number: reinforcement after a random number of behaviors, as when playing slot machines or fishing
Interval	Every so often: reinforcement for behavior after a fixed time, such as Tuesday discount prices	Unpredictably often: reinforcement for behavior after a random amount of time, as when checking for a social media update

Response rates are higher when reinforcement is linked to the number of responses (a ratio schedule) rather than to time (an interval schedule)

responding is more consistent when reinforcement is unpredictable (a variable schedule) than when it is predictable (a fixed schedule)

Punishment

Positive Punishment - an event that decreases a behavior by administering a negative stimulus.

Negative Punishment - an event that decreases a behavior by removing a rewarding stimulus.

Type of Punisher	Description	Example
Positive Punishment	administer an aversive stimulus	Spray water on a barking dog; give a traffic ticket for speeding
Negative Punishment	Withdraw a rewarding stimulus	Take away a misbehaving teen’s driving privileges; Put a kid in time out

Criminal behavior is influenced not by the threat of severe punishments but by swift punishments

Should adults punish kids physically? The answer is no:

Punished behavior is suppressed, not forgotten. This temporary state may (negatively) reinforce parents’ punishing behavior.
Physical punishment does not replace the unwanted behavior.
Punishment teaches discrimination among situations.
Punishment can teach fear.
Physical punishment may increase aggression by modeling violence as a way to cope with problems.

How should adults punish their kids → Time outs from positive reinforcement

You hit Emma? No more Dora the explorer!

Punishment tells you what not to do; reinforcement tells you what to do.

Skinners Legacy

Skinner believed that external influences (not internal thoughts and feelings) shape behavior.

Wanted people to use operant conditionings to influence other people’s behavior

Critics said that he dehumanized people by neglecting their personal freedom and controlling their actions.

Skinner’s response: External consequences already control people’s behavior. So why not steer those consequences toward human betterment?

Operant Conditioning in Everyday Life

At school:

skinner wanted “machines and textbooks” to give immediate reinforcements of correct answers

“Students must be told immediately whether what they do is right or wrong and, when right, they must be directed to the step to be taken next.”

At work and in sports:

It is better to make reinforcement immediate

Better to reward specific, achievable behaviors

Managers can praise workers for good work

In Video Games:

Game developers use reinforcement principles to create computer programs that mimic human learning.

In Parenting:

If parents give into whining and complaining, they reinforce that behavior. If the adult yells and that makes the kid comply it will lead to a strained relationship.

Notice people doing something right and affirm them for it. Give children attention and other reinforcers when they are behaving well

In Changing your own Behavior:

State a realistic goal in measurable terms and announce it.
Decide how, when, and where you will work toward your goal.
Monitor how often you engage in your desired behavior.
Reinforce the desired behavior.
Reduce the rewards gradually.

Contrasting Classical and Operant Conditioning

Biology, Cognition, and Learning

Biological Limits on Conditioning

Charles Darwin proposed Natural Selection aid in survival

There are limits on classical conditioning

Humans are biologically prepared to learn some things, rather than others

Species are predisposed to learn things needed in its survival

Biological Limits on Classical Conditioning

Rats avoid certain tastes that are linked with rat poison

not the sights or sounds that accompanied the rat poison
- Taste aversion
In one study, wolves were fed sheep carcass filled with a sickening drug. The wolves would avoid sheep meat and sheep all together

Humans are biologically prepared to learn some certain things rather than others

our ancestors learned to avoid food that made us sick

Biological Limits on Operant Conditioning

It is easier to learn and retain behavior that reflects our biological and psychological predispositions

You can use food to make a hamster dig (they are used to digging for food)
You cannot use food to make a hamster wash its face (food is not connected to this type of behavior

Cognitive Influences on Conditioning

Cognition and Classical Conditioning

Behaviorism - the view that psychology (1) should be an objective science that (2) studies behavior without reference to mental processes. Most psychologists today agree with (1) but not with (2).

John B Watson said “Its theoretical goal is the prediction and control of behavior.”"

Underestimated two influences:

is the way that biological predispositions limit our learning
is the effect of our cognitive processes — our thoughts, perceptions, and expectations — on learning.

Early Behavioralists thought that dogs and rats’ learned behavior were mindless mechanisms

no need to learn cognition

Animals can learn predictability

If a shock is always preceded by a tone, a rat will learn to react with fear to the tone, because the tone reliably predicts the shock. What would happen if a tone always comes before the shock, but a light sometimes accompanies the tone? A rat will react with fear to the tone, but not to the light.

Tone is a better predictor

People being treated for alcoholism are given alcoholic drinks with a vomit-inducing drug. But if they know that it is the drug making them sick, not the alcohol, it weakens the association with drinking and getting sick

Cognition and Operant Conditioning

Skinner resisted the idea that cognition was needed in psychology

he died 8 days later

Cognitive Map - a mental image of the layout of one’s environment.

Rats running around a maze are able to memorize it

Latent Learning - learning that is not apparent until there is an incentive to demonstrate it.

kids who have their parents dive them to school are able to walk to school later

Excessive rewards can destroy intrinsic motivation

People who focus on their work’s meaning and significance do better work and earn extrinsic rewards

	Classical Conditioning	Operant Conditioning
Biological Influences	Biological tendencies limit the types of stimuli and responses that can easily be associated. Involuntary, automatic.	Animals most easily learn behaviors similar to their natural behaviors; associations that are not naturally adaptive are not easily learned.
Cognitive Influences	Thoughts, perceptions, and expectations can weaken or strengthen the association between the CS and the US.	Animals may develop an expectation that a response will be reinforced or punished; latent learning may occur without reinforcement.

Learning By Observation

Observational Learning - learning by observing others.

You cannot learn how to be a good basketball player simply by watching LeBron James

Modeling - the process of observing and imitating a specific behavior.

the physical practice of basketball

Kids who watch adults beat up a blow up clown are then more likely to beat up a blow up clown

By observing models, we vicariously (in our imagination) experience what they are experiencing.

Mirrors and Imitations in the Brain

A monkey had electrodes on its head that would allow researchers to see activity in the frontal lobe (planning and acting out movements)

when the monkey put peanuts in its mouth, the machine would buzz because there would be activity
One day the monkey saw a scientist eating an ice cream cone, and the machine buzzed
- as if the monkey itself was making that motion

Mirror Neurons - neurons that some scientists believe fire when we perform certain actions and when we observe others performing those actions; a neural basis for imitation and observational learning.

Humans mirror catchphrases, ceremonies, food, traditions, morals, fads, by copying one another

Emotions are contagious. We unconsciously mimic people’s posture, faces, voices, writing styles and we understand what the person is feeling

When a loved one is in pain, our brains light up the same way as theirs

Observational Learning in Everyday Life

People look, they listen, they mentally imitate, and they learn

Pro-Social Effects

Prosocial Behavior - positive, constructive, helpful behavior. The opposite of antisocial behavior.

After observing someone helping people, we become more helpful subconsciously

Kids observe and replicate their parent’s behavior

If parents have moral behavior, their kids are more likely to be moral
Kids are more likely to be religious if their parents practice with them

Anti-Social Effects

Anti-social Behavior - negative, destructive, harmful behavior. The opposite of prosocial behavior

Can explain why abusive parents have abusive kids

Aggressiveness can have a genetic link, and it can be environmental

If people are constantly hearing hate speech, they are desensitized to hateful language

TV shows can promote violent behavior, making people desensitized