learning (T5)
- [[a lasting change in behavior/mental process as result of experience[[
- cannot be explained through:
- natural reactions
- development
- surgery/medical change
- drugs
- ivan pavlov
classical conditioning: learned association
- [[based on new associations[[
- @@unconditioned stimulus (UCS):@@
- event that leads to a response in the first place
- Unlearned stimulus
- naturally elicits a response
- @@unconditioned response (UCR):@@
- how the person is responding
- unlearned response
- @@conditioned stimulus (CS)@@:
- what is leading to the response now
- learned stimulus, but starts neutral
- learned stimulus based on association w/ unconditioned stimulus
- @@conditioned response (CR)@@:
- same as unconditioned response
- learned response
- @@acquisition:@@
- when learning has occurred
- @@generalization:@@
- responds to similar stimuli
- ex: can → ran, man…
- @@distinction:@@
- only respond to specific stimuli
- [[example: you were bit by a dog as a child[[
- UCS- bite
- UCR- fear
- CS- dog
- CR- fear
- john b. watson: father of behaviorism
- everything you are is related to the environment you grew up in
- nurture
- @@extinction@@:
- when a conditioned stimulus goes away
- @@expectancy:@@ adds cognitive part to classical conditioning
- rescorla & wagner: revised pavlonian theory
- said animals learn when something is unexpected
- @@taste aversion:@@
- we learn to avoid specific food
- @@blocking:@@
- when a previous CS
- @@second/higher order conditioning@@
- when new cs is paired w/ previous CS
- @@aversion therapy@@
- patient gives up undesireable habit because it becomes associated w/ unpleasant stimulus
- @@discrimination in classical conditioning:@@
- Individual learns to differentiate among similar stimuli
- ex: Pavlov's dog learns the difference between the sound of the 2 bells and no longer salivates at the sound of the non-food bell.
- @@extinction:@@
- the learned behvaior eventually stops
Operant conditioning:
- [[connecting behaviors w/ consequences[[
- edward thorndike: did experiment with cats in boxes
- law of effect:
- an organism is likely to repeat behaviors w/ good consequences
- BF skinner
- focuses on subject actively operating on enviornment
- ways to increase a behavior:
- positive reinforcement
- ex: offering money for a task
- negative reinforcement
- ex: taking asprin to relieve a headache
- escape learning
- ex: you go to nurse to avoid taking a test
- avoidance learning
- engaging in something to avoid a consequence
- ways to decrease behavior:
- positive punishment
- adding something
- negative punishment
- taking away something
<<situation → behavior → consequence → reinforcement = learning<<
<<→ punishment = no learning<<
reinforcement/punishment
go:
- {{Positive reinforcement: An event or thing that is presented after a response that makes
it more likely that the response will be repeated{{ - {{Negative reinforcement: An event or thing that, when removed after a response, makes it more likely the response will be repeated, positive stimulus is removed{{
stop:
- ]]Positive punishment: An undesirable event that is presented after a response that
makes it less likely the response will be repeated; sometimes simply called
punishment]] - ]]Negative punishment: A desirable event that is removed after a response, making it
less likely the response will be repeated; sometimes called omission training]]
examples:
1- If you do all your in-class assignments without bothering other students, I will stop
calling your parents everyday negative reinforcement \n 2- If you continue to fool around, bothering other students during our in-class
assignments, I will call your parents positive punishment \n 3- If you continue to bother other students during in-class assignments, I will no longer
love you negative punishment
4- Because you keep bothering other students during their in-class assignments, you
can't watch your favorite television show negative punishment
5- Here is a gold star for earning a grade of 90% on your test positive reinforcement \n 6- Because you scored below 70% on this test, you must submit a 1,000 page term paper positive punishment \n 7- If you complete this arithmetic assignment quickly, you can go out to recess early
Positive reinforcement \n 8- If you complete this arithmetic assignment quickly, you will no longer have to stay in
during recess negative reinforcement \n 9- When you learn to hang your pants on a hanger, you will look neater when you wear
them positive reinforcement \n 10- When you learn to hang your pants on a hanger, you will not look so ridiculous when
you wear them negative reinforcement
operant conditioning
- primary reinforcers:
- rewards
- food
- secondary reinforcers
- money
- @@token economy:@@
- every time you do something good in class you get a star, you can eventually trade stars in for a prize
- delayed reinforcer
- college
- delayed gratification
- shaping & changing
- @@shaping:@@ one behavior *single behavior
- @@chaining:@@ making a chain of behaviors *multiple behaviors
schedules of reinforcement
- @@continuous schedule:@@
- useful when learning new behavior - but unhelpful in the end
- reward every single time a person does a behavior
- @@intermittent:@@
- variety of different rewards
- most effective in maintaining behavior
^^fixed ratio:^^ reinforcement after a specific number of desired responses
- ex: free coffee after 5 hole punches
^^variable ratio:^^ reinforcement delivered after some number of responses, but amount is not specific
- ex: slot machines
^^fixed interval:^^ reinforcement delivered based on the same passage of time
- based on a set time
- if your homework is completed each day by 5pm, you get $10
^^variable interval:^^ reinforcement delivered after some period of time, but that interval
- ex: your waiting for a college acceptance email, you check your email a lot, hoping it will be there, even though that has no effect on when the mail will come
- @@instinctual drift:@@
- animals revert back to a natural behavior, even after conditioning
- @@premack principal:@@
- Preferred activity can be used to reinforce a non preferred activity
- need to develop things that are more appropriate for each person
- @@contingency:@@
- in order for learning to occur, the connection must be made between consequence & behavior
- @@overjustification:@@
- if you extrinsically reinforce a natural/intrinsic rewarding behavior you can diminish that behavior
- @@intrinsic motivation:@@ a natural desire
- ex: wanting to learn bc u like school
- @@extrinsic motivation:@@ based on external rewards
- ex: money
reinforcement
- A Fixed Ratio Schedule (FR): The consequence is delivered after a specific and predictable number of responses
- A Variable Ratio Schedule (VR): The consequence is delivered after a changing and unpredictable amount of responses
- A Fixed Interval Schedule (FI): The consequence reinforces a particular response after a specific amount of time has passed
- A Variable Interval Schedule (VI): The consequence reinforces a particular response after a changing and unpredictable amount of time has passed
- A Continuous Schedule (C): Rewards the response every time
[[*based on time=interval[[
[[*when learning is first occuring → use continous schedule[[
Classical Conditioning | Operant Conditioning |
---|---|
Behavior is controlled by the stimuli that precede the response (by the CS and the UCS). | Behavior is controlled by consequences (rewards, punishments) that follow the response. |
No reward or punishment is involved (although pleasant and aversive stimuli may be used). | Often involves rewards (reinforcement) and punishments. |
Through conditioning, a new stimulus (CS) comes to produce the old (reflexive) behavior. | Through conditioning, a new stimulus (reinforcer) produces a new behavior. |
Extinction is produced by withholding the UCS. | Extinction is produced by withholding reinforcement. |
Learner is passive (acts reflexively): Responses are involuntary. That is behavior is elicited by stimulation. | Learner is active: Responses are voluntary. That is behavior is emitted by the organism. |
pavlov: classical conditioning
skinner: operant conditioning
thorndike: law of effect
watson: little albert- classical conditioning
bandura- classical learning
seligman- learned helplessness
garcia- taste aversion
tolman- latent learning
kohler- insight learnign
premack- need to know learner to know what is punishmnet to them
escaping/avoiding = negative reinforcement
consult
- overjustification: rewarding someone for something they already like -extrinsic
- internal v external locus:
- where is the control
- internal: control w/in you
- external: control is outside of your control
- learned helplessness
- try something & repeatedly fail → you give up
- not trying anymore
- \
- garcia effect: biological predisposition → tasted aversion
- stimulus discrimination: opposite of generalization
- spontaneous recovery: a behavior has been extinction
- after a period of extinction a behavior returns
- social learning: we learn from watching people
- self efficacy: belief in ones own abilities
- biofeedback: using body signals to identify your stressed → connections
- tension headaches
- \n