instrumental conditioning
Instrumental conditioning is leaning of a contingency between a behaviour and a consequence
INSTRUMENTAL CONDITIONING
Thorndike (guy with the escape room for cat) found that the frequency of the fandom behaviours generally decreased over time over several trials Animals don't have a clear aha moment
Stamping in and stamping out - behaviours that were followed by the favourable consequence are stamped in. Random behaviours are stamped out. Eventually the process leads to refinement
Law of effect - Behaviours w positive consequences are stamped in; those with neg consequences are stamped out
In instrumental conditioning we are considering overt behaviours that are operated by an actor, leading to a reinforcer
TYPES OF INSTRUMENTAL CONDITIONING
Reinforcer - any stimulus that is presented after a response that impact the frequency that the response is performed
Reward training - presentation of positive reinforcer, which inc the freq of the behaviour
punishment training - presentation of a negative reinforcer, which dec the freq of the behaviour
omission training - removal of a positive reinforcer, which dec the freq of the behaviour
escape training - removal of a negative reinforcer, which inc the freq of the behaviour
response rate of behaviour - visualized by psychologists using graphs (think that moving paper and a pencil). Graph has cumulative number of responses on y axis, time on the x-axis
AQUISITION AND TRAINING
autoshaping is a form of conditioning in which a subject that has been given reinforcement following a stimulus, regardless of its response to that stimulus
shaping successive approximation gradual learning. rewarding for behaviour similar to the behaviour wanted. a behaviour is reinforced only if it is a closer approximation of the desired behaviour last reinforced; reinforcing on the basis of improvement
chaining - a technique used to develop a sequence of behaviours. each behaviour is reinforced w the opportunity to perform the next behaviour in a sequence. ABCDE. behaviours in the chaining sequence, as well as the order, are set prior to the training. reinforced the behaviour so long as it is performed in a defined order. incresingly more complex
GENERALIZATION AND DISCRIMINATION
SD / S+ - signals when a contingency b/w a particular behaviour and reinforcement is “on”; sets the occasion for a voluntary response
S𝛿 / S- - signals when the contingency relationship is not valid
SCHEDULES OF REINFORCEMENT
a ratio schedule of reinforcement is based on the number of responses made by a subject, which determines when reinforcement is given
an interval schedule of reinforcement is based on the times since the last response was reinforced
fixed schedule - the conditions were held constant across trials
variable schedule - based on the average across trials
ratio strain - if the reward is too stingy, the subject is no longer going to respond
FR-#

VR-# -
FI-# -
VI -# -

CRF continuous reinforcement schedule
PRF partial reinforcement schedule; more resistant to extinction
chapter 4: instrumental conditioning
law of effect - a response followed by a satisfying effect is strengthened and likely to occur again in that situation, while a response followed by an unsatisfying effect is weakened and less likely to occur again in that situation.
overjustification effect - when indic are rewarded for a behaviour that previously has intrinsic value, and now only has extrinsic value once has been reinforced
negative contrast :: lowering of reward value, dec response rate
positive contrast :: increasing reward value, inc response rate
mirror neuron :: a cell that responsds in the same way when preforming an action as it does when the organism pocessing that cell observes someone else perform the action or even imagines perfoming the action. believed to play a key role in observational learning
primary reinforcer :: a reinforver with intrinsic value such as food, water or a mate
secondary reinforcer :: a reinforver that can be exchanged for a primary reinforcer. money is most commonly used for humans