learning and motivation chap 7

  • schedule of reinforcement: response requirement that must be met to obtain reinforcement

  • continuous reinforcement schedule: each specified response is reinforced

    • very useful when a behavior is first being shaped or strengthened.

  • intermittent / partial reinforcement schedule: only some responses are reinforced

    • characterizes much of daily life

    • fixed ratio: reinforcement is contingent upon a fixed, predictable number of responses

      • FR1 is the same as a continuous

      • produce a high rate of response along with a short pause following the attainment of each reinforcer

      • short pause: post-reinforcement pause

      • high ratio requirements produce longer post-reinforcement pauses

      • schedules in which the reinforcer is easily obtained are said to be very dense / rich

      • moving from a low ratio requirement to a high ratio requirement should be done gradually

        • stretching the ratio

        • if the requirement is increased too quickly, it can cause ratio strain (disruption in responding due to an overly demanding response requirement)

    • variable ratio: reinforcement is contingent upon a varying, unpredictable number of responses

      • produce a high and steady rate of response, often with little to no postreinforcement pause, especially when the minimum response requirement is very low

      • help to account for the persistence with which some people display certain maladaptive behaviors

      • gambling

      • development of abusive relationship

    • fixed interval: reinforcement is contingent on the first response after a fixed, predictable period of time

      • scalloped pattern of responding: postreinforcement pause followed by a gradually increasing rate of response as the interval draws to a close

        • reinforcer will be attained as soon as it becomes available

    • variable interval: reinforcement is contingent upon the first response after a varying, unpredictable amount of time

      • usually produce a moderate, steady rate of response, often with little or no postreinforcement pause

    • characteristic response pattern: stable pattern that emerges once the organism has had considerable exposure to the schedule

      • aka steady-state behaviors

    • ratio schedules produce higher rates of responses than interval schedules

    • fixed schedules tend to produce postreinforcement pauses while variable schedules don't

    • duration schedules: reinforcement is contingent on performing a behavior continuously throughout a period of time

      • fixed duration schedule: behavior must be performed continuously for a fixed, predictable period of time

        • rat nust run in the wheel for 60 seconds to earn a pellet of food (fd-60sec)

      • variable duration schedule: behavior must be performed continuously for a varying, unpredictable period of time

        • rat must run in the wheel for an average of 60 seconds (vd-60sec)

    • response-rate schedule: reinforcement is directly contingent upon the organism’s rate of response

      • differential reinforcement of high rates: reinforcement is contingent upon emitting at least a certain number of responses in a certain period of time

        • reinforcement is provided for responding at a fast rate

        • one type of response is reinforced while another is not

        • ex: swimming for sport. winning is contingent on a rapid series of response

      • differential reinforcement of low rates: minimum amount of time must pass between each response before the reinforcer will be delivered

        • reinforcement is provided for responding at a slow rate

        • responses that occur during the interval have an adverse effect and prevent reinforcement from occurring

          • example: brushing teeth slowly

      • differential reinforcement of paced responding: reinforcement is contingent upon emitting a series of responses at a set rate

        • reinforcement is provided for responding neither too fast nor too slow

        • ex: musical activities. relevant actions must be performed at a specific pace

    • noncontingent schedules: reinforcer is delivered independently of any response

      • response is not required for the reinforcer to be obtained

      • response-independent schedules

      • fixed-time schedule: reinforcer is delivered following a fixed, predictable period of time, regardless of the organism’s behavior

      • variable time schedule: reinforcer is delivered following a varying, unpredictable period of time, regardless of the organism’s behavior

      • may account for superstitious behavior

      • adjunctive behaviors: innate tendencies, like fidgeting, that are elicited during a period of waiting

    • complex schedules: consist of a combination of two or more simple schedules

      • conjunctive schedule: requirements of two or more simple schedules must be met before a reinforcer is delivered

      • adjusting schedule: response requirement changes as a function of the organism’s performance while responding for the previous reinforcer

        • because it has performed so well, we expect better performance in the future

        • shaping also involves an adjusting schedule

        • requirement for reinforcement changes as soon as the rat has successfully met the previous requirement

      • chained schedule: consists of a sequence of two or more simple schedules. must be completed in a particular order

        • goal gradient effect: increase in the strength / efficiency of responding as one approaches the goal

        • backward chaining: train the final link first and the initial link last

  • drive reduction theory: an event is reinforcing to the extent that it is associated with a reduction in some type of physiological drive

  • incentive motivation: motivation that is derived from some property of the reinforcer, as opposed to an internal drive state

  • premack principle: high-probability behavior can be used to reinforce a low-probability behavior (first work then play)

  • response deprivation hypothesis: behavior can serve as a reinforcer when access to the behavior is restricted and its frequency thereby falls below its preferred level of occurrence

  • behavioral bliss approach: organism with free access to alternate activities will distribute its behavior in such a way as to maximize overall reinforcement


robot