learning and motivation chap 7

  • ^^schedule of reinforcement:^^ response requirement that must be met to obtain reinforcement
  • ^^continuous reinforcement schedule:^^ each specified response is reinforced
      * very useful when a behavior is first being shaped or strengthened.
  • ^^intermittent / partial reinforcement schedule:^^ only some responses are reinforced
      * characterizes much of daily life
      * fixed ratio: reinforcement is contingent upon a fixed, predictable number of responses
        * FR1 is the same as a continuous
        * produce a high rate of response along with a short pause following the attainment of each reinforcer
        * short pause: post-reinforcement pause
        * high ratio requirements produce longer post-reinforcement pauses
        * schedules in which the reinforcer is easily obtained are said to be very dense / rich
        * moving from a low ratio requirement to a high ratio requirement should be done gradually
          * stretching the ratio
          * if the requirement is increased too quickly, it can cause ratio strain (disruption in responding due to an overly demanding response requirement)
      * variable ratio: reinforcement is contingent upon a varying, unpredictable number of responses
        * produce a high and steady rate of response, often with little to no postreinforcement pause, especially when the minimum response requirement is very low
        * help to account for the persistence with which some people display certain maladaptive behaviors
        * gambling
        * development of abusive relationship
      * fixed interval: reinforcement is contingent on the first response after a fixed, predictable period of time
        * scalloped pattern of responding: postreinforcement pause followed by a gradually increasing rate of response as the interval draws to a close
          * reinforcer will be attained as soon as it becomes available
      * variable interval: reinforcement is contingent upon the first response after a varying, unpredictable amount of time
        * usually produce a moderate, steady rate of response, often with little or no postreinforcement pause
      * ^^characteristic response pattern:^^ stable pattern that emerges once the organism has had considerable exposure to the schedule
        * aka steady-state behaviors
      * ratio schedules produce higher rates of responses than interval schedules
      * fixed schedules tend to produce postreinforcement pauses while variable schedules don't
      * ^^duration schedules:^^ reinforcement is contingent on performing a behavior continuously throughout a period of time
        * fixed duration schedule: behavior must be performed continuously for a fixed, predictable period of time
          * rat nust run in the wheel for 60 seconds to earn a pellet of food (fd-60sec)
        * variable duration schedule: behavior must be performed continuously for a varying, unpredictable period of time
          * rat must run in the wheel for an average of 60 seconds (vd-60sec)
      * response-rate schedule: reinforcement is directly contingent upon the organism’s rate of response
        * differential reinforcement of high rates: reinforcement is contingent upon emitting at least a certain number of responses in a certain period of time
          * reinforcement is provided for responding at a fast rate
          * one type of response is reinforced while another is not
          * ex: swimming for sport. winning is contingent on a rapid series of response
        * differential reinforcement of low rates: minimum amount of time must pass between each response before the reinforcer will be delivered
          * reinforcement is provided for responding at a slow rate
          * responses that occur during the interval have an adverse effect and prevent reinforcement from occurring
            * example: brushing teeth slowly
        * differential reinforcement of paced responding: reinforcement is contingent upon emitting a series of responses at a set rate
          * reinforcement is provided for responding neither too fast nor too slow
          * ex: musical activities. relevant actions must be performed at a specific pace
      * noncontingent schedules: reinforcer is delivered independently of any response
        * response is not required for the reinforcer to be obtained
        * response-independent schedules
        * ==fixed-time schedule:== reinforcer is delivered following a fixed, predictable period of time, regardless of the organism’s behavior
        * ==variable time schedule:== reinforcer is delivered following a varying, unpredictable period of time, regardless of the organism’s behavior
        * may account for superstitious behavior
        * ==adjunctive behaviors:== innate tendencies, like fidgeting, that are elicited during a period of waiting
      * complex schedules: consist of a combination of two or more simple schedules
        * conjunctiveschedule:conjunctive schedule: requirements of two or more simple schedules must be met before a reinforcer is delivered
        * adjustingschedule:adjusting schedule: response requirement changes as a function of the organism’s performance while responding for the previous reinforcer
          * because it has performed so well, we expect better performance in the future
          * shaping also involves an adjusting schedule
          * requirement for reinforcement changes as soon as the rat has successfully met the previous requirement
        * chainedschedule:chained schedule: consists of a sequence of two or more simple schedules. must be completed in a particular order
          * goal gradient effect: increase in the strength / efficiency of responding as one approaches the goal
          * backward chaining: train the final link first and the initial link last
  • drive reduction theory: an event is reinforcing to the extent that it is associated with a reduction in some type of physiological drive
  • incentive motivation: motivation that is derived from some property of the reinforcer, as opposed to an internal drive state
  • premack principle: high-probability behavior can be used to reinforce a low-probability behavior (first work then play)
  • response deprivation hypothesis: behavior can serve as a reinforcer when access to the behavior is restricted and its frequency thereby falls below its preferred level of occurrence
  • behavioral bliss approach: organism with free access to alternate activities will distribute its behavior in such a way as to maximize overall reinforcement

 \n