click below
click below
Normal Size Small Size show me how
psych U6M27
operant conditioning
| Question | Answer |
|---|---|
| edward thorndike: law of effect | behaviors with good consequences become more likely behaviors with bad consequences become less likely |
| operant conditioning | learning from the consequences of behavior; response>>result |
| operant chamber/skinner box | box that has a bar that an animal can press to get a reward, while devices record the rate of bar pressing |
| reinforcement | an event that strengthens the behavior that it follows |
| shaping | reinforces guide/shape behavior towards a desired behavior |
| example of a discriminative stimulus | misty getting excited to the garage door opening, as opposed to hearing the bathroom door open |
| primary reinforcement | a biological reinforcer (sleep, food) |
| secondary reinforcement | reinforcer that comes from a learned association with a primary reinforcer (money, points in school and games) |
| positive reinforcement | give something that the subject wants to strengthen behavior |
| negative reinforcement | take away something negative that the subject (wants) to strengthen a behavior |
| how do positive and negative reinforcement coincide? | +: receiving teacher approval (present when the student studies) -: receiving teacher disapproval (removed when student studies) |
| what happens to children who make choices influenced by a delayed reinforcer? | saving my lunch for 2 hours later to fulfill hunger later rather than feel full ATM. children are more socially competent; become high achievers |
| generalized reinforcer/token economy | learning to trade something for something you want, which as a result, strengthens behavior |
| schedules of reinforcement | various methods involving time, #, and consistency for providing rewards to influence behavior |
| continuous | reinforcing the desired response every time it happens |
| partial (intermittent) | reinforcing a response only part of the time, resulting in greater resistance to extinction but slower acquisition |
| fixed ratio | reinforcement after a FIXED NUMBER of correct behaviors |
| variable ratio | reinforcement after UNPREDICTABLE NUMBER of responses |
| fixed interval | reinforcement after FIXED period of TIME |
| variable interval | reinforcement after UNPREDICTABLE period of TIME |
| how are variable and fixed schedules of behavior diff in terms of effect on behavior? | variable=steady, high rate of response fixed=behavior increased towards (time of) reinforcement |
| ex of a fixed ratio | getting free coffee after 8 purchases |
| ex of a variable ratio | gambling--you can get $20 after 3 tries or 22 tries |
| ex of a fixed interval | taking the trash out as the time for the garbage truck to come draws near |
| ex of a variable interval | waiting for teacher to respond to an email |
| punishment | applying adverse events to decrease behavior |
| how is punishment different from negative reinforcement | punishment=stop the behavior negative reinforcement=taking away bad thing to increase behavior |
| positive punishment+ex | give the subject what they don't want (spray water, traffic tix) |
| negative punishment+ex | to withdraw a reward (take away a phone, allowance) |
| avoidance conditioning | avoid an unpleasant stimulus altogether (skipping school bc of test) |
| escape conditioning | to rid an aversive stimulus (killing trump) |
| drawbacks of physical punishment | may not fully teach the lesson--the child can do the behavior jst not at the house only, it teaches fear, it increases aggression |
| forward chaining | one response at a time in usual order |
| backward chaining | begins at the last step; each step becomes the 2nd/next reinforcer |
| total task presentation | the entire series of responses in each trial |
| premack principle | whichever of 2 activities is preferred can be used to reinforce activity not preferred |
| fading | the drawing back of reinforcement |
| contiguity apporach | pairing of the NS and natural stimuli occur because they are paired at the right time |
| contingency apporach | paired because the CS comes to predict the US "if______, then____" |
| behavior modification | reinforcing the desired behavior (token economy, shaping, fading, etc) |
| how can parents achieve desirable behaviors w/o punishment | commands can be rephrased into more positive meanings; use no physical harm |
| how can punishment be effective | it must follow immediately after the undesired behavior, it must be sufficient; not abusive; it must be certain, and it must be consistent; having the same offence for the behavior that gradually accumulates to a bigger consequence |
| omission training | decrease the frequency of a behavior by rewarding the subject when the specific behavior is not performed |
| response chains | performing a number of responses in sequence for a reward; each response produces a signal for the next one |
| response patterns | chains of responses are organized into larger response patterns (in tap, you learn the step, then arms, then facial, then combine it all into one step) |