Question 1

Thorndike’s Law of Effect

Accepted Answer

responses in a situation folloed by satisfaction will become stronger. Responses followed by discomfort will become weaker.

Question 2

Cats in a Puzzle Box

Accepted Answer

random trial + error behavior
gradual change behavior

Question 3

Law of Effect doubts

Accepted Answer

do animals really know what to expect?
- contrast effects
do animals always show behavior change?
- kohler and insight in apes

Question 4

contrast effects

Accepted Answer

negative (switch from large to small reinforcement)
positive (switch from small to large reinforcement)

Question 5

Skinner Box

Accepted Answer

aka operant chamber
- small aparatus with way for subject to make responses and experimenter to deliver reinforcement
- rats + levers, piegons + keys

Question 6

shaping

Accepted Answer

reinforcement of successive approximations to a desired behavior

Question 7

chaining

Accepted Answer

constructing a sequence of behaviors with reinforcement only occurring after the final response in the sequence

Question 8

extinction

Accepted Answer

response eliminated when reinforcement is withheld

Question 9

extinction burst

Accepted Answer

temporary increase of nonreinforced behavior when extinction starts

Question 10

spontaneous recovery

Accepted Answer

extinction fades away as a passage of time
- not only way for extinguished responses to occur

Question 11

resurgence

Accepted Answer

recovery of an extinguished response after extinction of a competing behavior

Question 12

typical resurgence procedure

Accepted Answer

rats in skinner box, 2 levers
-- phase1: L1 = food, L2 = no effect
-- phase2: extinguish L1 (no food)
-- phase3: L2 = food, L1 = no effect
-- phase4: extinguish L2 (no food)
result --> rats will resume pressing L1 (resurgence) even in absence of food

Question 13

partial reinforcement extinction effect

Accepted Answer

slows down acquisition of response BUT increase resistance to extinction

Question 14

discrimination hypothesis

Accepted Answer

more difficult to distinguish between acquisition phase and extinction phase

Question 15

frustration hypothesis

Accepted Answer

responding while frusterated becomes associated with reinforcement

Question 16

sequential hypothesis

Accepted Answer

memory of sequence of nonrewarded trials becomes associated with reinforcement

Question 17

schedules of partial reinforcement

Accepted Answer

fixed interval, variable interval, fixed ratio, variable ratio

Question 18

fixed-interval

Accepted Answer

reinforcement is given for the first response that occurs after a set period of time

Question 19

variable-interval

Accepted Answer

reinforcement is given for the first response that occurs after a changing period of time

Question 20

fixed-ratio

Accepted Answer

reinforcement is given for a set number of responses

Question 21

variable-ratio

Accepted Answer

reinforcement is given for a changing number of responses

Question 22

pauses after reinforcement on

Accepted Answer

fixed schedules, esp interval

Question 23

greater responding on

Accepted Answer

ratio schedules than interval

Question 24

greatest total number of responses on what schedule

Accepted Answer

variable-ratio

Question 25

matching law

Accepted Answer

responses are distributed to reflect the distribution of reinforcements

Question 26

example of matching law

Accepted Answer

- 2 subjects in operant (variable-interval) experiments simultaneously
- proportion of responses that a subject makes of a certain kind matches the proportion of reinforcement that is received from those responses

Question 27

probability matching

Accepted Answer

a related pattern is found in human cognition. the probability of choosing an option tends the match the probability of that option succeeding.

Question 28

example of probability matching

Accepted Answer

subjects predict 2 lights will come on. if one comes on 80% of the time and the other 20%, they'll pick first one 80% of time. This does not maximize successes.
- should pick most probable every time

Question 29

basic idea of the ephemeral rewards task (ERT)

Accepted Answer

choice doesn't always seem rational

Question 30

in the ERT, what goes down

Accepted Answer

subjects given a choice between two responses on each trial, each leading to identical rewards. If A - reward given, trial ends. If B, reward given, but A still available.

Question 31

So what are the ERT results

Accepted Answer

Optimal strategy should be to always first make the response that will keep the other response available (B) and get 2 reinforcements. BUT animals + some primates choose A/B 50/50.

Question 32

what happens with humans and ERT

Accepted Answer

many humans also pick suboptimally after going thru 60 trials to earn points. They report being confused by this.

Question 33

self control

Accepted Answer

one type of choice, NOT a moral judgement --> behavioral phenomenon

Question 34

how to study self control

Accepted Answer

behaviorally by choice between immediate small reward vs larger delayed reward

Question 35

discounting

Accepted Answer

future rewards not valued as highly as immediate rewards. future punishments not feared as much as immediate.

Question 36

why does discounting make sense

Accepted Answer

life is inherently uncertain, cannot rely on the future

Question 37

what does keynes say

Accepted Answer

do not just focus on the long run bc we are all dead then

Question 38

how do pigeons test discounting

Accepted Answer

put em in a skinner box
1 interaction = less but instant food
2 interaction = more but delayed food
pigeons have a lot of discounting poor self control

Question 39

who did hundreds of experiments about discounting

Accepted Answer

mischel

Question 40

what were the mischel experiments

Accepted Answer

child waiting to eat candy so they can get two pieces
- older child, more likely to wait
- something else to do helps
- ironic processess
- reflects long term personality characteristics

Question 41

Pleasant: the response produces reinforcing outcome

Accepted Answer

Reward Training
(positive reinforcement) - Response INC

Question 42

Unpleasant: the response produces reinforcing outcome

Accepted Answer

Punishment
Response DEC

Question 43

Pleasant: the response prevents reinforcing outcome

Accepted Answer

Omission, Extinction, Time Out
Response DEC

Question 44

Unpleasant: the response prevents reinforcing outcome

Accepted Answer

Escape, Avoidance
(Negative reinforcement) - Response INC

Question 45

when does punishment work?

Accepted Answer

consistency, delay, intensity factors

Question 46

side effects of punishment?

Accepted Answer

conditioned fear, aggression

Question 47

omission training

Accepted Answer

don't do something - good outcome occurs
-- ex: time out

Question 48

escape

Accepted Answer

learn to avoid punishment
- two process theory

Question 49

two-process theory

Accepted Answer

classical conditioning of fear to warning signal + reduction of fear as reinforcement
- once learned will always avoid

Question 50

evidence against the two process theory

Accepted Answer

SIDMAN AVOIDANCE PROCEDURE
- no warning signal
- animals still find ways to avoid

Question 51

cognitive theory of avoidance

Accepted Answer

subject develops expectations (e.g. no shock if I respond, shock if I don't)
- expectations of circumstance, subject's knowledge/cognitions

Question 52

instrumental learning is sort of

Accepted Answer

the subject learning how to control the world

Question 53

learned helplessness

Accepted Answer

learning that there is a lack of contingency between responses and an aversive outcome

Question 54

example of learned helplessness

Accepted Answer

forcing a dog to be shocked, they stop struggling/trying to escape eventually

Question 55

how do bad events relate to learned helplessness

Accepted Answer

sometimes when bad things happen, you take it as something that you could fix
--> no excuse? bam LH
if you learn you cannot fix things, you will not look for the opportunity to fix them

Question 56

learned helplessness has served as a model for what

Accepted Answer

depression

Question 57

long term learned helplessness effects: emotional

Accepted Answer

depressed effect, loss of appetite and sex drive, psychosomatic illness

Question 58

long term learned helplessness effects: motivational

Accepted Answer

lack of initiative to respond

Question 59

long term learned helplessness effects: cognitive

Accepted Answer

develop "external locus of control" attributional style; fail to recognize situations where they could exert control

Question 60

social defeat approach

Accepted Answer

psychological and behavioral effects of the losing party in a confrontation among animals of the same species

Question 61

what criticism does social defeat come from

Accepted Answer

learned helplessness research with animals is often using artifical negative stimuli
- social defeat a more realistic manipulation

Question 62

resident-intruder task

Accepted Answer

animal placed in cage of another animal(s) of same species in a manner that allows non-lethal conflict. 
- documented to produce anxiety-like and depressive-like symptoms in mice

Question 63

animals allowed to fight on a single occassion

Accepted Answer

acute stress

Question 64

animals allowed to fight on several occasions

Accepted Answer

chronic stress

Term	Definition
Thorndike’s Law of Effect	responses in a situation folloed by satisfaction will become stronger. Responses followed by discomfort will become weaker.
Cats in a Puzzle Box	random trial + error behavior gradual change behavior
Law of Effect doubts	do animals really know what to expect? - contrast effects do animals always show behavior change? - kohler and insight in apes
contrast effects	negative (switch from large to small reinforcement) positive (switch from small to large reinforcement)
Skinner Box	aka operant chamber - small aparatus with way for subject to make responses and experimenter to deliver reinforcement - rats + levers, piegons + keys
shaping	reinforcement of successive approximations to a desired behavior
chaining	constructing a sequence of behaviors with reinforcement only occurring after the final response in the sequence
extinction	response eliminated when reinforcement is withheld
extinction burst	temporary increase of nonreinforced behavior when extinction starts
spontaneous recovery	extinction fades away as a passage of time - not only way for extinguished responses to occur
resurgence	recovery of an extinguished response after extinction of a competing behavior
typical resurgence procedure	rats in skinner box, 2 levers -- phase1: L1 = food, L2 = no effect -- phase2: extinguish L1 (no food) -- phase3: L2 = food, L1 = no effect -- phase4: extinguish L2 (no food) result --> rats will resume pressing L1 (resurgence) even in absence of food
partial reinforcement extinction effect	slows down acquisition of response BUT increase resistance to extinction
discrimination hypothesis	more difficult to distinguish between acquisition phase and extinction phase
frustration hypothesis	responding while frusterated becomes associated with reinforcement
sequential hypothesis	memory of sequence of nonrewarded trials becomes associated with reinforcement
schedules of partial reinforcement	fixed interval, variable interval, fixed ratio, variable ratio
fixed-interval	reinforcement is given for the first response that occurs after a set period of time
variable-interval	reinforcement is given for the first response that occurs after a changing period of time
fixed-ratio	reinforcement is given for a set number of responses
variable-ratio	reinforcement is given for a changing number of responses
pauses after reinforcement on	fixed schedules, esp interval
greater responding on	ratio schedules than interval
greatest total number of responses on what schedule	variable-ratio
matching law	responses are distributed to reflect the distribution of reinforcements
example of matching law	- 2 subjects in operant (variable-interval) experiments simultaneously - proportion of responses that a subject makes of a certain kind matches the proportion of reinforcement that is received from those responses
probability matching	a related pattern is found in human cognition. the probability of choosing an option tends the match the probability of that option succeeding.
example of probability matching	subjects predict 2 lights will come on. if one comes on 80% of the time and the other 20%, they'll pick first one 80% of time. This does not maximize successes. - should pick most probable every time
basic idea of the ephemeral rewards task (ERT)	choice doesn't always seem rational
in the ERT, what goes down	subjects given a choice between two responses on each trial, each leading to identical rewards. If A - reward given, trial ends. If B, reward given, but A still available.
So what are the ERT results	Optimal strategy should be to always first make the response that will keep the other response available (B) and get 2 reinforcements. BUT animals + some primates choose A/B 50/50.
what happens with humans and ERT	many humans also pick suboptimally after going thru 60 trials to earn points. They report being confused by this.
self control	one type of choice, NOT a moral judgement --> behavioral phenomenon
how to study self control	behaviorally by choice between immediate small reward vs larger delayed reward
discounting	future rewards not valued as highly as immediate rewards. future punishments not feared as much as immediate.
why does discounting make sense	life is inherently uncertain, cannot rely on the future
what does keynes say	do not just focus on the long run bc we are all dead then
how do pigeons test discounting	put em in a skinner box 1 interaction = less but instant food 2 interaction = more but delayed food pigeons have a lot of discounting poor self control
who did hundreds of experiments about discounting	mischel
what were the mischel experiments	child waiting to eat candy so they can get two pieces - older child, more likely to wait - something else to do helps - ironic processess - reflects long term personality characteristics
Pleasant: the response produces reinforcing outcome	Reward Training (positive reinforcement) - Response INC
Unpleasant: the response produces reinforcing outcome	Punishment Response DEC
Pleasant: the response prevents reinforcing outcome	Omission, Extinction, Time Out Response DEC
Unpleasant: the response prevents reinforcing outcome	Escape, Avoidance (Negative reinforcement) - Response INC
when does punishment work?	consistency, delay, intensity factors
side effects of punishment?	conditioned fear, aggression
omission training	don't do something - good outcome occurs -- ex: time out
escape	learn to avoid punishment - two process theory
two-process theory	classical conditioning of fear to warning signal + reduction of fear as reinforcement - once learned will always avoid
evidence against the two process theory	SIDMAN AVOIDANCE PROCEDURE - no warning signal - animals still find ways to avoid
cognitive theory of avoidance	subject develops expectations (e.g. no shock if I respond, shock if I don't) - expectations of circumstance, subject's knowledge/cognitions
instrumental learning is sort of	the subject learning how to control the world
learned helplessness	learning that there is a lack of contingency between responses and an aversive outcome
example of learned helplessness	forcing a dog to be shocked, they stop struggling/trying to escape eventually
how do bad events relate to learned helplessness	sometimes when bad things happen, you take it as something that you could fix --> no excuse? bam LH if you learn you cannot fix things, you will not look for the opportunity to fix them
learned helplessness has served as a model for what	depression
long term learned helplessness effects: emotional	depressed effect, loss of appetite and sex drive, psychosomatic illness
long term learned helplessness effects: motivational	lack of initiative to respond
long term learned helplessness effects: cognitive	develop "external locus of control" attributional style; fail to recognize situations where they could exert control
social defeat approach	psychological and behavioral effects of the losing party in a confrontation among animals of the same species
what criticism does social defeat come from	learned helplessness research with animals is often using artifical negative stimuli - social defeat a more realistic manipulation
resident-intruder task	animal placed in cage of another animal(s) of same species in a manner that allows non-lethal conflict. - documented to produce anxiety-like and depressive-like symptoms in mice
animals allowed to fight on a single occassion	acute stress
animals allowed to fight on several occasions	chronic stress
after the defeat/interval between social defeat fights	subordinate animal may be exposed to threats from the dominant one - nearby cage - visual/olfactory cues
classical conditioning	learning about ASSOCIATIONS BETWEEN EXTERNAL EVENTS (out in the world), theories emphasize STIMULUS-STIMULUS associations
instrumental conditioning	learning about associations between OUR OWN BEHAVIOR and EXTERNAL events (reinforcement) - our behavior can alter likelihood of outcome
who's in charge CC	experimenters determine trials by presenting CS + US
who's in charge IC	the subject must make the response
nature of response CC	usually involuntary (blinking, salivating, etc)
nature of response IC	usually voluntary (pushing a lever)
biofeedback	an exception to IC - instrumentally controlling an autonomic response through the use of biofeedback: procedure of giving an individual info about the activity of an internal response
drive-reduction theory	positive reinforcers reduce drives, and negative reinforcers increase drives - complete actions to reduce uncomfortable "drive" - brain stimulation as reinforcement - chimpanzees prefer african or indian music to silence
if drive-reduction is about bio needs, is it always needed?	no! hull says it's sometimes wrong for ex: ppl love diet coke, but it costs money, bad for health, water is necessary and free
what does drive-reduction not take into account	pleasure!
incentive motivation	reinforcers are incentives that elicit responding - food! (hunger + pleasure)
reinforcer priming	exposure to reinforcer may enforce responding
relative reinforcement	anything can be a reinforcement depending on what alternatives are
premark principle	a higher probability activity will reinforce a lower probability activity - higher-prob more favorable, more likely to do - if what the subject most wants to do is able to be done, it can serve as reinforcement for other things
response deprivation theory	reinforcer is an opportunity to move closer to bliss point - if you are deprived of something more likely to work to be able to increase doing that thing
bliss point	ideal distribution of time and effort across all points
secondary reinforcement	neural stimuli that have been paired with reinforcers and have acquired the capacity to reinforce on their own - money!
social reinforcement	in many species (including humans) attention + physical contact may serve as reinforcement - theorists differ if this is primary or secondary reinforcement
what actually is reinforcement?	watson/skinner: - if subjects will inc response to get access to something, that is positive reinforcement - if subjects inc response to having something taken away, that is negative reinforcement
ecological validity of punishment	only effective in certain circumstances negative side effects (ex: avoidance, aggression)
ecological validity of reward	token economies
token economies	clients earned tokens for participating in predetermined target behaviors which can be exchanged for reinforcements - inpatient units, correctional facilities, schools
goal of token economies	generalize behavior to other settings BUT may not generalize, depends
potential limitations of token economies	lab closed economy, world an open one goal gradient hypothesis
goal gradient hypothesis	effect of a reward is weaker further the behavior from the reward
how do rewards punish us?	kohn - yes rewards change behavior, but negative consequences good sometimes - rewards may reduce creativity - ppl may focus on reward targets instead of thinking outside of the box ----- reduction of INTRINSIC motivation
stimulus control	conditioning a response to occur more often in the presence of a specific stimulus than in its absence
discriminative stimulus (S^D)	yes, if you make the response now you will get reinforcement
how is stimulus control not CC	you still need to DO the response (pull lever) to be reinforced
generalization	responding similarly to similar stimuli. to get a sense of similarity for a subject, a generalization gradient can be formed.
generalization gradient	- condition to respond in presence of S^D, not respond in absence (variable interval sched) - test for responses to different stimuli. plot # of responses as Fn of similarity to S^D - gradient is symmetric, centered on S^D
what do we do stimulus control for	to study generalization
discrimination training	responses in the presence of one stimulus (S^D) would be reinforced, while responses in presence of another stimulus (Sdelta) are not. - learn WHEN to make responses and when not - focus not on training, but what subject does after figuring out rules
what happens after discrimination training	subjects are tested on a range of stimuli. discrimination training typically results in generalization gradient that is narrower w/ a higher peak. the peak of responding is moved away for Sdelta
behavioral contrast	a phenomenon where changing a reinforcement schedule in one setting causes a behavior to change in the opposite direction in an unchanged setting
peak shift	peak becomes higher, narrower, shifted
continuity approach	discrimination is achieved by gradual association of one stimulus w/ award (excitation) and one nonreward (inhibition). these associations spread to similar stimuli. - natural explanation of peak shift, excitation/inhibition spread
non-continuity approach	discrimination is achieved by attention to particular dimensions + hypothesis testing. behavior can change drastically from one trial to another as different hypotheses are tested. - often tested on multidimensional stimuli
mackintosh cnon-ontinuity study	some animals focus on one dimension, some focus on another, and some on compound ex: SD: red circle, Sdelta: blue triangle pigion shown circle, triangle, red, blue then? indv animals focus on diff aspects color, shape, etc
testing subjects on stimuli varying on two dimensions	after one discrimination is mastered, which kind of shift is more easily learned? ex: see red/blue circles + triangles p1: S^D blue, Sdelta red - blue circle: respond - blue triangle: respond - red circle: no respond - red traingle: no respond
reversal shift	essentially change what is going on to every stimulus. some dimension matters (S^D red, Sdelta blue) -- BC = BT = no response, RC + RT = response. Just keep paying attn to color
nonreversal shift	change dimension (S^D circle, Sdelta triangle) -- BC + RC = response, BT + RT = no response
what is a tool to study concept learning in animals?	discrimination training
hernstein on discrimination training	train pigeons with S^D a class of stimuli and Sdelta anything not in that class. after training test two slides. pigeons capable of learning open-ended categories: - trees vs non trees, indv vs others, dish vs nonfish, humans vs nonhumans
errorless	telling people answer so fast they don't even have the option to mess it up
errorless discrimination	discrimination can be a long and seemingly frustrating process for subjects with freq responses to Sdelta
terrace's errorless DT	begin training w/ Sdelta very far away from S^D. gradually, move Sdelta closer and closer to S^D (stimulus fading)
preparedness	relative preparedness defined by the number of learning experiences one must occur before behavior change is reliable
conditioned taste aversion	avoidance of food as reponse to illness - long delay - one-trial learning - only certain aspects
preparedness and phobia	early evidence on phobias and conditioning: Watson and Rayner Little Albert
prepared fear module: ohman and mineka	- responds to simuli that are threatening in species' history - responding is automatic and involuntary - fear response is relatively unaffected by other modules (cog) - specialized neural circuts (amygdala)
what happened when skinner noticed tha animal behavior in an operant chamber was surprisingly complex	pigeons pecking at light for food but wtf they are also flapping wings?? - superstitious behaviors!!
superstitious behaviors	pigeons did external activities bc they were accidentally reinforced - THIS IS WRONG
staddon and simmelhag were like let's observe these pigeons	not random! diff pigeons tended to show same behaviors - interim and terminal activities
interim activities	some activities regularly done right after food delivery
terminal activities	activities that occurred as time for the next food delivery approached
instinctive drift	over time, innate behavior dominates conditioned behavior
species-specific defensive reactions	innate response primed in dnagerous situations
autoshaping	responding (pecking) in the absence of a relationship with reinforcement
negative autoshaping	responding (pecking) in an omission training design
behavior system	a behavior system is a related set of perceptual, behavioral, and motivational elements that move together like a unit
example of a behavior system	activation of the food system predisposes sensitivity to food-related stimuli, food-searching responses, and food-related reinforcers
what do behavior systems emphasize	role of innate, complex behavior in both simple experiments and everyday life
viewing learning as stimulus-response association is much too simple	reinforcement is activating a complex structure based on heredity
persistance	the voluntary, motivated continuation of goal-directed behavior despite obstacles, challenges, or fatigue
approach-avoidance conflict	occurs when a single goal or object holds both positive (desirable) and negative (avoidant) characteristics, causing simultaneous urges to approach and avoid it

"Know" box contains:
Time elapsed:
Retries:

PSCL 353 - Exam #2