Elizabeth Haas

PSY-150/Withrow

September 29, 2003

Chapter Five: Learning

Learning is the process by which experience or practice results in a relatively permanent change in behavior or potential behavior.

Classical or Pavlovian conditioning is the type of learning in which a response naturally elicited by one stimulus comes to be elicited by a different, formally neutral stimulus. Based on Pavlov’s experiments with dogs. He would ring a bell before feeding the dogs. After a while, the dogs would begin to salivate as soon as they heard the bell because to the dogs the bell signaled the appearance of food. The dogs began to salivate on cue (bell) even when there was no food. The dogs were therefore conditioned to salivate in response to new stimulus (the bell) that would normally not prompt salivation.

Elements of classical conditioning are unconditioned stimulus, unconditioned response, condition stimulus, and conditioned response. Unconditioned stimulus (US) is a stimulus that automatically elicits a certain reflex action. In Pavlov’s studies, the food in the dog’s mouth was the unconditioned stimulus. An unconditioned response (UR) is a response that takes place in an organism whenever an unconditioned stimulus occurs. In Pavlov’s studies, the dog’s salivation is the unconditioned response. A conditioned stimulus is an originally neutral stimulus that is paired with an unconditioned stimulus and eventually produces the desired response in an organism when presented alone. Pavlov’s repeated use of the bell trigged the dog’s salivation. Therefore the bell was the conditioned stimulus. A conditioned response is the response an organism produces when a conditioned stimulus is present, a learned reaction.

The pairing of the conditioned stimulus and the unconditioned stimulus on only a portion of the learning trials is called intermittent pairing. It is important that the pairings be neither too far apart or too close together.

Other examples of classical conditioning: eating when not hungry but doing so because we know it’s lunch time or a specific smell triggering a bad memory.

Desensitization therapy is a conditioning technique designed to gradually reduce anxiety about a particular object or situation and starts with teaching a system of deep muscle relaxation. With desensitization, a fear-arousing thought is repeatedly paired with a muscular state that produces calmness until eventually the formally fearful thought no longer trigger anxiety.

Operant or instrumental conditioning is learning to make or withhold a certain response because of its rewards or punishment. Emitted behavior is one essential element in operant conditioning. Behavior designed to operate on the environment in a way that will gain something desired or avoid something unpleasant are operant behaviors. There are two types of consequences in operant behavior: the reinforcer and the punisher. The reinforcer occurs when a consequence increases the likelihood of operant behavior being emitted and the punisher occurs when a consequence decreases the likelihood of operant behavior. Edward Lee Thorndike’s Law of Effect is a theory that behavior consistently rewarded will be “stamped in” as learned behavior and behavior that brings about discomfort will be “stamped out.” Because behaviors involved in operant conditioning are voluntary, it is not easy to establish an operantly conditioned response. The desired behavior must first be performed spontaneously in order for it to be rewarded and strengthened. One way to speed up the process is to increase the motivation. Examples of operantly learned behavior would be studying to gain a teacher’s approval. Another way to speed up the process is to reinforce successive approximations of the desired behavior, referred to as shaping. Food, praise, or money that add something rewarding to a situation would be considered a positive reinforcer, encouraging the ongoing behavior. Stopping an aversive noise or taking away something unpleasant would be considered a negative reinforcer, also encouraging the ongoing behavior. Both positive and negative reinforcements result in the learning of new behaviors or the strengthening of existing behaviors.

Punishment is any event whose presence decreases the likelihood that the ongoing behavior will recur. Punishment adds something unpleasant to he environment therefore weakening the behavior that caused it. Drawbacks to punishment include stirring up negative feelings, models of aggressive behavior, and suppression of undesirable responses. When a threat of punishment induces a change to more desirable behavior it is called avoidance training. Learned helplessness is a “giving up” response that causes resignation in the face of unpleasant outcomes even when the outcomes can be avoided.

Similarities between classical and operant conditioning: 1) both involved learned associations, 2) extinction and spontaneous recovery, 3) generalization, 4) discrimination, 5) involve stimulus control.

Contingency is a relationship in which one event depends on another existing between two stimuli or between a stimulus and a response. In classical conditioning, the contingency is between the conditioned stimulus and the unconditioned stimulus. In operant conditioning, the contingencies exist between the responses and the consequences.

In operant conditioning:

The rule for determining when and how often the reinforcers will be delivered is the schedule or reinforcement. Partial reinforcements are rewards given for some correct responses but not every one generating behavior that persists longer than behavior that is generated by continuous reinforcement. In a fixed-interval schedule, reinforcement comes on the first correct response after a fixed length of time has passed since the last reinforcement resulting in a slow, steady pattern of responding as the learner keeps testing for the next payoff. In a fixed-ratio schedule, the correct response is reinforced after a fixed number of correct responses, resulting in a high rate of response because faster responses yield quicker payoffs. In the variable-ratio schedule, a varying number of correct responses must occur before reinforcement is presented resulting in a high response rate since the person keeps harboring the idea that he next response will bring a reward.

Extinction, a factor shared by both classical and operant conditioning, is a decrease in the strength or stopping of a learned response. After extinction and a period of rest, a conditioned response may suddenly reappear, reminded of an old association. This is called spontaneous recovery. In classical conditioning, extinction is caused by the failure to continue pairing the conditioned stimulus with the unconditioned stimulus. In operant conditioning, extinction happens as a result of withholding reinforcement.

The control of conditioned responses by stimuli in the environment is stimulus control and occurs in both classical and operant conditioning. The process by which a learned response to a specific stimulus comes to be associated with different but similar stimuli is known as stimulus generalization. Stimulus discrimination is a process through which learners can be trained not to generalize but rather to make a conditioned response only to a single specific stimulus. As a result, behavior becomes more finely tuned to the demands of the environment. Higher-order conditioning is based on previous learning where the conditioned stimulus serves as the unconditioned stimulus for further training.

Reinforcers that are rewarding in itself, such as food, water, and sex, and in which no prior learning is need to make them reinforcing are primary reinforcers. Secondary reinforcers need prior learning and are reinforcers whose value is acquired through association with other primary or secondary reinforcers.

The main difference between classical and operant conditioning is that in classical conditioning, the learner is passive and the behavior involved is usually involuntary. In operant conditioning, the learner is active and the behavior involved is usually voluntary.

Cognitive learning depends on mental processes that are not directly observable. Latent learning is learning that is not immediately reflected in a behavior change. Rewards and punishment are necessary for latent learning to take place. A cognitive map is a learned mental image of a spatial environment that may be called upon to solve problems when the stimuli in the environment change, a storing of visual perceptions. Insight is learning that occurs as a result of understanding all the elements of a problem. The ability to become increasingly more effective in solving problems as more problems are solved is a learning set. The learning set provides a key to solving a problem even when its demands are slightly different from those problems solved in the past.

Observational or vicarious learning is learning by observing other people’s behavior. Social learning theorists are psychologists whose view of learning emphasizes the ability to learn by observing a model or receiving instructions without first hand experience. According to social learning theory, humans use their powers of observation and thought to interpret their own experiences and those of others when deciding how to act. Vicarious reinforcement or vicarious punishment affects the willingness of others to perform the behaviors they learned by observing so that when a consequence for a behavior isn’t experienced first hand but only occurs to other people.

PSY-150 Chapter Five Notes

Elizabeth Haas