What Is Operant Conditioning and How Does It Work? (2023)

How Reinforcement and Punishment Modify Behavior

Operant conditioning, also known as instrumental conditioning, is a method of learning normally attributed to B.F. Skinner, where the consequences of a response determine the probability of it being repeated. Through operant conditioning behavior which is reinforced (rewarded) will likely be repeated, and behavior which is punished will occur less frequently.

By the 1920s, John B. Watson had left academic psychology, and other behaviorists were becoming influential, proposing new forms of learning other than classical conditioning. Perhaps the most important of these was Burrhus Frederic Skinner. Although, for obvious reasons, he is more commonly known as B.F. Skinner.

Skinner’s views were slightly less extreme than those of Watson (1913). Skinner believed that we do have such a thing as a mind, but that it is simply more productive to study observable behavior rather than internal mental events.

The work of Skinner was rooted in a view that classical conditioning was far too simplistic to be a complete explanation of complex human behavior. He believed that the best way to understand behavior is to look at the causes of an action and its consequences. He called this approach operant conditioning.

What Is Operant Conditioning and How Does It Work? (1)

Contents

BF Skinner: Operant Conditioning

Skinner is regarded as the father of Operant Conditioning, but his work was based on Thorndike’s (1898) law of effect . According to this principle, behavior that is followed by pleasant consequences is likely to be repeated, and behavior followed by unpleasant consequences is less likely to be repeated.

Skinner introduced a new term into the Law of Effect – Reinforcement. behavior which is reinforced tends to be repeated (i.e., strengthened); behavior which is not reinforced tends to die out-or be extinguished (i.e., weakened).

Skinner (1948) studied operant conditioning by conducting experiments using animals which he placed in a “ Skinner Box ” which was similar to Thorndike’s puzzle box.

What Is Operant Conditioning and How Does It Work? (2)

A Skinner box, also known as an operant conditioning chamber, is a device used to objectively record an animal’s behavior in a compressed time frame. An animal can be rewarded or punished for engaging in certain behaviors, such as lever pressing (for rats) or key pecking (for pigeons).

Skinner identified three types of responses, or operant, that can follow behavior.

  • Neutral operants: responses from the environment that neither increase nor decrease the probability of a behavior being repeated.
  • Reinforcers: Responses from the environment that increase the probability of a behavior being repeated. Reinforcers can be either positive or negative.
  • Punishers: Responses from the environment that decrease the likelihood of a behavior being repeated. Punishment weakens behavior.

We can all think of examples of how our own behavior has been affected by reinforcers and punishers. As a child, you probably tried out a number of behaviors and learned from their consequences.

For example, when you were younger, if you tried smoking at school, and the chief consequence was that you got in with the crowd you always wanted to hang out with, you would have been positively reinforced (i.e., rewarded) and would be likely to repeat the behavior.

If, however, the main consequence was that you were caught, caned, suspended from school and your parents became involved you would most certainly have been punished, and you would consequently be much less likely to smoke now.

(Video) Skinner’s Operant Conditioning: Rewards & Punishments

Positive Reinforcement

Positive reinforcement is a term described by B. F. Skinner in his theory of operant conditioning. In positive reinforcement, a response or behavior is strengthened by rewards, leading to the repetition of desired behavior. The reward is a reinforcing stimulus.

Primary reinforcers are stimuli that are naturally reinforcing because they are not learned and directly satisfy a need, such as food or water.

Secondary reinforcers are stimuli that are reinforced through their association with a primary reinforcer, such as money, school grades. They do not directly satisfy an innate need but may be the means. So a secondary reinforcer can be just as powerful a motivator as a primary reinforcer.

Skinner showed how positive reinforcement worked by placing a hungry rat in his Skinner box. The box contained a lever on the side, and as the rat moved about the box, it would accidentally knock the lever. Immediately it did so that a food pellet would drop into a container next to the lever.

The rats quickly learned to go straight to the lever after a few times of being put in the box. The consequence of receiving food if they pressed the lever ensured that they would repeat the action again and again.

Positive reinforcement strengthens a behavior by providing a consequence an individual finds rewarding. For example, if your teacher gives you £5 each time you complete your homework (i.e., a reward), you will be more likely to repeat this behavior in the future, thus strengthening the behavior of completing your homework.

What Is Operant Conditioning and How Does It Work? (3)

Negative Reinforcement

Negative reinforcement is the termination of an unpleasant state following a response.

This is known as negative reinforcement because it is the removal of an adverse stimulus which is ‘rewarding’ to the animal or person. Negative reinforcement strengthens behavior because it stops or removes an unpleasant experience.

For example, if you do not complete your homework, you give your teacher £5. You will complete your homework to avoid paying £5, thus strengthening the behavior of completing your homework.

Skinner showed how negative reinforcement worked by placing a rat in his Skinner box and then subjecting it to an unpleasant electric current which caused it some discomfort. As the rat moved about the box it would accidentally knock the lever.

Immediately it did so the electric current would be switched off. The rats quickly learned to go straight to the lever after a few times of being put in the box. The consequence of escaping the electric current ensured that they would repeat the action again and again.

In fact Skinner even taught the rats to avoid the electric current by turning on a light just before the electric current came on. The rats soon learned to press the lever when the light came on because they knew that this would stop the electric current being switched on.

These two learned responses are known as Escape Learning and Avoidance Learning.

Punishment (weakens behavior)

Punishment is defined as the opposite of reinforcement since it is designed to weaken or eliminate a response rather than increase it. It is an aversive event that decreases the behavior that it follows.

Like reinforcement, punishment can work either by directly applying an unpleasant stimulus like a shock after a response or by removing a potentially rewarding stimulus, for instance, deducting someone’s pocket money to punish undesirable behavior.

(Video) The difference between classical and operant conditioning - Peggy Andover

Note: It is not always easy to distinguish between punishment and negative reinforcement.

There are many problems with using punishment, such as:

  • Punished behavior is not forgotten, it’s suppressed – behavior returns when punishment is no longer present.
  • Causes increased aggression – shows that aggression is a way to cope with problems.
  • Creates fear that can generalize to undesirable behaviors, e.g., fear of school.
  • Does not necessarily guide toward desired behavior – reinforcement tells you what to do, punishment only tells you what not to do.

Schedules of Reinforcement

Imagine a rat in a “Skinner box.” In operant conditioning, if no food pellet is delivered immediately after the lever is pressed then after several attempts the rat stops pressing the lever (how long would someone continue to go to work if their employer stopped paying them?). The behavior has been extinguished.

Behaviorists discovered that different patterns (or schedules) of reinforcement had different effects on the speed of learning and extinction. Ferster and Skinner (1957) devised different ways of delivering reinforcement and found that this had effects on

1. The Response Rate – The rate at which the rat pressed the lever (i.e., how hard the rat worked).

2. The Extinction Rate – The rate at which lever pressing dies out (i.e., how soon the rat gave up).

What Is Operant Conditioning and How Does It Work? (4)

Skinner found that the type of reinforcement which produces the slowest rate of extinction (i.e., people will go on repeating the behavior for the longest time without reinforcement) is variable-ratio reinforcement. The type of reinforcement which has the quickest rate of extinction is continuous reinforcement.

(A) Continuous Reinforcement

An animal/human is positively reinforced every time a specific behavior occurs, e.g., every time a lever is pressed a pellet is delivered, and then food delivery is shut off.

  • Response rate is SLOW
  • Extinction rate is FAST

(B) Fixed Ratio Reinforcement

Behavior is reinforced only after the behavior occurs a specified number of times. e.g., one reinforcement is given after every so many correct responses, e.g., after every 5th response. For example, a child receives a star for every five words spelled correctly.

  • Response rate is FAST
  • Extinction rate is MEDIUM

(C) Fixed Interval Reinforcement

One reinforcement is given after a fixed time interval providing at least one correct response has been made. An example is being paid by the hour. Another example would be every 15 minutes (half hour, hour, etc.) a pellet is delivered (providing at least one lever press has been made) then food delivery is shut off.

  • Response rate is MEDIUM
  • Extinction rate is MEDIUM

(D) Variable Ratio Reinforcement

behavior is reinforced after an unpredictable number of times. For examples gambling or fishing.

    • Response rate is FAST
    • Extinction rate is SLOW (very hard to extinguish because of unpredictability)

(E) Variable Interval Reinforcement

Providing one correct response has been made, reinforcement is given after an unpredictable amount of time has passed, e.g., on average every 5 minutes. An example is a self-employed person being paid at unpredictable times.

  • Response rate is FAST
  • Extinction rate is SLOW

Behavior Modification

Behavior modification is a set of therapeutic techniques based on operant conditioning (Skinner, 1938, 1953). The main principle comprises changing environmental events that are related to a person’s behavior. For example, the reinforcement of desired behaviors and ignoring or punishing undesired ones.

This is not as simple as it sounds — always reinforcing desired behavior, for example, is basically bribery.

There are different types of positive reinforcements. Primary reinforcement is when a reward strengths a behavior by itself. Secondary reinforcement is when something strengthens a behavior because it leads to a primary reinforcer.

Examples of behavior modification therapy include token economy and behavior shaping.

Token Economy

Token economy is a system in which targeted behaviors are reinforced with tokens (secondary reinforcers) and later exchanged for rewards (primary reinforcers).

(Video) Operant Conditioning Explained

Tokens can be in the form of fake money, buttons, poker chips, stickers, etc. While the rewards can range anywhere from snacks to privileges or activities. For example, teachers use token economy at primary school by giving young children stickers to reward good behavior.

Token economy has been found to be very effective in managing psychiatric patients. However, the patients can become over-reliant on the tokens, making it difficult for them to adjust to society once they leave prison, hospital, etc.

Staff implementing a token economy program have a lot of power. It is important that staff do not favor or ignore certain individuals if
the program is to work. Therefore, staff need to be trained to give tokens fairly and consistently even when there are shift changes such as in prisons or in a psychiatric hospital.

Behavior Shaping

A further important contribution made by Skinner (1951) is the notion of behavior shaping through successive approximation. Skinner argues that the principles of operant conditioning can be used to produce extremely complex behavior if rewards and punishments are delivered in such a way as to encourage move an organism closer and closer to the desired behavior each time.

In shaping, the form of an existing response is gradually changed across successive trials towards a desired target behavior by rewarding exact segments of behavior.

To do this, the conditions (or contingencies) required to receive the reward should shift each time the organism moves a step closer to the desired behavior.

According to Skinner, most animal and human behavior (including language) can be explained as a product of this type of successive approximation.

Educational Applications

In the conventional learning situation, operant conditioning applies largely to issues of class and student management, rather than to learning content. It is very relevant to shaping skill performance.

A simple way to shape behavior is to provide feedback on learner performance, e.g., compliments, approval, encouragement, and affirmation. A variable-ratio produces the highest response rate for students learning a new task, whereby initially reinforcement (e.g., praise) occurs at frequent intervals, and as the performance improves reinforcement occurs less frequently, until eventually only exceptional outcomes are reinforced.

For example, if a teacher wanted to encourage students to answer questions in class they should praise them for every attempt (regardless of whether their answer is correct). Gradually the teacher will only praise the students when their answer is correct, and over time only exceptional answers will be praised.

Unwanted behaviors, such as tardiness and dominating class discussion can be extinguished through being ignored by the teacher (rather than being reinforced by having attention drawn to them). This is not an easy task, as the teacher may appear insincere if he/she thinks too much about the way to behave.

Knowledge of success is also important as it motivates future learning. However, it is important to vary the type of reinforcement given so that the behavior is maintained. This is not an easy task, as the teacher may appear insincere if he/she thinks too much about the way to behave.

Summary

Looking at Skinner’s classic studies on pigeons’ / rat’s behavior we can identify some of the major assumptions of the behaviorist approach.

Psychology should be seen as a science, to be studied in a scientific manner. Skinner’s study of behavior in rats was conducted under carefully controlled laboratory conditions .

• Behaviorism is primarily concerned with observable behavior, as opposed to internal events like thinking and emotion. Note that Skinner did not say that the rats learned to press a lever because they wanted food. He instead concentrated on describing the easily observed behavior that the rats acquired.

• The major influence on human behavior is learning from our environment. In the Skinner study, because food followed a particular behavior the rats learned to repeat that behavior, e.g., operant conditioning.

• There is little difference between the learning that takes place in humans and that in other animals. Therefore research (e.g., operant conditioning) can be carried out on animals (Rats / Pigeons) as well as on humans. Skinner proposed that the way humans learn behavior is much the same as the way the rats learned to press a lever.

So, if your layperson’s idea of psychology has always been of people in laboratories wearing white coats and watching hapless rats try to negotiate mazes in order to get to their dinner, then you are probably thinking of behavioral psychology.

(Video) What is Operant Conditioning?

Behaviorism and its offshoots tend to be among the most scientific of the psychological perspectives . The emphasis of behavioral psychology is on how we learn to behave in certain ways.

We are all constantly learning new behaviors and how to modify our existing behavior. behavioral psychology is the psychological approach that focuses on how this learning takes place.

Critical Evaluation

Operant conditioning can be used to explain a wide variety of behaviors, from the process of learning, to addiction and language acquisition. It also has practical applications (such as token economy) which can be applied in classrooms, prisons and psychiatric hospitals.

However, operant conditioning fails to take into account the role of inherited and cognitive factors in learning, and thus is an incomplete explanation of the learning process in humans and animals.

For example, Kohler (1924) found that primates often seem to solve problems in a flash of insight rather than be trial and error learning. Also, social learning theory (Bandura, 1977) suggests that humans can learn automatically through observation rather than through personal experience.

The use of animal research in operant conditioning studies also raises the issue of extrapolation. Some psychologists argue we cannot generalize from studies on animals to humans as their anatomy and physiology is different from humans, and they cannot think about their experiences and invoke reason, patience, memory or self-comfort.

References

Bandura, A. (1977). Social learning theory. Englewood Cliffs, NJ: Prentice Hall.

Ferster, C. B., & Skinner, B. F. (1957). Schedules of reinforcement. New York: Appleton-Century-Crofts.

Kohler, W. (1924). The mentality of apes. London: Routledge & Kegan Paul.

Skinner, B. F. (1938). The behavior of organisms: An experimental analysis. New York: Appleton-Century.

Skinner, B. F. (1948). Superstition” in the pigeon. Journal of Experimental Psychology, 38, 168-172.

Skinner, B. F. (1951). How to teach animals. Freeman.

Skinner, B. F. (1953). Science and human behavior. SimonandSchuster.com.

Thorndike, E. L. (1898). Animal intelligence: An experimental study of the associative processes in animals. Psychological Monographs: General and Applied, 2(4), i-109.

Watson, J. B. (1913). Psychology as the behaviorist views it. Psychological Review, 20, 158–177.

Further Reading

FAQs

What is operant conditioning and how does it work? ›

Operant conditioning, sometimes referred to as instrumental conditioning, is a method of learning that uses rewards and punishment to modify behavior. Through operant conditioning, behavior that is rewarded is likely to be repeated, and behavior that is punished will rarely occur.

How does operant conditioning work quizlet? ›

is a method of learning that occurs through rewards and punishments for behavior. Through operant conditioning, an association is made between a behavior and a consequence for that behavior. are favorable events or outcomes that are presented after the behavior.

What is a operant conditioning in psychology? ›

Operant conditioning (also known as instrumental conditioning) is a process by which humans and animals learn to behave in such a way as to obtain rewards and avoid punishments.

How does Skinner's operant conditioning work? ›

BF Skinner: Operant Conditioning

According to this principle, behavior that is followed by pleasant consequences is likely to be repeated, and behavior followed by unpleasant consequences is less likely to be repeated. Skinner introduced a new term into the Law of Effect - Reinforcement.

What is the best example of operant conditioning? ›

Operant conditioning examples
  • offering praise when they do something positive.
  • giving them a piece of candy when they clean their room.
  • letting them play video games after they complete their homework.
  • sending them to their room as a form of punishment.
  • ending a playdate if they don't stop misbehaving.
Nov 11, 2021

What is a good example of operant conditioning? ›

A child is scolded (unpleasant event) for ignoring homework (undesirable behavior.) A parent gives a child a time-out (unpleasant consequence) for throwing tantrums (unwanted behavior.) The police gives a driver a ticket (unpleasant stimulus) for speeding (unwanted behavior.)

Which of the following best describes how operant conditioning works? ›

Answer and Explanation: The answer is B. Organisms learn from the consequences of their behavior. In operant conditioning, a behavior is either reinforced or encouraged through a series of punishments and reinforcements.

What is an example of operant behavior in psychology? ›

Operant responses can include everything from a rat's lever press maintained by food, an infant's crying maintained by maternal attention, and small talk maintained by social companionship.

How does operant conditioning work in the classroom? ›

Operant conditioning examples in the classroom can be a way to analyze reinforcement and punishment. Student behavior is an honest reflection of risks, rewards and other underlying consequences. Although natural consequences can lead to changes, rewards and punishments can be consciously doled out to create change.

What is operant behavior in simple terms? ›

Definition. Operant behavior is that which is said to meet two conditions: (1) It is freely emitted by an animal, in the sense that there is no obvious triggering stimulus. (2) It is susceptible to reinforcement and punishment by its consequences, such that it can be caused to go up or down in frequency, respectively.

Which is an example of operant conditioning quizlet? ›

A lion in a circus learns to stand up on a chair and jump through a hoop to receive a food treat. This example is operant conditioning because attendance is a voluntary behavior. The exemption from the final exam is a negative reinforcement because something is taken away that increases the behavior (attendance).

Why is operant conditioning the most effective? ›

An advantage of operant conditioning is its ability to explain learning in real-life situations. From an early age, parents nurture their children's behavior using rewards. Praise following an achievement (e.g. crawling or taking a first step) reinforce such behavior.

What are the 4 types of operant conditioning? ›

In Operant Conditioning Theory, there are essentially four quadrants: Positive Reinforcement, Positive Punishment, Negative Reinforcement, and Negative Punishment.

How do you use operant conditioning on yourself? ›

Using reinforcers is a great way to condition yourself to do something you want to, to reach a goal, or to stop an unwanted behavior. For instance, if you want to start walking 30 minutes a day you can reward yourself with a small piece of chocolate or your favorite TV show after the walk.

What's the best concise definition of operant conditioning? ›

Operant conditioning, also called instrumental conditioning, is a learning process where behaviors are modified through the association of stimuli with reinforcement or punishment.

Which is the best example of operant conditioning quizlet? ›

Which of the following is the best example of operant conditioning? A child listens quietly during story time after being praised for that same behavior. operant conditioning.

How does operant conditioning motivate students? ›

Operant conditioning theory is the simplest of the motivation theories. It basically states that people will do those things for which they are rewarded and will avoid doing things for which they are punished.

What's the best definition of operant conditioning quizlet? ›

operant conditioning. a type of learning in which behavior is strengthened if followed by a reinforcer or diminished if followed by a punisher. respondent behavior. behavior that occurs as an automatic response to some stimulus.

What is the meaning of operant learning? ›

Definition. Operant learning occurs when behavior changes as a function of its consequences, i.e., the environmental changes that follow the behavior. This definition is similar to the Law of Effect proposed initially by Edward Thorndike (1874–1949).

Is operant conditioning a behavior therapy? ›

Operant Conditioning

Behavioral therapy techniques use reinforcement, punishment, shaping, modeling, and related techniques to alter behavior. These methods have the benefit of being highly focused, which means they can produce fast and effective results.

What are the 3 principles of operant conditioning? ›

The Four Principles of Operant Conditioning for Dogs
  • POSITIVE REINFORCEMENT. The dog's behavior makes something good happen. ...
  • POSITIVE PUNISHMENT. The dog's behavior makes something bad happen. ...
  • NEGATIVE PUNISHMENT. The dog's behavior makes something good go away. ...
  • NEGATIVE REINFORCEMENT.
Nov 15, 2011

What are some examples of operant behavior? ›

Operant responses can include everything from a rat's lever press maintained by food, an infant's crying maintained by maternal attention, and small talk maintained by social companionship.

What are the two main concepts in operant conditioning? ›

Reinforcement and punishment are the core tools through which operant behavior is modified. These terms are defined by their effect on behavior. Either may be positive or negative.

What are the main components of operant conditioning? ›

The components of Operant Conditioning are Reinforcement and Punishment. There is positive and negative reinforcement, as well as positive and negative punishment.

How can you use operant conditioning to teach a new behavior? ›

The most effective way to teach a person or animal a new behavior is with positive reinforcement. In positive reinforcement, a desirable stimulus is added to increase a behavior. For example, you tell your five-year-old son, Jerome, that if he cleans his room, he will get a toy.

Videos

1. What Is Operant Conditioning and How Does It Work?
(Ai Moun)
2. How to Train a Brain: Crash Course Psychology #11
(CrashCourse)
3. What is Operant Conditioning (Reinforcement Learning)?
(BRAINPSYCHLOPEDIA)
4. Moving to a Stationary Target
(Lori Torrini)
5. Operant Conditioning Explained
(Psych Explained)
6. LECTURE 15 OPERANT CONDITIONING BG 3RD SEMESTER
(SHAHID SIR'S EDUCATION POINT(SSEP))
Top Articles
Latest Posts
Article information

Author: Clemencia Bogisich Ret

Last Updated: 06/04/2023

Views: 5465

Rating: 5 / 5 (80 voted)

Reviews: 87% of readers found this page helpful

Author information

Name: Clemencia Bogisich Ret

Birthday: 2001-07-17

Address: Suite 794 53887 Geri Spring, West Cristentown, KY 54855

Phone: +5934435460663

Job: Central Hospitality Director

Hobby: Yoga, Electronics, Rafting, Lockpicking, Inline skating, Puzzles, scrapbook

Introduction: My name is Clemencia Bogisich Ret, I am a super, outstanding, graceful, friendly, vast, comfortable, agreeable person who loves writing and wants to share my knowledge and understanding with you.