How Dogs Learn by Consequences: Operant Conditioning in Dog Training

What is the common thing all dog trainers in the world share despite all their unbridged differences? They all use the principles of Operant and Classical Conditioning when they train, handle, and manage their dogs. 

Today, we talk about Operant Conditioning. The learning model that focuses on how dogs, humans, and other intelligent animals, learn by consequences.

If you think about it, all methods of training introduce a consequence, whether pleasant or unpleasant, after a behavior occurs. By doing so, we -the dog handlers- encourage or discourage a behavior, according to our likes.

This is the science behind dog training.

Let’s break it down.

Operant conditioning: the definition

Operant Conditioning is a learning model that describes how intelligent beings modify their behavior according to the consequences that are tied to.

For example, a student (a human student) is more likely to participate in a class that offers incentives. If good grades are also followed by other types of rewards, the student is set up for success, simply by adding a pleasant outcome (the incentives) to a voluntary response (her choice to participate) to an otherwise neutral cue (a class without incentives).

In the same fashion, and back to the context of dog training, we introduce a consequence when a behavior occurs, in a way that makes sense to the dog.

There are two ways of doing that. One way is to focus on the behavior we want to increase (reinforcement). The other way is to focus on the behavior we want to decrease (punishment). We decrease or increase a behavior by adding (positive) or removing (negative) a stimulus. The addition and removal of a stimulus is the consequence that dogs learn to associate with their actions, and this is how learning occurs.

The four types of operant conditioning

  1. Positive reinforcement. A pleasant stimulus follows the desired behavior. We give praises to the puppy that just pooped in the designated area.
  2. Negative reinforcement.  An unpleasant stimulus is removed when the desired behavior occurs. Handler restrains the dog until the dog is calm, and then the restraint is removed. Negative refers to something that we remove (the restraint )when the desired behavior occurs (the dog is calm).
  3. Positive punishment. An unpleasant stimulus follows the undesired behavior. Dog jumps on visitors and the handler shouts to the dog. Again, positive refers to the addition of shouting.
  4. Negative punishment. A pleasant stimulus is removed when the undesired behavior occurs. The dog eats poop from the ground and the playtime is over. Negative refers to the playtime being over.

*In operant terminology, the words positive and negative are used in their mathematical sense. They mean addition and abstraction. Likewise, punishment, in the context of operant conditioning, is a technical term that means the introduction of a consequence after a particular behavior to make it less likely to reoccur. While this leaves room for abuse, as it has, it doesn’t mean that punishment, if wisely used, is abusive.

operant conditioning
  • Save


The four types in-depth:

1. Positive reinforcement

This is the most dog-friendly way to train your dog

Positive reinforcement is a powerful method of teaching that introduces pleasant stimuli after a desired behavior occurs. In dog training, it works by rewarding a dog for offering a behavior that we want to encourage. Rewards may include food, praises, toys, games, etc.

Positive reinforcement relies on the simple notion that dogs are more likely to repeat behaviors that they associate with pleasant consequences. Basic cause-and-effect stuff.

The word positive, as in math, refers to something being added (the reward), and reinforcement to the behavior we want to encourage. For example, with a soft and firm voice say ”come”, and give a treat when the dog comes. The treat is the added stimulus and the successful response to our command ”come” is the behavior you want to reinforce.

See the sequence of events? Cue, response, and reward.

Positive reinforcement values kindness and cooperation. Desirable behaviors followed by desirable consequences. Quite positive, indeed.

2. Negative reinforcement

Negative reinforcement happens when we remove an aversive the moment the desired behavior occurs. In other words, dogs learn to behave in ways that release them from discomfort and even pain.

Take prong collars, for example. Prong collars give pressure to the neck of a dog and release only when the dog stops pulling. This makes dogs stop the pulling. Dogs learn that walking next to you releases the discomfort. Then, they learn to actively avoid it.

There are two ways that negative reinforcement works. One is when the trainer removes the aversive stimulus (escape). Second is when the dog actively avoids a known unpleasant outcome (avoidance.)

Examples of negative reinforcement:

  • The trainer shocks the dog with an e-collar when she walks away and gives the command ‘come’. The trainer releases when the dog comes. The dog learns that coming after the cue/command ‘come’ is going to release her from pain (escape.)
  • The dog stays around his trainer because walking away causes discomfort (avoidance).
  • The dog knows that barking to strangers makes his people mad, so he prefers to avoid it (avoidance.)

Training with aversives requires you are very cautious and experienced. I never purposely give pain to my dog, and I don’t recommend it to you either. You risk ruining your relationship with your dog.

In any case, if you are convinced that this is the way-to-go, do not use negative reinforcement for long, and don’t make the aversive stimuli look like it comes from you. Make them look like natural consequences to your dog’s actions. Best thing, get professional help.

Now, to avoid the common confusion that makes people mistake negative reinforcement with punishment, keep reading. Keep in mind that with negative reinforcement we aim to increase a behavior, while with punishment we aim to decrease it.

3. Positive punishment

A more familiar concept, punishment happens when we introduce an unpleasant consequence when an unwanted behavior occurs. Positive punishment is when we add an unpleasant stimulus when a dog does something we do not approve or doesn’t do what is being asked.

For example, the dog walks ahead and the collar tightens. The pressure is the aversive stimulus that punishes the dog for walking ahead. That’s positive punishment. With time, the dog will learn to avoid this stimulus but not walking ahead. And that’s negative reinforcement (avoidance) in action.

Other examples of positive punishment are e-collars, shouting, prong collars, beating, showing anger, restrictions, and more.

It’s easier for most people to grasp the concept of punishment compared to the less familiar negative reinforcement and negative punishment. It has been the way to teach and manage children for centuries. When people started training dogs methodically, about a century ago, they quickly picked up what behaviorists knew at the time.

Training methods that stem from or use negative reinforcement and positive punishment is the so-called dominance training, and the alpha dog training. The latter is somehow still a thing.

4. Negative punishment

Negative punishment happens when we remove a stimulus that the dog values as a result of his/her unwanted behavior.
For example, when my dog eats poop at the park, I express verbal disapproval, and immediately she is restricted for a minute to a designated penalty area, or even sometimes our walk is over. Her freedom or playtime is over (removed) the moment she engages in a highly disapproved behavior.
Negative punishment is the dog-friendly way to communicate your disapproval and correct your dog. Dogs learn that unpleasant consequences are part of life.

Key-points and clarifications:

1. The difference between operant and classical conditioning

In the first paragraph, I mentioned Classical Conditioning as another model that describes how dogs learn. It’s helpful to distinguish the two models before you continue.
Classical conditioning is learning by association. Dogs observe the correlation between two or more events and learn to act accordingly. For example, my dog knows that the sound of the alarm in the morning (first event) means that we are getting up (second event), and a new day of fun begins. She knows that when I make breakfast she gets half an apple, that getting her leash means we are going out, that coming back means she gets to get cleaned, and she knows very well which sounds mean meal-time.

The main difference between the Operant and Classical conditioning is this: In OC we link behaviors to events (the introduction of stimuli), while in CC two events are paired to each other. In OC the dog voluntarily chooses to offer a behavior in order to enjoy a pleasant consequence or give up a behavior to avoid an unpleasant one.

2. Reinforcement focuses on the desired behavior, while punishment focuses on the undesired

Another way to categorize the four types of Operant Conditioning is by their intent. With positive and negative reinforcement we want to teach new behaviors and with punishment, we want to stop an established one.

Punishment doesn’t guide towards what’s good. Punishment stops what’s ‘bad’. While this is important when not complying with known commands gets dangerous), guiding towards what’s desirable will build a better behaved dog.

3. Behaviors treated with punishment are oppressed, not resolved

Punishing alone won’t resolve much. It is a momentary correction that with time and repetition forms fear-related compliance. Punishment relies on fear. Dogs will only comply when the punisher is there. The behavior is not forgotten unless they learn otherwise.

For example, merely punishing a dog for aggression towards strangers won’t resolve the issue, but rather oppress it. Creating anew a more positive experience for the dog around people will resolve it.

4. Reinforced behaviors tend to extinct if not continuously reinforced

Even established behaviors extinct if not continuously reinforced. When a behavior is somewhat established, move on to the next thing you want to teach your dog. For the known behaviors(commands if you prefer), employ real-life rewards.
Real-life rewards are a convenient way to reinforce behaviors out of the context of formal training. Everything that your dog likes during a normal day can be a real-life reward for a behavior you want to reinforce.
When my dog Aria learned to sit, we moved on to the next command/signal. Training devices are reserved for new goals. ”Sit” was well established. ”Sit”, and other known commands, now happen before we open the balcony door for her before we put food in her bowl, before she gets on the couch, eth. Everything she loves is a chance to reinforce her responses.

5. Positive reinforcement and negative punishment go well together

Positive reinforcement for showing the right way. Negative punishment for showing disapproval. Simply the best way to start your dog’s training. Reserve harder methods for when the situation is out of control. (Seek professional guidance)
Training with positive reinforcement and correcting with negative punishment minimizes the chances that those out of control situations will ever come.
This combination ensures you leave cruelty out. With positive reinforcement, we build relationships based on cooperation and kindness. With negative reinforcement, we show that unpleasant consequences are part of everybody’s life. The latter is a law of nature. We don’t argue with Nature.

6. Positive reinforcement should be your primary method of training

If you want to be your own dog’s trainer, which is very doable, positive reinforcement is the way-to-go. Positive Reinforcement is the method that most dog trainers in the world recommend as everybody’s primary method of training. It is the only method that guarantees a smooth symbiosis with our dogs.

Positive reinforcement regards and respects a dog’s enthusiastic and curious nature. It acknowledges a dog’s talent to find fun in everything around them. With positive reinforcement, you won’t have just an obedient dog, but a partner in fun.

Dogs learn to offer the desired behavior because they want to and not because they fear bad consequences.

Positive reinforcement is, no doubt, the best option for strengthening the bond we share with our dog(s). There is undeniable evidence for that. It’s our way to speak the Dog language and dogs’ only chance to be their best versions.

Positive reinforcement should be our first choice of training.

Quick final thought

Remember that the four quadrants are not cut and dry. Often it is difficult to say if restricting your dog’s freedom is positive or negative punishment. See the links below for further reading.

As always, complete this post with your recommendations, insights, or questions in the comment section (they are all answered). And if you feel like it, share the post with your dog folks.


Leave a Reply

Your email address will not be published. Required fields are marked *

Copy link
Powered by Social Snap