Types of Reinforcement and Punishment (3.8.2) | AP Psychology Notes

AP Syllabus focus:

‘Reinforcement and punishment can be positive or negative, and reinforcers may be primary or secondary.’

Operant conditioning explains how consequences shape behaviour. This page focuses on categorising consequences precisely: whether they strengthen or weaken behaviour, whether something is added or removed, and what kinds of reinforcers are involved.

Core idea: consequences change behaviour

Reinforcement (increases behaviour)

Reinforcement: Any consequence that increases the future likelihood of a behaviour.

Reinforcement is defined by its effect (behaviour increases), not by intentions or whether it seems “nice.”

Punishment (decreases behaviour)

Punishment: Any consequence that decreases the future likelihood of a behaviour.

Punishment is also defined by its effect (behaviour decreases). If behaviour does not drop reliably, the consequence was not functioning as punishment.

Positive vs negative: added vs removed (not “good” vs “bad”)

In operant conditioning, positive means a stimulus is presented/added after a behaviour, and negative means a stimulus is removed/subtracted after a behaviour. These labels do not indicate moral value.

The four consequence types (two reinforcements, two punishments)

Positive reinforcement: add a desirable stimulus to increase behaviour.
Negative reinforcement: remove an aversive stimulus to increase behaviour.
Positive punishment: add an aversive stimulus to decrease behaviour.
Negative punishment: remove a desirable stimulus to decrease behaviour.

A key AP Psychology skill is distinguishing negative reinforcement from punishment: negative reinforcement strengthens behaviour because it produces relief/escape (something unpleasant ends).

How to classify quickly (decision steps)

Step 1: Did the behaviour increase (reinforcement) or decrease (punishment)?
Step 2: Was something added (positive) or taken away (negative)?
Step 3: Name it using the two-part label (e.g., negative punishment).

This flowchart operationalizes classification in operant conditioning by first asking whether the consequence makes the behavior more or less likely to recur (reinforcement vs. punishment). It then separates “positive” (something added) from “negative” (something removed), yielding the four standard categories. Use it as a quick diagnostic to avoid the common confusion between negative reinforcement and punishment. Source

Types of reinforcers: primary vs secondary

Reinforcement can be delivered through different kinds of reinforcers (stimuli that function as reinforcement).

Primary (unconditioned) reinforcers

Primary reinforcer: A reinforcer that is innately rewarding because it satisfies a biological need (e.g., food, water, warmth).

Primary reinforcers do not require learning to be effective, though their strength can vary with physiological state (e.g., hunger).

Secondary (conditioned) reinforcers

Secondary reinforcer: A reinforcer that gains value through learning, typically by being associated with primary reinforcers or other established rewards.

Secondary reinforcers are powerful in human settings because they can be widely applied and exchanged. A special case is a generalised conditioned reinforcer (e.g., money), which is linked to many different rewards and therefore remains reinforcing across situations.

Key distinctions AP Psychology expects

Reinforcement vs punishment: defined by the direction of behaviour change.
Positive vs negative: defined by whether a stimulus is added or removed, not whether it feels pleasant.
Reinforcer type: primary (biological) versus secondary (learned through association).
The same consequence can function differently depending on the person and context; classification must be based on observable behavioural effects over time.

FAQ

Yes. Whether something is a reinforcer depends on its effect on behaviour for that individual.

Reasons include:

Preferences and prior learning history
Current needs (e.g., tiredness, hunger)
Cultural values tied to certain rewards

It is a secondary reinforcer associated with many different reinforcers (not just one).

Because it “works” across contexts, it is less dependent on any single deprivation state and is highly flexible in shaping behaviour.

They alter the current value of a reinforcer.

Deprivation tends to increase effectiveness (more motivating).
Satiation tends to decrease effectiveness (less motivating).

This helps explain why a consequence may stop working without changing the formal contingency.

The Premack principle states that a more probable (preferred) behaviour can reinforce a less probable behaviour.

Example structure (no full scenario needed):

Access to a high-frequency activity becomes the reinforcer for completing a low-frequency activity.

Common reasons include:

Inconsistent delivery (behaviour is not reliably followed by the punisher)
Delayed consequences (weakens learning link)
The “punisher” is not actually aversive to that person
The behaviour is still reinforced elsewhere (competing reinforcement)

Practice Questions

In operant conditioning, distinguish between negative reinforcement and negative punishment. (3 marks)

Negative reinforcement: behaviour increases because an aversive stimulus is removed/avoided. (1)
Negative punishment: behaviour decreases because a desirable stimulus is removed. (1)
Clear statement that “negative” refers to removal/subtraction (not “bad”). (1)

A teacher introduces a points system in which students earn points that can be exchanged for privileges, and sometimes removes points after rule-breaking. Using operant conditioning terminology, explain the types of reinforcers involved and classify the consequences as positive/negative reinforcement or punishment. (6 marks)

Identifies points as a secondary/conditioned reinforcer. (1)
Explains that points gain value through association with privileges/rewards. (1)
Notes that privileges (if inherently enjoyable/valued) can function as reinforcers. (1)
Earning points to increase desired behaviour = positive reinforcement (adds a stimulus). (1)
Removing points to reduce rule-breaking = negative punishment (removes a desirable stimulus). (1)
Uses correct positive/negative logic: addition vs removal, with behaviour increase vs decrease. (1)

Try All Topic Practice Questions

Written by:

Valentina

Profile

Oxford University - Experimental Psychology

Valentina is an Oxford-educated psychologist. Experienced in creating educational resources, she has dedicated the past 5 years to nurturing future minds as an A-Level and IB Psychology tutor.