Question 1

How does it differ from Pavlovian conditioning?

Accepted Answer

Pavlovian (classical, respondent) conditioning links a neutral stimulus to a reflex via pairing — Pavlov's dogs salivating at a bell. The animal's response is involuntary and triggered by the stimulus. Operant conditioning links voluntary, emitted behavior to its consequences — the dog learns to sit when sitting is rewarded. Skinner emphasized this distinction; both processes occur in real situations and often interact.

Question 2

What is shaping?

Accepted Answer

Reinforcing successive approximations to a target behavior. Skinner trained a pigeon to bowl by reinforcing first any movement toward the ball, then nudging it, then with increasing force. Without shaping, the target behavior would never spontaneously appear to be reinforced. Used in therapy (gradual exposure, social skills training), education (scaffolded learning), and animal training (zoos, service dogs, marine mammals).

Question 3

What are reinforcement schedules?

Accepted Answer

Rules for when behavior is reinforced. Continuous — every response. Partial schedules. (1) Fixed-ratio (FR) — every Nth response (factory piece-rate). (2) Variable-ratio (VR) — average N responses (slot machines, fishing). (3) Fixed-interval (FI) — first response after fixed time (paycheck). (4) Variable-interval (VI) — first response after variable time (random check-ins). Each produces distinctive cumulative records: VR yields the highest, most persistent rates; FI yields scalloping near reinforcement time.

Question 4

Why is variable-ratio so addictive?

Accepted Answer

Extinction-resistant. Because reinforcement is unpredictable, the absence of reward never signals "stop trying" — maybe the next one. Slot machines deliver cash on a VR schedule; social media notifications and email refresh similarly. Behavior persists long after rewards become rare. This is the same property that makes intermittent reinforcement of children's whining so hard to extinguish — give in occasionally and the behavior is reinforced for years.

Question 5

What is the Premack principle?

Accepted Answer

Premack (1965) — a higher-frequency behavior reinforces a lower-frequency one. "Eat your vegetables, then dessert." Children clean their room (low frequency) for video-game time (high frequency). Generalized to "the relativity of reinforcement" — what counts as reward depends on baseline rates. Useful in classrooms and therapy because it identifies reinforcers from observation rather than relying on guessing what the child finds rewarding.

Question 6

Does punishment work?

Accepted Answer

It suppresses behavior in the short term but with caveats. Punishment must be immediate, certain, and proportionate to be effective. Side effects. (1) Generalized fear and avoidance of the punisher. (2) Aggression. (3) Modeling — children punished physically are more likely to be aggressive. (4) Suppression without learning alternative behavior. Most behaviorists prefer reinforcement of incompatible behavior over punishment whenever possible. Severe punishment is unethical and counterproductive.

Question 7

Where is it applied today?

Accepted Answer

Many domains. (1) Applied behavior analysis (ABA) — autism intervention, contested but widely used. (2) Token economies — schools, prisons, hospitals. (3) Animal training — service dogs, zoos, conservation. (4) Behavior modification — habit formation, addiction recovery. (5) Tech design — gamification, streaks, badges. (6) Animal welfare — operant choice tests measure preferences. (7) Pharmaceutical research — operant tasks assess drug effects on motivation.

Operant Conditioning

Interactive visualization

Watch the 60-second explainer

Why operant conditioning matters

Common misconceptions

Frequently asked questions

Interactive visualization

Watch the 60-second explainer

Why operant conditioning matters

Common misconceptions

Frequently asked questions

Related concepts