Industrial Organization

Stackelberg Leader

Move first, commit visibly, and exploit the follower's reaction curve — the first-mover advantage of sequential quantity competition

A Stackelberg leader is the first mover in a sequential quantity-competition duopoly: it commits to an output q_L knowing the follower will respond with its best-response quantity q_F. Under linear demand the leader produces twice as much as the follower and earns twice the profit — but only if its commitment is credible.

  • Introduced byHeinrich von Stackelberg, 1934
  • Game formSequential, perfect information
  • Leader outputq_L = (a − c) / 2b
  • Follower outputq_F = (a − c) / 4b
  • Profit ratioπ_L = 2 π_F
  • Solution conceptSubgame-perfect Nash (Selten 1965)

Interactive visualization

Press play, or step through manually. The visualization is yours to drive — try it before reading on.

Open visualization fullscreen ↗

Watch the 60-second explainer

A condensed visual walkthrough — narrated, captioned, under a minute.

The game: two firms, one demand curve, two stages

Consider an industry with exactly two firms producing a homogeneous good. Demand is linear: the market price falls as total quantity rises,

P(Q) = a − b Q,   where Q = q_L + q_F.

Both firms have the same constant marginal cost c, with a > c > 0 and b > 0. What distinguishes Stackelberg from the simultaneous Cournot game is the timing. Stage 1: the leader chooses q_L. Stage 2: the follower observes q_L and then chooses q_F. The follower's decision is made with full knowledge of the leader's quantity; the leader's decision is made with full knowledge that the follower will optimise afterward. Both stages happen exactly once and both firms are profit-maximisers.

The model was introduced by the German economist Heinrich von Stackelberg in his 1934 monograph Marktform und Gleichgewicht ("Market Structure and Equilibrium"). Stackelberg's question was simple: what changes when one firm moves first? The answer turned out to be unintuitive — moving first can be a strict advantage, not a handicap — and the analysis became one of the foundational results of modern industrial organization.

Backward induction: solve the follower first

The standard procedure for sequential games is backward induction. Start with the last decision and work back. Given any q_L the leader has already committed to, the follower's profit is

π_F(q_F | q_L) = (P − c) q_F
              = (a − b(q_L + q_F) − c) q_F.

Differentiating with respect to q_F and setting the derivative to zero,

dπ_F / dq_F = a − c − b q_L − 2 b q_F = 0
           →  q_F*(q_L) = (a − c − b q_L) / (2b).

That is the follower's reaction function or best-response curve. It is decreasing in q_L: every extra unit the leader produces pushes the follower's optimum quantity down by half a unit. Intuitively, the leader steals some of the market, the residual demand shrinks, and the follower's profit-maximising response is to produce less.

The leader, anticipating this, treats q_F as the known function q_F*(q_L) rather than as a free choice. Its profit becomes

π_L(q_L) = (a − b(q_L + q_F*(q_L)) − c) q_L
        = (a − c − b q_L − b · (a − c − b q_L)/(2b)) q_L
        = ((a − c) / 2 − (b/2) q_L) q_L.

This is a quadratic in q_L, maximised at

q_L* = (a − c) / (2b).

Substituting back gives the follower's equilibrium output and the implied total,

q_F* = (a − c − b · (a − c)/(2b)) / (2b) = (a − c) / (4b)
Q*  = q_L* + q_F* = 3(a − c) / (4b)
P*  = a − b Q* = (a + 3c) / 4.

The leader produces exactly twice the follower's quantity. Equilibrium profits are

π_L* = (a − c)² / (8b)
π_F* = (a − c)² / (16b).

The leader earns precisely double the follower's profit. That ratio is a constant — it does not depend on a, b, or c, only on the structure of the game.

Stackelberg vs Cournot vs Bertrand: a complete ladder

The four canonical models of duopolistic interaction sort by how aggressively output is competed away. Working from least to most competitive:

ModelTimingTotal QPrice Pπ per firmConsumer surplus
Monopoly(a − c) / 2b(a + c) / 2(a − c)² / 4bbaseline
Cournot duopolySimultaneous quantities2(a − c) / 3b(a + 2c) / 3(a − c)² / 9b (each)higher
Stackelberg duopolySequential quantities3(a − c) / 4b(a + 3c) / 4L: (a − c)² / 8b · F: (a − c)² / 16bhigher still
Bertrand duopolySimultaneous prices(a − c) / bc0maximum

The pattern is striking. With the same demand curve and the same costs, just rearranging the rules of the game moves the outcome smoothly from joint-monopoly profit to zero profit. Stackelberg sits between Cournot and Bertrand on every welfare dimension. Total output is larger than under Cournot — so consumers are better off — but the leader's profit is larger than its Cournot profit, while the follower's profit is smaller. Total industry profit is lower under Stackelberg than Cournot, because the leader's output increase more than offsets the follower's contraction.

Why commitment is the real story

Stackelberg is often described as a "first-mover advantage" model, and the description is correct as far as it goes — but the deep reason the leader wins is commitment, not chronological priority. If the leader could re-optimise after seeing q_F, it would prefer to do so. Given the follower's actual quantity (a − c)/(4b), the leader's unconditional best response would be to produce its own Cournot best response to that quantity, which is smaller than (a − c)/(2b). The Stackelberg equilibrium is sustained precisely because the leader cannot reverse course: q_L is locked in before the follower moves.

If commitment fails, the equilibrium unravels. The follower, knowing the leader can re-optimise, treats the situation as effectively simultaneous and plays its Cournot quantity. The leader, also knowing this, plays its Cournot quantity in response. The system reverts to the Cournot equilibrium with each firm producing (a − c)/(3b) and earning (a − c)²/(9b) — a worse outcome for the leader than the Stackelberg payoff.

This is the central insight behind Avinash Dixit's 1980 entry-deterrence model. An incumbent firm builds productive capacity before a potential entrant decides whether to enter. The capacity investment is a sunk cost: it makes the threat to flood the market credible, because once the factory is built the marginal cost of producing the threatened output is low. The entrant, anticipating that flood, stays out. Capacity is the commitment device that turns a Cournot game into a Stackelberg game in the incumbent's favour.

A worked numerical example

Let a = 100, b = 1, c = 20. Demand is P = 100 − Q with constant marginal cost 20.

Quantity / outcomeCournotStackelberg
Firm 1 (leader) output26.6740
Firm 2 (follower) output26.6720
Total output Q53.3360
Market price P46.6740
Firm 1 profit711.11800
Firm 2 profit711.11400
Total industry profit1422.221200
Consumer surplus (½ b Q²)1422.221800

Going from Cournot to Stackelberg, the leader gains 89 in profit while the follower loses 311, for a net industry loss of 222. Consumers gain 378, more than offsetting the producer loss, so total welfare rises. Stackelberg is a step toward the competitive outcome, but it is asymmetric: the leader is the one party that prefers it over Cournot.

Beyond linear demand

The clean closed-form results — q_L = 2 q_F, π_L = 2 π_F — are specific to linear demand and constant marginal cost. The qualitative results are not. For any downward-sloping demand and convex cost where the follower's best-response function is decreasing in q_L, the Stackelberg leader will produce more than its Cournot quantity, the follower will produce less than its Cournot quantity, and the leader's profit will strictly exceed its Cournot profit. The exact ratios depend on the demand and cost curves, but the ordering π_L,Stackelberg > π_L,Cournot is robust.

Two technical conditions are required for the leader's commitment to pay off. First, the follower's reaction function must be downward-sloping (quantities are strategic substitutes). If reaction functions sloped upward (strategic complements, as in Bertrand price competition with differentiated products), being a Stackelberg leader can actually hurt the leader compared with simultaneous play. Second, the follower must observe q_L. If q_L is private or noisy, the model collapses to a signalling game and the simple backward-induction logic no longer applies.

Variants and extensions

  • Multiple followers. If there are n − 1 followers responding to a single leader, total follower output is (n − 1)(a − c)/(2bn) and the leader's output remains (a − c)/(2b). The leader's share of total output falls but its absolute output is unchanged.
  • Price leadership. Stackelberg in prices rather than quantities. Used to model dominant-firm pricing and benchmark-setting industries. The dominant firm sets a price; smaller fringe firms take it as given and supply whatever they want at that price. The leader serves residual demand.
  • Stackelberg with capacity constraints (Dixit 1980). The leader chooses capacity in stage 1; the follower observes capacity then decides whether to enter and at what scale. Used as the workhorse model of entry deterrence and limit pricing.
  • Endogenous timing (Hamilton-Slutsky 1990). Allow firms to choose whether to move early or late. Both firms generally prefer to be leader, giving rise to a war of attrition over timing. With asymmetric costs, the low-cost firm typically emerges as the natural leader.
  • Stackelberg with R&D or product quality. Stages 0–1 are R&D investment, stages 2–3 are quantity competition. Leadership in R&D translates to leadership in production. The model is the basis for much of the literature on technology races and innovation.
  • Bayesian Stackelberg. The follower's type (its cost, demand, or capacity) is private information. The leader maximises expected profit over the type distribution. The basis for the Stackelberg-security-game literature in computer science (e.g. airport screening, wildlife protection).

Where Stackelberg shows up in real markets

  • OPEC and the Saudi swing producer. The canonical real-world example. Saudi Arabia, controlling roughly 30% of OPEC capacity and historically the lowest-cost producer, has long played the role of "swing producer": it adjusts its output first to target a price, and the rest of OPEC plus the non-OPEC fringe adjust afterward. The post-2014 shale revolution and the 2020 price war complicated the picture but the underlying Stackelberg dynamic — one large player whose commitments are unusually credible because of sunk infrastructure costs — remains the textbook framing of the global oil market.
  • Incumbent vs entrant. Established firms in capital-intensive industries (semiconductors, airliners, telecommunications) build capacity ahead of demand to deter entry. The sunk capacity acts as the leader's commitment device. The literature on limit pricing, entry deterrence, and predatory pricing is essentially applied Stackelberg.
  • Dominant-firm pricing. A large firm sets a benchmark price; smaller fringe firms match it and supply residual demand. Found historically in steel (U.S. Steel as price leader in the early 20th century), tobacco, and ready-to-eat cereals. The leader's commitment device is its visibility and the cost to the fringe of starting a price war.
  • Platform launches. The first credible commitment to a large user base — through network-effects investments, marketing, exclusive content — discourages later entrants from contesting the market. The mechanism is Stackelberg in quality or coverage rather than physical quantity, but the logic is identical.
  • Stackelberg security games. A defender (police, customs, conservation agency) commits to a randomised patrol schedule that an attacker observes before choosing where to strike. The commitment is the publicly announced patrol policy. Real deployments include the Federal Air Marshal Service, LAX airport screening, and anti-poaching patrols in African wildlife reserves (PROTECT/ARMOR/IRIS systems).

Common pitfalls

  • Confusing first-mover advantage with timing. Moving first by itself is not the source of the advantage; commitment is. A leader that can be observed reconsidering — for instance through quietly adjusted shipments or contractual escape clauses — receives no Stackelberg premium. The follower will rationally treat the situation as simultaneous Cournot.
  • Applying it to price competition with substitutes. Stackelberg in quantities assumes strategic substitutes (reactions slope down). With differentiated products competing on price, reactions typically slope up (raising your price gives your rival room to raise theirs), and leadership becomes a handicap rather than an advantage.
  • Forgetting that total profit falls. Industry profit is lower under Stackelberg than under Cournot. From an industry-wide perspective Stackelberg is a worse cartel — the leader's gain is smaller than the follower's loss. Firms with the ability to collude prefer Cournot or, better still, joint-monopoly outcomes.
  • Ignoring the follower's exit option. If the follower's equilibrium profit falls below its outside option (or below zero with fixed entry costs), it exits and the leader becomes a monopolist. Models that treat the follower as captive miss this important boundary case — and it is exactly what makes Stackelberg useful for studying entry deterrence.
  • Misreading "leader produces more, earns more" as a recipe. The leader produces more because committing to that level is the optimal exploitation of the follower's reaction curve. Doubling output for its own sake — without the structural ability to commit — just drives down price and profit for both firms.

Frequently asked questions

What makes Stackelberg different from Cournot?

Both models have two firms choosing quantities under the same demand curve, but Cournot is simultaneous — neither firm sees the other's choice before committing — while Stackelberg is sequential: the leader moves first and the follower observes q_L before responding. Sequencing matters enormously. In Cournot with linear demand and equal costs, each firm produces (a − c)/(3b) and earns identical profit. In Stackelberg the leader produces (a − c)/(2b), the follower only (a − c)/(4b), and the leader earns double the follower's profit.

Why does the leader benefit from moving first?

Because commitment changes the follower's best response. When the leader commits visibly to a large q_L, the follower's best response is to produce less — undercutting the leader would just push price down further and hurt the follower more than the leader. The leader exploits this by picking the point on the follower's reaction curve that maximises its own profit, rather than the intersection of the two reaction curves (which is Cournot). The advantage is sometimes called a "first-mover advantage", but the deep reason is commitment, not timing per se.

What if the leader cannot commit?

Then there is no Stackelberg equilibrium — the model collapses to Cournot. The leader's threat to produce (a − c)/(2b) is only credible if the follower believes the leader cannot or will not back down once the follower picks its quantity. In practice, commitment is established by sinking costs (building capacity, signing long-term contracts), public announcements that damage reputation if reversed, or contractual penalties. Without one, the follower will rationally expect the leader to re-optimise, and the only consistent equilibrium is the simultaneous Cournot outcome.

How is Stackelberg solved by backward induction?

You solve the second-stage problem first. Given any q_L the leader might choose, the follower picks q_F to maximise π_F = (a − b(q_L + q_F) − c) q_F, giving the reaction function q_F*(q_L) = (a − c − b q_L) / (2b). The leader, knowing this, substitutes q_F*(q_L) into its own profit and maximises over q_L. The answer is q_L* = (a − c)/(2b), and q_F* = (a − c)/(4b). This is backward induction, the algorithmic foundation of subgame perfection.

What is a subgame-perfect equilibrium and why does Stackelberg need it?

A Nash equilibrium asks that no player wants to deviate given the others' strategies; a subgame-perfect equilibrium (SPE) further requires the strategies to be optimal in every subgame, including those off the equilibrium path. Reinhard Selten introduced the concept in 1965 to eliminate Nash equilibria sustained by non-credible threats. In Stackelberg there are many Nash equilibria but most are not subgame perfect. Backward induction selects the unique SPE: q_L = (a − c)/(2b), q_F = (a − c)/(4b).

How do total quantity and price compare to Cournot and Bertrand?

With linear demand P = a − bQ and constant marginal cost c: Cournot Q_C = 2(a − c)/(3b), price P_C = (a + 2c)/3. Stackelberg Q_S = 3(a − c)/(4b), price P_S = (a + 3c)/4. Bertrand Q_B = (a − c)/b, price P_B = c. Ranking: P_C > P_S > P_B and Q_C < Q_S < Q_B. Stackelberg sits between Cournot and Bertrand on every welfare dimension — more output and lower price than Cournot, but still above marginal cost.

Are there real-world Stackelberg markets?

The classical example is OPEC with Saudi Arabia as the "swing producer": Saudi Arabia adjusts its output first to set a target price, and the rest of OPEC plus the non-OPEC fringe adjust afterward. Other plausible cases include incumbent–entrant interactions where the incumbent builds capacity before the entrant decides whether to enter (Dixit 1980), dominant-firm pricing where a price leader sets a benchmark that smaller competitors then match, and product-launch races where the first credible commitment to volume signals later movers to stay small.