Evolution

Evolutionarily Stable Strategy

A behavior that, once common, cannot be invaded by any rare mutant — game theory rewritten for natural selection

An evolutionarily stable strategy (ESS) is a behavior that, once common in a population, cannot be invaded by any rare alternative — game theory applied to evolution. John Maynard Smith and George Price defined it in a 1973 Nature paper: a strategy S is an ESS if it does better against itself than any mutant does, or ties and then beats the mutant when the mutant is rare. Unlike a Nash equilibrium, an ESS needs no rational players — payoffs are measured in offspring, so selection alone enforces it. In the Hawk-Dove game, all-out aggression is stable only when the prize value V exceeds the injury cost C; otherwise the ESS is to escalate with probability exactly V/C and bluff the rest of the time. The framework explains why animals settle most contests with displays rather than duels, why sex ratios hover near 1:1 (but skew female in fig wasps), and why side-blotched lizards cycle endlessly through three mating morphs in a rock-paper-scissors loop that has no winner.

Defined byMaynard Smith & Price, Nature 1973
Core testE(S,S) > E(T,S), or tie then E(S,T) > E(T,T)
Relation to NashRefinement: every ESS is Nash, not vice versa
Hawk-Dove ESSPlay Hawk with probability V/C
Payoff unitsFitness (expected offspring)
When none existsRock-paper-scissors → endless cycle

Interactive visualization

Press play, or step through manually. The visualization is yours to drive — try it before reading on.

Open visualization fullscreen ↗

Watch the 60-second explainer

A condensed visual walkthrough — narrated, captioned, under a minute.

The core idea: uninvadable behavior

Imagine an entire population playing one strategy — say, every animal escalates every fight. Now drop in a single mutant doing something different. If that mutant leaves more offspring than the residents, it spreads, and the original strategy was not stable. If no possible mutant can out-reproduce the residents, the strategy is an evolutionarily stable strategy. That is the whole concept: a strategy that, once common, defends itself against every alternative by the simple fact that the alternative earns fewer descendants.

The decisive move John Maynard Smith and George Price made in 1973 was to measure payoffs in fitness — expected number of surviving offspring — rather than in money or utility. That swap matters enormously. Classical game theory (von Neumann and Morgenstern, 1944) assumed rational players who compute their best response. But a digger wasp, a stickleback, or a dung fly computes nothing. By denominating payoffs in offspring, Maynard Smith let natural selection do the optimizing automatically: whichever strategy reproduces fastest becomes common, and the ESS is wherever that race comes to rest. No brains required.

The formal definition and its two conditions

Write E(X,Y) for the expected fitness payoff to an individual playing strategy X against an opponent playing strategy Y. A resident strategy S is an ESS against a mutant T if one of two conditions holds:

Strict condition: E(S,S) > E(T,S). The resident does better against the population (which is almost entirely S) than any mutant does. This alone makes T unable to gain a foothold.
Tie-breaker: E(S,S) = E(T,S) and E(S,T) > E(T,T). If the mutant matches the resident against the common type, the contest is decided by the rare encounters in which the mutant meets a copy of itself — and there, the resident must win.

The second condition is exactly what separates ESS from Nash equilibrium. A Nash equilibrium only demands E(S,S) ≥ E(T,S), which permits neutral mutants to drift in for free. The ESS tie-breaker closes that loophole by checking the second-order encounters. Geometrically, an ESS is an attractor of the replicator dynamics — the differential equation that says a strategy's frequency grows in proportion to how far its payoff sits above the population average. Push the population a little away from an ESS and selection pushes it back; that local stability is the property a bare Nash point can lack.

The Hawk-Dove game, step by step

The canonical worked example is a contest over a resource worth V fitness units, where escalation risks an injury costing C. Each contestant uses one of two strategies. A Hawk escalates and fights until injured or victorious. A Dove displays, and retreats the instant the opponent escalates. Resolve every pairing:

Hawk vs Hawk: they fight; on average one wins V and one is injured for −C, so each expects (V − C)/2.
Hawk vs Dove: the Dove flees, the Hawk takes the whole resource: Hawk gets V, Dove gets 0.
Dove vs Dove: they display, then share: each gets V/2.

Now test pure Hawk. In an all-Hawk population, every individual plays Hawk-vs-Hawk for (V − C)/2. A rare Dove playing against Hawks earns 0. Pure Hawk is an ESS only if (V − C)/2 > 0, i.e. V > C — the prize is worth more than the worst injury. When injury is cheap relative to the resource, ruthless aggression wins and stays.

But for almost any real contest C > V: serious injury (a torn flank, a broken jaw) is reproductively catastrophic, far costlier than one meal or one mating. Then (V − C)/2 is negative, a pure-Hawk population is invadable by Doves, and there is no pure ESS. The stable solution is a mixed strategy: play Hawk with probability p* = V/C. At that frequency the average payoff to Hawk and to Dove are exactly equal, so neither type can spread — the population is pinned. This is why field studies of red deer, fiddler crabs, and spiders find ritualized displays, roaring contests, and size assessments rather than fights to the death: the math of V and C forbids relentless aggression.

Hawk-Dove payoff matrix and the V/C threshold

Scenario (payoff to row player)	vs Hawk	vs Dove	ESS outcome
Hawk	(V − C)/2	V	If V > C: pure Hawk. If C > V: mixed, Hawk with probability V/C
Dove	0	V/2
Worked numbers below assume the resource is worth V = 50 fitness units
V = 50, C = 100 (injury twice the prize)	Mixed ESS: Hawk played with p* = 50/100 = 0.50		Half escalate, half bluff
V = 50, C = 250 (injury very costly)	Mixed ESS: p* = 50/250 = 0.20		Only 20% escalate
V = 50, C = 25 (cheap injury)	V > C → pure Hawk is an ESS		Everyone escalates

The players and the assumptions ESS requires

ESS is a clean idealization, and like any model it rests on conditions worth naming because real biology bends them:

Large, well-mixed population. Encounters are random pairwise contests; the classic theory ignores spatial clustering. Adding spatial structure (lattice models, viscous populations) can stabilize cooperation that the well-mixed version forbids.
Asexual, haploid replicators (to first order). Strategies breed true. Diploid genetics, dominance, and recombination can prevent a population from reaching the ESS phenotype even when it exists — the genetic ESS and phenotypic ESS need not coincide.
Symmetric contests. Both players are interchangeable. When roles differ — resident versus intruder, large versus small — the relevant solution becomes an asymmetric ESS, and the famous result is that an arbitrary, payoff-irrelevant cue (such as "owner wins") can become a stable convention. This is the Bourgeois strategy: respect ownership, escalate as owner, retreat as intruder. Speckled wood butterflies (Pararge aegeria) defending sun-spots show almost exactly this rule.
One-shot or memoryless encounters. Repeated games with memory (the Iterated Prisoner's Dilemma) open the door to reciprocal strategies like Tit-for-Tat, which Robert Axelrod's 1980 tournaments showed can be evolutionarily stable against many invaders — though no single strategy is unbeatable in the iterated game.

Where ESS shows up in real organisms

Side-blotched lizards (Uta stansburiana). Barry Sinervo and Curtis Lively (1996, Nature) documented three male throat-color morphs locked in a rock-paper-scissors cycle: orange (ultradominant, large territories) beats blue (mate-guarders), blue beats yellow (sneaker mimics of females), and yellow beats orange. Each morph's frequency oscillates with a roughly 6-year period. There is no single-morph ESS — the only stable outcome is the cyclic polymorphism itself, the textbook case of a game with no static equilibrium.
Fig-wasp sex ratios. W. D. Hamilton's 1967 local-mate-competition theory predicts that when brothers compete to inseminate their sisters inside one fig, the ESS sex ratio is strongly female-biased — observed broods are often only 8–20% male, sharply below the 50% Fisher predicted for outbred species. The fit between predicted and measured male fraction across dozens of wasp species is among the most quantitatively successful tests in evolutionary biology.
Dung-fly mating times. Geoff Parker showed male yellow dung flies (Scathophaga stercoraria) copulate for an ESS duration near 36 minutes — the marginal-value point where extra fertilization gained by guarding longer just balances the offspring lost by not searching for a fresh female. The observed mean matches the predicted optimum closely.
Bacterial cheaters and producers. In microbial populations, "producer" cells that secrete a costly public good (a siderophore that scavenges iron, or an invertase that digests sucrose) can be invaded by "cheater" mutants that consume the good without paying. The stable mixed frequency of producers and cheaters is a mixed-ESS analog and a live model system for the evolution of cooperation and its breakdown.
Plant seed dormancy and germination timing. The fraction of seeds that germinate immediately versus stay dormant is a bet-hedging ESS tuned to the variance in good-versus-bad years — germinate too eagerly and a drought wipes out the cohort; stay dormant too long and competitors seize the soil.

ESS vs Nash equilibrium vs optimal strategy

Property	Evolutionarily stable strategy	Nash equilibrium	Group-optimal strategy
Who decides	Natural selection (no foresight)	Rational agents	A hypothetical planner
Payoff currency	Fitness (offspring)	Utility	Total population fitness
Stability requirement	Uninvadable by rare mutants (dynamic)	No profitable unilateral deviation (static)	Maximizes the average — often unstable
Defining test	E(S,S) > E(T,S), or tie + E(S,T) > E(T,T)	E(S,S) ≥ E(T,S) only	Maximize mean payoff over all players
Logical relationship	Strict subset of Nash equilibria	Superset; allows neutral drift in	Generally not a Nash point at all
Hawk-Dove result	Hawk with probability V/C	Same mixed point (and pure points)	All-Dove (sharing maximizes the average)
Why it's not group-optimal	All-Dove is invaded by a single Hawk	—	Cannot resist invasion — the tragedy of the commons

The last row is the philosophical sting of the theory. The group-best outcome in Hawk-Dove is everyone being a Dove and sharing peacefully (average payoff V/2). But that population is wide open to a single Hawk mutant who steals V from every neighbor. Selection drives the population away from the group optimum toward the lower-average ESS. ESS is therefore a precise statement of why evolution does not produce harmonious, welfare-maximizing populations — it produces uninvadable ones.

Common misconceptions and pitfalls

"An ESS is the best strategy for the species." No — it is the strategy no mutant can beat, which is frequently worse for the group than an unstable cooperative alternative. ESS describes what selection settles on, not what would maximize collective welfare.
"Animals must understand the game to play it." The opposite is the point. Payoffs are in offspring, so the strategy is encoded in genes and development, not reasoning. A wasp obeys an ESS the way a planet obeys gravity — without computing anything.
"A mixed ESS means each animal must randomize." Not necessarily. A population fraction V/C of pure Hawks and the rest pure Doves gives identical average payoffs to individual randomization. Polymorphism and individual mixing are payoff-equivalent; genetics decides which appears.
"Every game has an ESS." False. Rock-paper-scissors games (like the side-blotched lizard morphs) have none — the population cycles forever. When ESS predicts no static point, real systems often genuinely oscillate.
"ESS and Nash equilibrium are the same." Every ESS is a Nash equilibrium, but many Nash equilibria are not ESSs because they fail the stability tie-breaker E(S,T) > E(T,T) and can be eroded by neutral drift.
"The Bourgeois 'owner wins' rule reflects real fighting ability." It need not. In the symmetric Hawk-Dove-Bourgeois game, the convention can be completely arbitrary — "respect ownership" is stable purely because everyone follows it, even if owner and intruder are physically identical. The cue is a coordination device, not a measure of strength.

Frequently asked questions

What exactly makes a strategy an ESS?

A strategy S is an evolutionarily stable strategy if a population of S-players cannot be invaded by any rare mutant strategy T. Maynard Smith and Price gave two precise conditions, and S qualifies if it satisfies the first or the second. Condition one: E(S,S) > E(T,S), meaning S does strictly better against the resident population than any mutant does — this alone guarantees uninvadability. Condition two (the tie-breaker, needed when E(S,S) = E(T,S)): E(S,T) > E(T,T), meaning when the mutant is rare and occasionally meets a copy of itself, S still outperforms it. Here E(X,Y) is the expected payoff (in offspring or fitness units) to a player using strategy X against an opponent using strategy Y. Because payoffs are measured in reproductive success, no rationality or foresight is required — natural selection does the optimizing.

How is an ESS different from a Nash equilibrium?

Every ESS is a Nash equilibrium, but not every Nash equilibrium is an ESS. A Nash equilibrium only requires that no player can do strictly better by unilaterally deviating: E(S,S) is at least as large as E(T,S) for all T. That allows neutral mutants (E(S,S) = E(T,S)) to drift in freely and potentially take over. An ESS adds the stability tie-breaker — the second condition E(S,T) > E(T,T) — which a plain Nash equilibrium need not satisfy. The deeper difference is conceptual: Nash equilibrium assumes rational agents reasoning about each other's choices, whereas ESS replaces rationality with population dynamics. Maynard Smith built the idea explicitly so it would apply to insects and lizards that cannot calculate anything. The ESS is the resting point of replicator dynamics rather than the fixed point of a reasoning process.

Why don't animals usually fight to the death over resources?

The Hawk-Dove game answers this. Let V be the value of the contested resource and C the cost of injury from an escalated fight, both in fitness units. A Hawk always escalates; a Dove displays and retreats if attacked. Two Hawks split the expected outcome but pay the injury cost, giving each an average payoff of (V − C)/2; a Hawk meeting a Dove takes the whole V; two Doves share, getting V/2 each. If C > V — injury costs more than the prize is worth — then a pure-Hawk population is invadable, because a rare Dove avoids the ruinous (V − C)/2 payoff. The ESS is instead a mixed strategy: escalate with probability V/C and display otherwise. At that frequency, Hawk and Dove earn identical average payoffs, so neither can spread. Real contests over mates, territory, or food almost always have C > V because serious injury is reproductively catastrophic, which is why ritualized displays, not duels to the death, dominate.

Can an ESS be a mix of behaviors instead of a single one?

Yes, and this is one of the theory's most important results. A mixed ESS can be realized two equivalent ways at the population level. First, every individual can randomize — playing Hawk with probability V/C on each encounter. Second, the population can be polymorphic, with a fraction V/C of individuals being pure Hawks and the rest pure Doves; the average payoffs come out identical. Selection cannot tell these apart from payoffs alone, so which one evolves depends on genetics and development. The side-blotched lizard Uta stansburiana shows a genuine polymorphic case: three throat-color morphs (orange, blue, yellow) coexist in a rock-paper-scissors cycle with roughly six-year periods, and no single morph is an ESS — the only stable outcome is the mixture itself, sometimes called a stable polymorphism rather than a strict ESS.

How does ESS explain 1:1 sex ratios?

R. A. Fisher's 1930 argument is the original ESS-style reasoning, predating the formal theory. Because every offspring has one mother and one father, the total reproductive value of all males equals that of all females. If a population skews toward females, then sons become scarce and each son fathers more grand-offspring on average — so a mutant that produces extra sons spreads, and vice versa. The only frequency that no mutant can invade is the one where parents spend equal resource on sons and daughters, giving a 1:1 ratio when the two sexes cost the same to produce. The exception proves the rule: in fig wasps, where mated daughters disperse but brothers compete to fertilize their own sisters inside a single fig (local mate competition, Hamilton 1967), the ESS sex ratio is heavily female-biased — often 10 to 20 percent males — exactly as the payoff math predicts.

Does an ESS always exist?

No. Some games have no ESS at all in pure or mixed strategies. The rock-paper-scissors structure is the classic counterexample: rock beats scissors, scissors beats paper, paper beats rock, so whichever strategy becomes common is beaten by the strategy it loses to, and the population cycles forever instead of settling. The side-blotched lizard morphs follow exactly this cyclic dynamic in the field. ESS also assumes a large, well-mixed population of haploid asexual replicators playing pairwise symmetric contests; relaxing any of these — finite populations, spatial structure, repeated games with memory, asymmetric roles (owner versus intruder), or sexual diploid genetics — can create, destroy, or shift the equilibria. So ESS is a powerful first approximation, not a universal law: when it predicts no stable point, the biology often does cycle.