Decision Theory
Ellsberg Paradox
Why people prefer known 50/50 odds to unknown ones — and what that breaks in Savage's subjective expected utility
Daniel Ellsberg's 1961 paradox: people prefer betting on a known 50/50 urn rather than one with unknown red-black ratio — ambiguity aversion violates Savage's sure-thing principle.
- Posed byDaniel Ellsberg, 1961
- PublishedQuarterly Journal of Economics
- 2-color urn50/50 known vs 100-ball unknown
- Axiom violatedSavage's sure-thing principle
- PhenomenonAmbiguity aversion
- RootsKnightian uncertainty (Knight 1921)
Interactive visualization
Press play, or step through manually. The visualization is yours to drive — try it before reading on.
Watch the 60-second explainer
A condensed visual walkthrough — narrated, captioned, under a minute.
The two-urn version, exactly
Stand two urns in front of you. Urn K (known) contains exactly 50 red balls and 50 black balls. Urn U (unknown) contains 100 balls, with the red-black ratio chosen by an adversary — could be 0/100, could be 50/50, could be 99/1. You don't know.
The experimenter offers four bets, $100 if you win, $0 if you lose:
- RK: Draw from Urn K — win if the ball is red.
- RU: Draw from Urn U — win if the ball is red.
- BK: Draw from Urn K — win if the ball is black.
- BU: Draw from Urn U — win if the ball is black.
You're asked: (a) prefer RK or RU? Then (b) prefer BK or BU? Across thousands of experimental subjects, the dominant pattern is RK ≻ RU and BK ≻ BU — pick the known urn both times. The known 50% chance feels safer than the unknown ratio.
Why no probability assignment makes this consistent
Suppose you do assign a subjective probability p to "red is drawn from Urn U" (and so 1−p to black). For RK ≻ RU, you need 0.5 > p — so p < 0.5. For BK ≻ BU, you need 0.5 > 1−p — so p > 0.5. Both inequalities require p < 0.5 and p > 0.5 — a contradiction.
Equivalently, RK ≻ RU says you think red is less likely in Urn U than in Urn K. BK ≻ BU says you think black is also less likely in Urn U than in Urn K. But P(red) + P(black) = 1 in both urns. If you think both colors are less likely in U than in K, your probabilities sum to less than 1 in Urn U — incoherent. No probability assignment rationalizes the pair of choices.
Savage's 1954 axioms (the foundation of subjective expected utility) include the sure-thing principle: if you prefer outcome a to outcome b in all states of the world, you must prefer a-overall to b-overall. The Ellsberg pattern violates this. The agent is acting on something other than a single probability distribution.
Two-color and three-color variants compared
| Two-color (Urns K, U) | Three-color (90-ball urn) | |
|---|---|---|
| Setup | Urn K = 50R+50B; Urn U = 100 balls in unknown R/B ratio | 30R + 60 unknown mix of B and Y |
| Choice 1 | Bet on red — pick urn | A: Bet on red. B: Bet on black |
| Choice 2 | Bet on black — pick urn | A': Bet on red ∪ yellow. B': Bet on black ∪ yellow |
| Modal pattern | K, K — pick known urn both times | A and B' — pick the known probability each time |
| Probabilities | P(R,U) + P(B,U) sums < 1 | P(R) < 1/3 & P(R∪Y) > 1/3 — yellow's prob inconsistent |
| Axiom violated | P(·) cannot be assigned consistently | Sure-thing principle (yellow cancels) |
| Interpretation | Ambiguity aversion vs known risk | Same — but with one ambiguous and one risky bet at a time |
| Replication rate | ~60–75% across hundreds of studies | ~70% in original Ellsberg (1961) and replications |
From Harvard to the Pentagon Papers
Daniel Ellsberg was a 30-year-old RAND Corporation strategic analyst when he wrote the paradox into his 1961 Harvard PhD thesis. His advisor Howard Raiffa, one of the architects of modern decision analysis, was initially skeptical — the paradox seemed too simple. But Raiffa replicated it on his own faculty colleagues at Harvard Business School and confirmed the pattern.
The paper appeared in the Quarterly Journal of Economics in November 1961, titled "Risk, Ambiguity, and the Savage Axioms." Reception was muted for two reasons. First, Savage himself (whom Ellsberg cited extensively) maintained that the subjects were just confused — a position he held until his 1971 death. Second, Ellsberg's career trajectory shifted: by 1969 he was a RAND consultant on the Vietnam War; by 1971 he had leaked the Pentagon Papers; by 1973 he was facing 115 years in prison under the Espionage Act (charges later dismissed for prosecutorial misconduct).
The paradox sat dormant until Itzhak Gilboa and David Schmeidler's 1989 maxmin expected utility paper formalized ambiguity aversion. Gilboa-Schmeidler showed how to model the Ellsberg pattern with a set of priors rather than a single distribution — the foundation of the modern "ambiguity averse" decision theory subfield. Ellsberg lived to see his paradox become the lead example in nearly every graduate decision theory textbook. He died in June 2023 at 92.
Variants and extensions
- Maxmin expected utility (Gilboa-Schmeidler 1989). Agent considers a set 𝒫 of priors and maximizes minp∈𝒫 Ep[u(x)]. The dominant ambiguity-averse model in finance.
- Choquet expected utility (Schmeidler 1989). Replaces probabilities with non-additive capacities — assigns lower weight to ambiguous events than additive probability would.
- Smooth ambiguity (Klibanoff-Marinacci-Mukerji 2005). Two-stage model: first form a prior over priors (a second-order distribution), then aggregate with a concave function reflecting ambiguity attitude. Separates risk aversion (over outcomes) from ambiguity aversion (over priors).
- α-maxmin (Hurwicz 1951, Ghirardato-Maccheroni-Marinacci 2004). Weighted combination of worst-case and best-case expected utility. Pure maxmin is the α = 1 limit.
- Multiple priors model in macro (Hansen-Sargent 2001). Brought Knightian uncertainty into business-cycle and asset-pricing macroeconomics; agents make robust decisions under model uncertainty.
- Variational preferences (Maccheroni-Marinacci-Rustichini 2006). Most general axiomatic framework — encompasses maxmin, smooth ambiguity, and multiplier preferences.
Real-world applications
- Equity premium puzzle. Knightian uncertainty about long-run consumption growth explains a chunk of the historical excess return on stocks vs bonds (Maenhout 2004, Ju-Miao 2012) without absurd risk aversion.
- Home bias in investing. Investors over-allocate to domestic assets despite gains from international diversification — partially explained by ambiguity aversion over unfamiliar foreign markets (French-Poterba 1991, Uppal-Wang 2003).
- Insurance reluctance for low-probability events. People underinsure for ambiguous risks (cyber, terror, pandemic) — willingness-to-pay drops when probabilities are vague (Hogarth-Kunreuther 1989).
- Climate policy. Weitzman's dismal theorem (2009) argues that fat-tailed Knightian uncertainty about catastrophic warming dominates standard cost-benefit analysis; deep uncertainty is the policy problem.
- Monetary policy under uncertainty. Hansen-Sargent's robust-control framework — used at the Fed and ECB — explicitly models ambiguity over the macroeconomy.
- Pharmaceutical pricing. Ambiguity aversion explains why patients prefer therapies with known modest efficacy over experimental treatments with unknown — possibly higher — payoffs.
- Pandemic decision-making. COVID-19 policies in early 2020 reflected Knightian uncertainty about transmission and mortality; ambiguity-averse modeling predicted earlier lockdowns than risk-neutral cost-benefit (Manski 2020).
Common pitfalls and critiques
- Treating ambiguity as just risk with broader priors. Bayesian dogma: a "complete" prior subsumes uncertainty. Ellsberg's response: the way you behave when you have no prior to commit to is empirically different — and the difference is the phenomenon.
- Suspicion-of-the-experimenter explanation. Maybe subjects pick the known urn because they suspect the experimenter rigged the unknown one. Replications that let subjects construct the urn themselves still find the pattern, weakening this explanation.
- Confounding with risk aversion. Standard risk aversion (concave u) is consistent with EU; ambiguity aversion is a separate, additive phenomenon. Smooth-ambiguity models cleanly separate them.
- Generalizing to all uncertainty. Some empirical work (Charness-Karni-Levin 2013, Trautmann-van de Kuilen 2015) finds substantial heterogeneity: a sizable minority shows ambiguity-seeking behavior. The modal pattern is aversion, but not universal.
- Assuming the paradox dissolves under feedback. Ambiguity aversion attenuates with experience but does not vanish even among trained subjects (Halevy 2007). The phenomenon is robust.
Frequently asked questions
What's the two-urn version of Ellsberg's paradox?
Urn K (known): 50 red balls and 50 black balls. Urn U (unknown): 100 balls, red and black in unknown proportions. Two bets: Bet 1 wins $100 if you draw red. Bet 2 wins $100 if you draw black. For each bet, choose either urn. Most people pick Urn K twice — they want red from the known urn AND black from the known urn. But this implies they think P(red in U) < 0.5 and P(black in U) < 0.5, which sums to less than 1. No probability assignment to Urn U is consistent with the observed pattern.
What's the three-color version?
One urn with 90 balls. 30 are red. The remaining 60 are some mix of black and yellow — unknown ratio. Choice A: bet on red. Choice B: bet on black. Most people prefer A (known 30/90 chance). Choice A': bet on red or yellow. Choice B': bet on black or yellow. Most people prefer B' (known 60/90 chance). Picking A and B' is inconsistent under subjective EU — yellow's probability cancels in both pairs (Savage's sure-thing principle), so A ≽ B requires A' ≽ B'. The paradox shows the sure-thing principle is empirically false.
Why is this different from the Allais paradox?
Allais's paradox uses lotteries with explicit known probabilities — it violates von Neumann-Morgenstern's independence axiom. Ellsberg's paradox uses urns where probabilities are themselves uncertain — it violates Savage's (1954) sure-thing principle, which extends EU to subjective probabilities. The two are siblings: Allais shows risk-with-known-probabilities is mis-modeled by EU; Ellsberg shows uncertainty-without-known-probabilities is mis-modeled by subjective EU. Both target the same axiom structure, but Ellsberg adds a distinct empirical failure: ambiguity aversion.
What's Knightian uncertainty?
Frank Knight's 1921 distinction in 'Risk, Uncertainty, and Profit' between two situations: risk (probabilities known or knowable, like a fair coin) and uncertainty (probabilities not assignable — true unknowns). Knight argued that economic profit comes from bearing Knightian uncertainty, not measurable risk — markets price risk away but not uncertainty. Ellsberg's paradox gave this distinction empirical and decision-theoretic teeth: people treat ambiguity differently from risk, and standard models that conflate them give wrong predictions in insurance, finance, and policy.
Who was Daniel Ellsberg?
Same Daniel Ellsberg famous for leaking the Pentagon Papers in 1971. The paradox came from his 1961 Quarterly Journal of Economics paper based on his Harvard PhD thesis, advised by Howard Raiffa. Ellsberg worked at RAND on nuclear strategy and decision analysis before turning anti-Vietnam-War. The paradox was a footnote in his life until economists realized in the 1980s that he had created the foundation of an entire subfield. Ellsberg lived until 2023 and gave occasional interviews on the paradox alongside his political activism.
How does maxmin expected utility resolve it?
Gilboa-Schmeidler's 1989 maxmin EU model says the agent considers a set of probability priors over the ambiguous urn and chooses the alternative that maximizes the worst-case expected utility across that set. With the unknown urn, an ambiguity-averse agent might consider priors ranging from (red 0.3, black 0.7) to (red 0.7, black 0.3); the worst-case for betting on red has probability 0.3, so the known 0.5 urn is preferred. Similarly the worst-case for betting on black has probability 0.3, so again the known urn wins. The model rationalizes the Ellsberg pattern by weakening the sure-thing principle.