Question 1

How can a splay tree be O(log n) amortized if a single operation can be O(n)?

Accepted Answer

Amortized cost is an average over a sequence of operations, not a bound on any single one. A splay that performs Θ(n) rotations can only happen on a tree whose potential is already high: the operations that produced that shape were charged more than their real cost (or the tree started out unbalanced, which the initial-potential term accounts for), and that surplus sits in the structure as credit the expensive splay now spends. Sleator and Tarjan formalize this with the potential Phi = sum over nodes x of r(x), where r(x) = log2(s(x)) and s(x) is the size of x's subtree. The amortized cost of an operation is its real cost plus the change in potential, so an expensive splay pays most of its real cost through a drop in Phi. Over any sequence of m operations, total amortized cost = total real cost + final potential − initial potential. Since 0 ≤ Phi ≤ n log2 n, the total real cost exceeds the total amortized cost by at most n log2 n, which is absorbed into the O((m + n) log n) bound.
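To make the "credit stored in the structure" idea concrete, the sketch below computes Phi for two shapes on the same number of nodes: a left-leaning path and a balanced tree. The nested-tuple tree representation and the helper names path, balanced and potential are illustrative assumptions, not anything from Sleator and Tarjan's paper.

```python
import math

def path(n):
    """Left-leaning path with n nodes, as nested (left, right) tuples."""
    tree = None
    for _ in range(n):
        tree = (tree, None)  # new root whose left child is the old tree
    return tree

def balanced(n):
    """Balanced tree with n nodes, as nested (left, right) tuples."""
    if n == 0:
        return None
    left = (n - 1) // 2
    return (balanced(left), balanced(n - 1 - left))

def potential(tree):
    """Return (subtree size, Phi), where Phi sums log2(size) over all nodes."""
    if tree is None:
        return 0, 0.0
    left_size, left_phi = potential(tree[0])
    right_size, right_phi = potential(tree[1])
    size = left_size + right_size + 1
    return size, left_phi + right_phi + math.log2(size)

n = 255
_, phi_path = potential(path(n))
_, phi_bal = potential(balanced(n))
print(f"path: {phi_path:.1f}  balanced: {phi_bal:.1f}  n*log2(n): {n * math.log2(n):.1f}")
```

The path's potential equals log2(n!), which grows like n log n, while the balanced tree's potential grows only linearly in n. The gap between the two is the credit an expensive splay on the path can draw on.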

Question 2

What is the access lemma exactly?

Accepted Answer

Let r(x) = log2(s(x)) be the rank of node x, where s(x) is the number of nodes in x's subtree. The access lemma states that the amortized cost of splaying x to the root of a tree with root T is at most 3 * (r(T) - r(x)) + 1. The proof analyzes the three step types individually: a zig-zig or zig-zag step has amortized cost at most 3 * (r'(x) - r(x)), where r'(x) is x's rank after the step, and a zig step has amortized cost at most 3 * (r'(x) - r(x)) + 1. A zig occurs at most once, as the final step, which is where the single + 1 comes from. Summing along the splay path, the rank terms telescope so that only the initial r(x) and the final rank r(T) survive. Since r(T) = log2(n) and r(x) ≥ 0, the amortized cost is at most 3 log2(n) + 1 = O(log n).
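Written out, the telescoping step looks as follows. The symbols r_0, ..., r_k are notation introduced here for the rank of x before the first step and after each of the k splay steps, so r_0 = r(x) and r_k = r(T).

```latex
\begin{align*}
\text{amortized cost of the splay}
  &\le \sum_{i=1}^{k} 3\,(r_i - r_{i-1}) \;+\; 1 \\
  &= 3\,(r_k - r_0) + 1 \\
  &= 3\,\bigl(r(T) - r(x)\bigr) + 1 .
\end{align*}
```

The trailing + 1 appears because a zig step, which can only be the last step, is bounded by 3 (r_k - r_{k-1}) + 1 rather than 3 (r_k - r_{k-1}).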

Question 3

Why does the order of rotations inside zig-zig matter for the proof?

Accepted Answer

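In the standard zig-zig (x and its parent p are both left children, or both right children, with grandparent g), Sleator and Tarjan's algorithm first rotates the edge between p and g, lifting p above g, and then rotates the edge between x and p. The case analysis for this order gives an amortized cost of at most 3 * (r'(x) - r(x)), which is exactly the form needed for the sum to telescope. If you swap the rotations (rotate x above p first, then x above g), you get the plain rotate-to-root heuristic: x still reaches the root, but the nodes on the access path stay in one long chain instead of having their depths roughly halved, and the case analysis no longer yields a 3 * (r'(x) - r(x)) bound. This is not just a gap in the proof. Rotate-to-root has access sequences that cost Θ(n) per operation even in the amortized sense, for example repeatedly accessing the keys in sorted order. There exist alternative analyses of splay trees, but the canonical proof is tied to this rotation order.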
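A small experiment can show how much the rotation order matters in practice. The sketch below counts single rotations while accessing keys 1..n in sorted order, starting from a left path, once with the grandparent-first zig-zig and once with the swapped order. The Node class and the helpers rotate_up, splay, rotate_to_root, left_path and sweep_cost are illustrative names assumed for this sketch. Nodes are reached directly by reference, so only restructuring cost is measured.

```python
class Node:
    def __init__(self, key):
        self.key = key
        self.left = self.right = self.parent = None

rotations = 0  # global counter of single rotations performed

def rotate_up(x):
    """Rotate the edge between x and its parent, lifting x one level."""
    global rotations
    rotations += 1
    p, g = x.parent, x.parent.parent
    if p.left is x:
        p.left, x.right = x.right, p
        if p.left is not None:
            p.left.parent = p
    else:
        p.right, x.left = x.left, p
        if p.right is not None:
            p.right.parent = p
    p.parent, x.parent = x, g
    if g is not None:
        if g.left is p:
            g.left = x
        else:
            g.right = x

def splay(x):
    """Splaying with the grandparent-first zig-zig."""
    while x.parent is not None:
        p, g = x.parent, x.parent.parent
        if g is None:
            rotate_up(x)              # zig
        elif (g.left is p) == (p.left is x):
            rotate_up(p)              # zig-zig: p over g first,
            rotate_up(x)              # then x over p
        else:
            rotate_up(x)              # zig-zag: x over p,
            rotate_up(x)              # then x over g

def rotate_to_root(x):
    """Swapped order: always rotate x over its current parent."""
    while x.parent is not None:
        rotate_up(x)

def left_path(n):
    """Return nodes[1..n] linked as a left path with key n at the root."""
    nodes = [None] + [Node(k) for k in range(1, n + 1)]
    for k in range(2, n + 1):
        nodes[k].left = nodes[k - 1]
        nodes[k - 1].parent = nodes[k]
    return nodes

def sweep_cost(restructure, n):
    """Total rotations to bring keys 1..n to the root in sorted order."""
    global rotations
    rotations = 0
    nodes = left_path(n)
    for k in range(1, n + 1):
        restructure(nodes[k])
    return rotations

n = 256
splay_cost = sweep_cost(splay, n)
naive_cost = sweep_cost(rotate_to_root, n)
print("splay:", splay_cost, "rotate-to-root:", naive_cost)
```

On this input the rotate-to-root count grows quadratically in n, while the splay count stays within the n(3 log2 n + 1) + n log2 n budget implied by the access lemma.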

Question 4

What is the static optimality theorem?

Accepted Answer

For any access sequence of m operations on a fixed set of n keys in which key i is accessed f_i ≥ 1 times, the splay tree's total cost is O(m + sum_i f_i log2(m / f_i)). The sum is m times the entropy of the access distribution, and by the information-theoretic lower bound every static binary search tree needs Ω(m + sum_i f_i log2(m / f_i)) total time on such a sequence, including the optimal static BST built with full knowledge of the frequencies. Splay trees therefore match the best static BST up to a constant factor without knowing the frequencies in advance. That is the sense in which splay trees are statically optimal.
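To get a feel for the bound, the sketch below evaluates m + sum_i f_i log2(m / f_i) for two frequency profiles with the same total number of accesses. The function name static_optimality_bound and both profiles are made up for illustration.

```python
import math

def static_optimality_bound(freqs):
    """Evaluate m + sum_i f_i * log2(m / f_i) for positive access counts."""
    m = sum(freqs)
    return m + sum(f * math.log2(m / f) for f in freqs)

uniform = [4] * 256           # 256 keys, each accessed 4 times (m = 1024)
skewed = [1] * 255 + [769]    # one hot key takes most of the 1024 accesses

uniform_bound = static_optimality_bound(uniform)
skewed_bound = static_optimality_bound(skewed)
print(uniform_bound, skewed_bound)
```

The uniform profile gives m(1 + log2 n), matching the usual O(log n) per access, while the skewed profile gives a much smaller value. That difference is what static optimality promises a splay tree will exploit, up to a constant factor.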

Question 5

What is the dynamic optimality conjecture and why is it still open?

Accepted Answer

Dynamic optimality asks whether splay trees are within a constant factor of the optimal offline BST algorithm, one that sees the entire access sequence in advance and chooses the cheapest sequence of rotations. The conjecture says yes: splay trees are O(1)-competitive. The best competitive ratio proven for any online BST is O(log log n), achieved by Tango trees (Demaine, Harmon, Iacono, Patrascu, 2007) and by multi-splay trees, neither of which is a plain splay tree. For splay trees themselves, no competitive ratio better than the O(log n) that follows from the amortized bound is known. Proving O(1) has resisted about 40 years of effort because the offline optimum is hard to characterize, and connecting splay-tree behavior to it requires a structural insight no one has found. It remains one of the best-known open problems in data structures.

Question 6

How does the working set theorem follow from the access lemma?

Accepted Answer

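Define w(x) as the number of distinct keys accessed since x was last accessed. The working-set theorem states that the amortized cost of accessing x is O(log(w(x) + 2)). Proof sketch: order the keys by recency, most recently accessed first, and give the key in position k the weight wt = 1/k^2, so recently accessed keys carry higher weight. Plug these weights into the generalized access lemma, which allows arbitrary positive weights, defines s(x) as the total weight in x's subtree, and keeps the potential as the sum of log2(s(x)). It bounds the amortized cost of accessing x by O(log(W / wt(x)) + 1), where W is the total weight. Here W = sum_k 1/k^2 ≤ pi^2/6 = O(1), and x sits at position at most w(x) + 1, so W / wt(x) = O((w(x) + 1)^2) and the cost is O(log(w(x) + 2)). After the access, x moves to position 1 and the keys that were ahead of it shift back by one. Their weights only decrease, and x is now the root, whose subtree weight W is unchanged, so the reassignment does not increase the potential. Splay trees thus give LRU-like locality for free, with no separate cache machinery.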
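The working-set number itself is easy to compute from an access trace. The sketch below does so directly from the definition. The function name working_set_numbers is an assumption for illustration, and the quadratic-time slicing is chosen for clarity rather than speed.

```python
def working_set_numbers(accesses):
    """For each access, count the distinct keys touched since the previous
    access to the same key; None marks a key's first access."""
    last_seen = {}
    result = []
    for i, key in enumerate(accesses):
        if key in last_seen:
            between = accesses[last_seen[key] + 1 : i]
            result.append(len(set(between)))
        else:
            result.append(None)
        last_seen[key] = i
    return result

trace = list("abcaab")
print(working_set_numbers(trace))  # → [None, None, None, 2, 0, 2]
```

An access with a small working-set number, such as the immediate repeat of "a" above, is one the theorem says a splay tree serves cheaply.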

Property	Splay tree	Red-black tree	AVL tree
Lookup amortized	O(log n)	O(log n)	O(log n)
Lookup worst-case	O(n)	O(log n)	O(log n)
Insert amortized	O(log n)	O(log n)	O(log n)
Insert worst-case	O(n)	O(log n)	O(log n)
Working-set bound	Yes, O(log(w(x) + 2))	No	No
Static optimality	Yes	No	No
Per-node metadata	None	1 color bit	balance factor (2 bits)
Concurrent reads	Lock required (reads mutate)	Read-only safe	Read-only safe

Splay Tree Amortized Analysis

What "amortized O(log n)" actually means

Choosing the potential: rank and Phi

The access lemma — what to prove

Worked zig-zig with numbers

Why zig-zig needs grandparent-first

Theorems that follow from the access lemma

Amortized vs worst-case — practical impact

Common misconceptions

A 3-step proof outline you can reproduce

Frequently asked questions