Question 1

What is speculative execution?

Accepted Answer

Speculative execution is a CPU technique that issues and executes instructions before it knows they are needed — typically past an unresolved branch. The predictor guesses which way the branch will go, the back-end runs the predicted path, and if the guess was right those results retire normally. If the guess was wrong, the speculated work is squashed and the pipeline restarts. It is the single biggest reason a modern out-of-order core sustains 4-8 instructions per cycle.

Question 2

How is speculative execution different from out-of-order execution?

Accepted Answer

Out-of-order execution reorders ready instructions to fill execution units. Speculative execution lets the front-end keep fetching past an unresolved branch or load so that there are instructions to reorder. The two are complementary — OoO without speculation would stall on every branch (one every 5-7 instructions in typical code), and speculation without OoO would not pay off because the speculated work would serialize. Modern x86 and ARM cores do both, with a Reorder Buffer that can hold 200-600 in-flight instructions.

Question 3

What is the Spectre family of vulnerabilities?

Accepted Answer

Spectre describes a class of side-channel attacks that abuse speculative execution. The attacker trains the branch predictor or branch target buffer to mispredict, causing the CPU to speculatively load secret data, use it as an index, and bring an attacker-readable cache line in. Architectural state is rolled back when the misprediction is detected, but the cache footprint survives. A flush-and-reload probe then reads the secret indirectly. Reported variants include Spectre v1 (bounds-check bypass), v2 (branch target injection), MDS, L1TF, and RetBleed.

Question 4

How much data can a Spectre attack leak per second?

Accepted Answer

The original Spectre v1 paper reported leakage rates of around 10 KB/s with high reliability and roughly 2 KB/s across processes. Practical exploits in 2018-2020 ranged from 100 B/s to 10 KB/s depending on the gadget, the microarchitecture, and how aggressively the attacker could prime the cache. That is slow for bulk exfiltration but plenty fast for stealing SSH keys, browser cookies, or password hashes.

Question 5

What mitigations have CPUs and operating systems shipped?

Accepted Answer

Compiler-inserted LFENCE after bounds checks (Spectre v1). Retpolines and IBRS/STIBP/IBPB MSRs (Spectre v2). KPTI / KAISER page-table isolation (Meltdown). Single Thread Indirect Branch Predictors that flush BTB state on context switch. Process-context identifiers for TLBs. Most mitigations cost 1-10 percent of throughput; some workloads (system-call-heavy servers, network forwarding) lost 30 percent or more on launch and recovered partially with hardware fixes in later silicon.

Question 6

Why don't CPUs just stop speculating?

Accepted Answer

Because speculation is responsible for most of the IPC of a modern core. A branch happens every 5-7 dynamic instructions; a 20-stage pipeline running without speculation would stall 14 of those 20 cycles on every branch. Estimated end-to-end slowdown of disabling speculation entirely is 3-5x on most code and 10x+ on branch-heavy workloads. The alternative is to keep speculating but isolate the side-channel surface — invisible speculation, safe speculation, hardware fences, partitioned caches. Several of those ideas have landed in current chips.

Question 7

What is a transient instruction?

Accepted Answer

A transient instruction is one that executes speculatively, then never retires because the speculation was wrong. Architecturally it is as if it never happened — registers and memory are rolled back. Microarchitecturally it still touched the cache, the TLB, the branch predictor, and possibly other shared structures. Spectre-class attacks live entirely inside the transient window.

Microarchitecture	ROB size (µops)	Load buffer	Store buffer
Intel Skylake (2015)	224	72	56
Intel Sunny Cove (2019)	352	128	72
Intel Golden Cove (2021)	512	192	114
AMD Zen 3 (2020)	256	72	64
AMD Zen 4 (2022)	320	136	64
Apple M1 Firestorm (2020)	~630	~210	~140

Speculative Execution

Interactive visualization

How speculative execution works

The Reorder Buffer and the speculation window

Spectre and the side-channel surface

Mitigations

Inspecting and controlling speculation

Performance numbers

Common pitfalls

Frequently asked questions