Statistical Mechanics

Boltzmann Distribution

Probability of a state with energy E in thermal equilibrium scales as exp(−E/kT)

The Boltzmann distribution is the fundamental probability law of statistical mechanics: in thermal equilibrium at temperature T, the probability of a system being in a state with energy E is P(E) = (1/Z) e^(−E/kT), where k is Boltzmann's constant (1.38 × 10⁻²³ J/K) and Z = Σ e^(−E_i/kT) is the partition function that normalizes. The exponential creates the Boltzmann factor e^(−ΔE/kT) — at room temperature kT ≈ 25 meV, so a state 1 eV above ground is suppressed by e^(−40) ≈ 10⁻¹⁷. Maxwell-Boltzmann (kinetic energy distribution of gas molecules) is a special case. Bose-Einstein and Fermi-Dirac generalize for quantum statistics. Used everywhere: chemical reaction rates (Arrhenius equation), atmospheric scale height, semiconductor carrier densities, blackbody radiation precursor.

  • DistributionP(E) = (1/Z) e^(−E/kT)
  • Boltzmann constantk = 1.38 × 10⁻²³ J/K
  • kT at 300 K25 meV
  • Partition functionZ = Σ e^(−E/kT)
  • AuthorBoltzmann 1860s–1870s
  • GeneralizationsBose-Einstein, Fermi-Dirac

Interactive visualization

Press play, or step through manually. The visualization is yours to drive — try it before reading on.

Open visualization fullscreen ↗

Watch the 60-second explainer

A condensed visual walkthrough — narrated, captioned, under a minute.

Why Boltzmann distribution matters

  • Chemistry rates. Arrhenius k = A·e^(−E_a/kT). 90% of physical chemistry computation reduces to Boltzmann factors. A 0.5 eV barrier at room T gives e^(−20) ≈ 2 × 10⁻⁹ — only one in 500 million collisions react. Heating from 300 K to 350 K can multiply rates by 30.
  • Semiconductors. Intrinsic carrier density n_i ∝ e^(−E_g/2kT). Silicon's E_g = 1.12 eV gives n_i ≈ 10¹⁰ cm⁻³ at 300 K, rising to 10¹⁵ cm⁻³ at 500 K — sets the temperature at which devices stop working as transistors.
  • Atmospheres. Pressure and density profiles n(h) = n_0·e^(−mgh/kT). Earth's scale height 7.4 km, Mars's 11 km, Jupiter's 27 km. Photochemistry depends on Boltzmann-weighted column densities.
  • Lasers. Population inversion requires N_2/N_1 > g_2/g_1, exceeding the Boltzmann ratio. Pumping mechanisms work because thermal equilibrium puts the upper state at e^(−ΔE/kT) of the lower — laser action is fundamentally a non-Boltzmann condition.
  • Cosmology. Big-bang nucleosynthesis ratios, recombination of e⁻ and p to form H, freeze-out of weakly interacting particles — all Boltzmann factors at T_universe.
  • Astrophysics. Stellar atmospheres show Boltzmann-distributed line strengths; spectroscopy reads off T from ratios. Saha equation extends to ionization.
  • Information theory. Maximum-entropy distributions under energy constraint are Boltzmann; widely used in machine learning (energy-based models, Boltzmann machines).

The math

  • Discrete states. P(E_i) = e^(−E_i/kT)/Z, with Z = Σ_i g_i e^(−E_i/kT) summing over levels with degeneracies g_i.
  • Continuous energies. P(E) dE = ρ(E) e^(−E/kT) dE / Z, where ρ(E) is the density of states. Phase-space integration in 6N dimensions for N classical particles.
  • Boltzmann factor. Ratio of populations between two non-degenerate states: N_2/N_1 = e^(−(E_2 − E_1)/kT). Symmetric form: ratio depends only on the energy gap.
  • Free energy. F = −kT ln Z. Entropy S = −∂F/∂T. Pressure p = −∂F/∂V. Internal energy U = ⟨E⟩ = −∂(ln Z)/∂β where β = 1/kT.
  • Quantum extensions. Bose-Einstein P(E) = 1/(e^((E−μ)/kT) − 1) for bosons; Fermi-Dirac P(E) = 1/(e^((E−μ)/kT) + 1) for fermions. Boltzmann is the high-T limit of both.

Common misconceptions

  • "Always classical." Quantum analogs exist — Bose-Einstein and Fermi-Dirac. Boltzmann is the high-T, low-density classical limit. The exponential structure persists in all three; the only difference is the ±1 in the denominator.
  • "Only for ideal gas." Boltzmann is universal in any system in thermal equilibrium — solids, liquids, magnetic spins, photons in a cavity, nuclear states. The "ideal gas" association is historical, since Maxwell-Boltzmann velocities were the first concrete derivation.
  • "Z is just normalization." Z encodes all thermodynamic information. Knowing Z(T, V, N) means knowing every thermal property of the system through derivatives. Computing Z is the central task of statistical mechanics.
  • "Energy must be small for Boltzmann to apply." No — Boltzmann factors apply to any energy, but states with E >> kT are exponentially rare. A state 100 kT above ground has P ∝ e^(−100) ≈ 10⁻⁴³ — not zero, just astronomically suppressed.
  • "Probability of a single state." P(E_i) gives the probability of a specific microstate i. The probability of any macrostate with energy E is P(E)·g(E) where g(E) is the degeneracy — the density of states matters.
  • "Boltzmann's k is exotic." It's a unit-conversion factor between energy and temperature. In natural units (E measured in temperature, like in plasma physics or astrophysics), k = 1 and the formula is just e^(−E/T).

History

  • 1860 Maxwell. Velocity distribution f(v) for a gas in equilibrium derived from kinetic-theory arguments. First Boltzmann-like exponential.
  • 1872 Boltzmann. H-theorem proves f(v) approaches Maxwell's distribution irreversibly via molecular collisions.
  • 1877 Boltzmann. Generalizes to arbitrary energy E with P ∝ e^(−E/kT). Tombstone formula S = k log W follows from this.
  • 1889 Arrhenius. Empirical rate equation k = A·e^(−E_a/RT) — direct Boltzmann factor in chemistry.
  • 1900 Planck. Black-body radiation derived assuming oscillators populated per Boltzmann; reveals quantization of energy.
  • 1924 Bose, Einstein. Bose-Einstein statistics — Boltzmann modified for indistinguishable bosons.
  • 1926 Fermi, Dirac. Fermi-Dirac statistics — Boltzmann modified for fermions obeying Pauli exclusion.
  • 1985 Hopfield, Hinton. Boltzmann machines — neural networks whose state probabilities follow e^(−E/T) over a learned energy function.

Applications

  • Chemical reaction rates. Catalysts work by lowering E_a; Boltzmann factor turns small E_a reductions into orders-of-magnitude rate increases. A drop of 0.2 eV at 300 K multiplies rate by e^8 ≈ 3000.
  • Semiconductor doping. n-type ≈ N_D·e^(−E_D/kT) at low T; carrier freeze-out below liquid-N₂ temperatures. Determines IC operating ranges.
  • NMR and ESR. Spin polarization in a magnetic field is the Boltzmann ratio between Zeeman states; tiny at room T (few parts per million for protons in 11.7 T).
  • Plasma physics. Saha ionization equation gives ionization fraction in a thermal plasma — Boltzmann ratio with continuum.
  • Stellar structure. Hydrogen excitation in stars peaks around 9000 K (Balmer lines strongest in A-type stars) — direct readout of Boltzmann population at stellar temperatures.
  • Bioenergetics. Membrane potential, ATP synthesis efficiency, ion-channel gating — all Boltzmann-weighted in equilibrium.
  • Machine learning. Energy-based models, restricted Boltzmann machines, score-based diffusion; deep learning's "Boltzmann distribution over data" framing.

Worked example

  • Two-level system. A hydrogen atom has ground state (1s) at E_1 = 0 and first excited state (2p) at E_2 = 10.2 eV. At T = 300 K, kT = 0.0258 eV.
  • Population ratio. N_2/N_1 = (g_2/g_1) · e^(−ΔE/kT) = 3 · e^(−10.2/0.0258) = 3 · e^(−395) ≈ 10⁻¹⁷¹. Essentially nothing in 2p at room T.
  • At stellar T. Same calculation at T = 9500 K (A-type star surface) gives kT = 0.82 eV, ratio = 3 · e^(−12.4) = 1.2 × 10⁻⁵. About 1 atom in 100,000 in 2p — and that's why hydrogen's Balmer lines (transitions involving 2p) are strongest in A stars.
  • Comparison to barrier crossings. Same formula governs an ion in a 0.3 eV potential well at T = 300 K: P(top of barrier) ≈ e^(−12) ≈ 6 × 10⁻⁶. With attempt frequency 10¹³ Hz, the ion escapes about once per 10⁻⁷ s — fast enough for diffusion to matter.
  • Lesson. The same exponential maps "almost zero" populations into observable rates when multiplied by enormous attempt rates. Boltzmann factors of 10⁻¹⁰ can still drive cellular reactions, vacancy diffusion in metals, and electron emission from cathodes.

Frequently asked questions

What's the partition function?

Z = Σ_i e^(−E_i/kT), summed over all microstates (or integrated for continua). It normalizes the distribution: P(E_i) = e^(−E_i/kT)/Z. But Z is more than a normalizer — its derivatives give all thermodynamics. Internal energy U = −∂(ln Z)/∂β where β = 1/kT. Helmholtz free energy F = −kT ln Z. Entropy S = −∂F/∂T. Knowing Z is equivalent to knowing every thermodynamic property of the system.

Why exponential?

Two derivations converge on the same answer. (1) Maximum entropy under fixed average energy: maximize S = −Σ p_i ln p_i subject to Σ p_i E_i = ⟨E⟩ and Σ p_i = 1, and Lagrange multipliers force p_i ∝ e^(−βE_i). (2) Microcanonical ensemble of an isolated system + small subsystem: Boltzmann's S = k ln W applied to the bath gives the subsystem's probability ∝ e^(−E/kT) once the bath is much larger. Either route, the exponential is mandatory.

How does it lead to the Maxwell-Boltzmann distribution?

Apply P(E) = e^(−E/kT)/Z to a free particle with E = (1/2)mv². Multiply by the density of states 4πv² dv (number of velocity vectors with magnitude v in the shell), normalize, and you get f(v) = (m/2πkT)^(3/2) · 4πv² · e^(−mv²/2kT). That's Maxwell-Boltzmann — peak at v_p = √(2kT/m), mean ⟨v⟩ = √(8kT/πm), rms √(3kT/m). For air at 300 K, v_p ≈ 400 m/s.

How does the Arrhenius rate law come from it?

A reaction with activation energy E_a requires reactants to climb an energy barrier. The fraction of molecules with E ≥ E_a is e^(−E_a/kT) (the high-energy tail of the Boltzmann distribution). Multiplied by an attempt frequency (collision rate or vibrational frequency), the rate constant becomes k_rate = A·e^(−E_a/kT) — the Arrhenius equation (1889). Doubling temperature near room T (300 K → 600 K) typically multiplies rates by 1000 to 10⁶ for E_a around 0.5–1 eV.

What's atmospheric scale height?

An air molecule at altitude h has potential energy E = mgh. Boltzmann gives the density n(h) = n_0·e^(−mgh/kT). The scale height H = kT/mg is the altitude over which density drops by factor e. For air (mean molecular mass 29 g/mol = 4.8 × 10⁻²⁶ kg) at 250 K, H ≈ 7.4 km. So pressure halves every 5 km. Mountain altitudes (3 to 5 km) sit at roughly half sea-level pressure — purely Boltzmann.

When does it fail?

When quantum statistics matter. Bosons (photons, ⁴He) at low T or high density crowd into the ground state — Bose-Einstein P(E) = 1/(e^((E−μ)/kT) − 1). Fermions (electrons, ³He, neutrons) avoid each other — Fermi-Dirac P(E) = 1/(e^((E−μ)/kT) + 1). Boltzmann is the high-T, low-density limit where occupation per state is much less than 1 and exchange effects vanish. Conduction electrons in a metal at room T are deeply Fermi-Dirac, not Boltzmann.