Cosmology

Sunyaev-Zeldovich Effect

How a galaxy cluster dents the oldest light in the universe — a shadow that never fades with distance

Hot electrons in galaxy clusters inverse-Compton scatter CMB photons, distorting its spectrum. A redshift-independent way to find clusters.

  • Low-frequency distortionΔT/T ≈ -2y (Rayleigh-Jeans)
  • Compton y-parametery = ∫ n_e σ_T (k_B T_e / m_e c²) dl
  • Null frequency≈ 217 GHz (decrement below, increment above)
  • Redshift dependenceNone — surface brightness fixed at any z
  • Gas temperature~10⁷–10⁸ K, k_B T_e up to ~15 keV
  • PredictedSunyaev & Zeldovich, 1969–1972

Interactive visualization

Press play, or step through manually. The visualization is yours to drive — try it before reading on.

Open visualization fullscreen ↗

Watch the 60-second explainer

A condensed visual walkthrough — narrated, captioned, under a minute.

A shadow on the oldest light in the universe

The cosmic microwave background is a near-perfect blackbody at 2.725 K filling all of space — the relic glow of the hot early universe, last scattered when the cosmos was 380,000 years old. On its way to us across billions of light-years, a tiny fraction of those photons happen to pass through a galaxy cluster: the most massive gravitationally bound structures in existence, glued together by gas at a hundred million degrees. When a CMB photon crosses that gas, there is a small but non-zero chance it collides with a fast-moving electron and gets kicked to a slightly higher energy. Multiply that nudge over the entire population of CMB photons crossing the cluster and you get a measurable, frequency-dependent dent in the background — colder than the average sky below 217 GHz, hotter than average above it. That dent is the Sunyaev-Zeldovich effect.

It is one of the most useful tools in observational cosmology, for one strange reason. Almost everything we observe in the distant universe gets fainter with distance: light spreads out, surface brightness dims as (1+z)⁻⁴, faint things at high redshift slip below detection. The SZ effect does not. Because it is a fractional distortion of a background that fills the whole sky, the cluster's signal is the same surface brightness whether it sits next door or halfway across the observable universe. Point a millimetre-wave telescope at the sky and the most massive clusters announce themselves as cold spots, at any redshift, with essentially the same contrast. That redshift independence is what turned SZ surveys into cluster-finding machines.

How it works: inverse-Compton scattering

Ordinary Compton scattering is what happens when an energetic photon hits a slow electron and gives the electron some of its energy — the photon comes out softer. Inverse-Compton scattering is the mirror image: a low-energy photon hits a fast electron and the energy flows the other way, from electron to photon. The CMB photons are cold — typical energy a fraction of a milli-electron-volt — while the cluster's electrons are blazing hot, with thermal energies of several keV. So when a 2.7 K CMB photon scatters off a 10⁸ K electron, it almost always comes out with a tiny bit more energy than it went in with.

Crucially, scattering does not create or destroy photons — it just redistributes their energies. The total number of CMB photons crossing the cluster is conserved; what changes is that each one is, on average, boosted slightly upward in frequency. This shifts the entire blackbody spectrum a hair toward higher frequencies. Photons get robbed from the low-frequency Rayleigh-Jeans tail and deposited into the high-frequency Wien tail. So an observer sees:

  • A decrement at low frequencies (radio / long-millimetre): fewer photons than the unperturbed CMB, so the cluster looks colder than the average sky.
  • An increment at high frequencies (sub-millimetre): more photons, so the cluster looks hotter.
  • A null at the crossover, near 217 GHz, where the two effects cancel and the cluster is invisible in temperature.

This distinctive spectral signature — a decrement, a null at a fixed frequency, and an increment — is the fingerprint that lets us separate the SZ effect from every other source of millimetre-wave emission. No other astrophysical signal has a guaranteed sign flip at exactly 217 GHz.

The Compton y-parameter

The amplitude of the thermal SZ effect is captured by a single dimensionless number, the Compton y-parameter:

y = ∫ n_e σ_T (k_B T_e / m_e c²) dl

Read it from the inside out. The factor k_B T_e / m_e c² is the average fractional energy a photon gains per scattering: it compares the electron's thermal energy to its rest energy m_e c² = 511 keV. For a 5 keV cluster that ratio is about 0.01. The Thomson cross-section σ_T = 6.65 × 10⁻²⁵ cm² is the probability of a scattering. Multiplying by the electron number density n_e and integrating dl along the line of sight through the cluster counts how many scatterings a photon undergoes on its way through. So y is, in effect, the average number of scatterings times the average fractional energy gain per scattering — the line-of-sight integral of the electron pressure n_e k_B T_e.

In the low-frequency Rayleigh-Jeans limit, the fractional brightness-temperature distortion is beautifully simple:

ΔT/T ≈ -2y          (Rayleigh-Jeans, h𝜈 ≪ k_B T_CMB)

The minus sign is the decrement: at low frequencies the cluster is colder than the background by twice the y-parameter. Across the full spectrum the distortion is ΔT/T = y · g(x), where x = h𝜈 / k_B T_CMB and g(x) = x coth(x/2) − 4, which equals −2 in the Rayleigh-Jeans limit (x → 0), passes through zero at x ≈ 3.83 (𝜈 ≈ 217 GHz), and becomes positive at higher frequencies. The y-parameter sets the size; the function g(x) sets the shape.

Worked example: a massive cluster

Take a rich cluster like Coma. The intracluster gas near the center has an electron density of order n_e ≈ 3 × 10⁻³ cm⁻³ and a temperature k_B T_e ≈ 8 keV, spread over a path length through the core of about L ≈ 1 Mpc ≈ 3 × 10²⁴ cm. Estimate y by treating the integral as a simple product:

y ≈ n_e σ_T (k_B T_e / m_e c²) L

  = (3×10⁻³ cm⁻³) × (6.65×10⁻²⁵ cm²) × (8 keV / 511 keV) × (3×10²⁴ cm)

  = (3×10⁻³) × (6.65×10⁻²⁵) × (0.0157) × (3×10²⁴)

  ≈ 9 × 10⁻⁵

So y ≈ 10⁻⁴ for a massive cluster — a value confirmed by real measurements of Coma. Now turn that into a temperature dip in the Rayleigh-Jeans regime:

ΔT/T ≈ -2y ≈ -1.8 × 10⁻⁴

ΔT ≈ -1.8 × 10⁻⁴ × 2.725 K ≈ -490 μK

The cluster dims the CMB by about half a milli-kelvin — roughly 0.5 mK — at low frequencies. That sounds tiny, and it is: it is comparable to the primary CMB anisotropies and far smaller than the 2.725 K mean. But it is a stable, predictable signature with a known spectral shape, and it does not weaken with distance. A cluster of the same gas pressure at z = 1 produces the same ≈ −490 μK decrement. Compare that to the cluster's X-ray surface brightness, which scales with n_e² and dims by (1+z)⁻⁴ = 16× by z = 1, vanishing into the noise. The SZ shadow stays put.

Why distance does not dim it

The redshift independence deserves a sharper statement because it is the whole reason the effect matters for surveys. Most extragalactic flux measurements fight cosmological surface-brightness dimming: a source of fixed physical surface brightness appears fainter by (1+z)⁻⁴ as you push it to higher redshift, the combination of energy redshift, time dilation, and solid-angle effects. This is why high-z galaxies and the X-ray emission of distant clusters fade so fast.

The SZ effect sidesteps this entirely because the cluster's signal is not its own emission — it is a fractional modulation of the CMB, which itself fills the sky and dims by exactly the same (1+z)⁻⁴. The CMB temperature at the cluster's epoch was hotter, T_CMB(z) = 2.725 (1+z) K, and the up-scattered photons redshift back down by the same factor on their way to us. The ratio ΔT/T survives untouched, set only by the integrated pressure y. So the observed SZ surface brightness of a cluster of given mass and gas pressure is the same at z = 0.1 and z = 1.5. A millimetre survey of fixed sensitivity therefore selects clusters in a way that is nearly mass-limited rather than distance-limited — the holy grail for counting the most massive halos across cosmic time.

Variants and regimes

The "SZ effect" is really a family of related distortions:

  • Thermal SZ (tSZ). The main effect, driven by the random thermal motion of hot electrons. It has the characteristic decrement/increment with the 217 GHz null and amplitude set by y. This is what cluster surveys exploit.
  • Kinematic SZ (kSZ). A separate, smaller distortion from the cluster's bulk peculiar velocity v_pec relative to the CMB rest frame. It is a pure Doppler shift that preserves the blackbody shape, so it has no spectral null — the same sign at all frequencies — with amplitude ΔT/T ≈ −(v_pec/c) τ, where τ = ∫ n_e σ_T dl is the optical depth. For typical v_pec ~ 300 km/s and τ ~ 10⁻³, kSZ is roughly an order of magnitude below tSZ, but it directly measures velocities and probes the missing-baryon problem and large-scale flows.
  • Relativistic corrections (rSZ). In the hottest clusters (k_B T_e ≳ 10 keV) the electrons are mildly relativistic, and the simple non-relativistic formula picks up percent-level corrections that shift the null frequency slightly above 217 GHz and let multi-frequency data measure the gas temperature directly from the SZ spectrum.
  • Polarized SZ. A much fainter effect from the quadrupole of the CMB as seen by the scattering electrons, of interest for future high-sensitivity experiments.

Observational status

SZ astronomy moved from heroic single detections to industrial cluster surveys within a few decades. Early single-dish attempts on the Coma cluster and Abell 2218 in the 1970s–80s were difficult and disputed at the microkelvin level. Interferometers at the Ryle Telescope and OVRO/BIMA produced clean imaging detections in the 1990s. The transformation came with dedicated wide-field millimetre experiments:

  • Planck (2009–2013) mapped the whole sky and delivered a catalog of more than a thousand SZ clusters plus an all-sky map of the integrated y-parameter (the tSZ power spectrum), a primary probe of σ8.
  • The South Pole Telescope (SPT) and the Atacama Cosmology Telescope (ACT) surveyed thousands of square degrees at arcminute resolution, producing SZ-selected catalogs of thousands of clusters out to z > 1.5 — the highest-redshift massive clusters known were largely found this way.
  • The integrated SZ signal Y_SZ (the y-parameter integrated over the cluster's solid angle) is the cleanest available proxy for total cluster mass, with remarkably low scatter, anchoring cluster-cosmology constraints on Ω_m and σ8.

A persistent puzzle is the mild tension between σ8 inferred from SZ cluster counts and the value from the primary CMB power spectrum. The discrepancy hinges on cluster mass calibration — converting Y_SZ into mass requires hydrostatic or weak-lensing masses, and a systematic "hydrostatic mass bias" of ~20–30% can shift the inferred σ8 enough to reconcile or worsen the tension. Future kSZ measurements and lensing-calibrated masses aim to close this.

SZ versus other cluster probes

ProbePhysical observableScales withRedshift dimmingBest for
Thermal SZIntegrated electron pressure∫ n_e T_e dl (y)None — redshift-independentFinding & weighing distant clusters
Kinematic SZBulk peculiar velocity × τ(v_pec/c) ∫ n_e dlNoneVelocities, missing baryons
X-ray surface brightnessBremsstrahlung emission∫ n_e² Λ(T) dl(1+z)⁻⁴ — fades fastNearby clusters, gas density
Optical richnessCount of member galaxiesNumber of galaxiesDimming + projectionCheap large samples, low z
Weak gravitational lensingTotal projected massΣ(R) shearSource-density limitedDirect mass, no gas assumptions
Galaxy velocity dispersionMember kinematicsσ_v of galaxiesSpectroscopy-limitedDynamical mass, low z

The standout column is "redshift dimming." X-ray brightness goes as n_e² and dims by 16× at z = 1; the SZ effect goes as the first power of n_e (through pressure) and does not dim at all. That single contrast is why SZ surveys reach the highest-redshift clusters and why X-ray follow-up is reserved for the nearer, brighter ones.

Quantitative analysis: from scattering to spectrum

The spectral distortion of the CMB by a population of hot electrons is governed by the Kompaneets equation, which describes how Compton scattering redistributes photon energies in a thermal bath. In the limit of small optical depth and non-relativistic electrons, integrating the Kompaneets diffusion over the line of sight gives the SZ intensity change:

ΔI(x) / I_0 = y · (x⁴ eˣ / (eˣ − 1)²) · g(x)

with  x = h𝜈 / k_B T_CMB
      g(x) = x coth(x/2) − 4
      y = ∫ n_e σ_T (k_B T_e / m_e c²) dl

The spectral function g(x) controls the shape. Evaluate its limits:

x → 0   (low 𝜈, Rayleigh-Jeans):   g(x) → -2     →  ΔT/T = -2y   (decrement)
x = 3.83  (𝜈 ≈ 217 GHz):           g(x) = 0      →  null
x → ∞   (high 𝜈, Wien):            g(x) → x - 4  →  ΔT/T > 0    (increment)

The crossover x = 3.83 follows from solving x coth(x/2) = 4, which lands at 𝜈 = x · k_B T_CMB / h ≈ 3.83 × (56.78 GHz) ≈ 217 GHz for T_CMB = 2.725 K. This null is fixed by fundamental constants and the CMB temperature, not by the cluster — which is exactly why multi-frequency observations straddling 217 GHz cleanly isolate the SZ signal from foregrounds and the primary CMB.

One more consistency check on the redshift independence: the optical depth τ = ∫ n_e σ_T dl is dimensionless and depends only on the gas column, while the energy-gain factor k_B T_e / m_e c² depends only on the gas temperature. Neither carries any explicit factor of luminosity distance or (1+z). The y-parameter is a property of the cluster gas alone, so the observed ΔT/T is too. Distance enters only through the angular size of the cluster on the sky — a far gentler dependence than the (1+z)⁻⁴ that kills surface-brightness observables.

Common pitfalls and misconceptions

  • "The SZ effect heats the CMB photons because the gas is hot." The gas being hot matters, but the mechanism is inverse-Compton scattering off individual fast electrons, not bulk thermal emission. The cluster does not add its own photons; it redistributes the CMB photons that pass through, conserving their total number.
  • Confusing the decrement with absorption. The low-frequency dip is not the gas absorbing light. It is photons being scattered up in frequency, leaving a deficit at low frequencies and a surplus at high frequencies. Total photon number is unchanged.
  • Thinking the null at 217 GHz is a property of the cluster. The null frequency is set by the CMB temperature and fundamental constants, identical for every (non-relativistic) cluster. Only relativistic corrections in the hottest clusters shift it slightly.
  • Assuming SZ flux measures gas mass directly. y integrates pressure n_e T_e, not density alone. Two clusters with the same gas mass but different temperatures have different y. The integrated Y_SZ correlates with total mass, but through the pressure–mass relation, which must be calibrated (hence the hydrostatic-mass-bias debate).
  • Treating tSZ and kSZ as the same thing. They have different physical origins, different spectral shapes (kSZ has no null), and different amplitudes. Disentangling them requires multi-frequency data.
  • Expecting the signal to fade with distance like everything else. It does not. The whole point is that ΔT/T is preserved as the CMB and the scattered photons redshift together. This is counter-intuitive precisely because every other extragalactic observable dims.

Frequently asked questions

What is the Sunyaev-Zeldovich effect in one sentence?

The Sunyaev-Zeldovich (SZ) effect is the spectral distortion stamped onto the cosmic microwave background when its photons pass through a galaxy cluster's hot ionised gas and inverse-Compton scatter off the cluster's energetic electrons, gaining energy and shifting the blackbody spectrum to higher frequencies. The result is a brightness decrement below 217 GHz and an increment above it, with strength fixed by the Compton y-parameter.

What is the Compton y-parameter?

The Compton y-parameter is the dimensionless amplitude of the thermal SZ effect, defined as y = ∫ n_e σ_T (k_B T_e / m_e c²) dl — the line-of-sight integral of the electron pressure n_e k_B T_e weighted by the Thomson cross-section σ_T and divided by the electron rest energy m_e c². It is the average fractional energy a photon gains times the number of scatterings. For a massive cluster y is of order 10⁻⁴; in the Rayleigh-Jeans limit the fractional temperature distortion is ΔT/T ≈ -2y.

Why is the SZ effect a decrement at low frequencies and an increment at high frequencies?

Inverse-Compton scattering conserves the total number of CMB photons but boosts each photon's energy slightly, so the whole spectrum shifts toward higher frequencies. That moves photons out of the low-frequency tail (creating a deficit, the decrement) and into the high-frequency tail (creating a surplus, the increment). The two regimes cross over at a null frequency near 217 GHz, where the brightness change vanishes. Below the null a cluster looks colder than the CMB; above it, hotter.

Why is the SZ effect independent of redshift?

The SZ signal is a fractional change in the CMB brightness, ΔT/T, set only by the integrated electron pressure y along the line of sight. Both the CMB temperature and the scattered photon energies redshift by exactly the same factor (1+z) as they travel to us, so the ratio ΔT/T is preserved. Unlike X-ray surface brightness, which dims as (1+z)⁻⁴, the SZ surface brightness of a cluster of fixed pressure is the same whether it sits at z = 0.1 or z = 1.5. This makes SZ surveys nearly mass-limited rather than distance-limited.

How is the thermal SZ effect different from the kinematic SZ effect?

The thermal SZ (tSZ) effect comes from the random thermal motion of hot electrons and produces the characteristic frequency-dependent decrement/increment with a null at 217 GHz. The kinematic SZ (kSZ) effect comes from the bulk peculiar velocity of the whole cluster relative to the CMB rest frame: it is a pure Doppler shift that preserves the blackbody shape, appearing as a small ΔT/T ≈ -(v_pec/c) τ with the same sign at all frequencies (no null). The kSZ amplitude is typically an order of magnitude smaller than the tSZ for a given cluster, but it directly measures peculiar velocities and large-scale flows.

Who predicted the SZ effect and when was it first detected?

Rashid Sunyaev and Yakov Zeldovich predicted the effect in a series of papers from 1969 to 1972, building on Zeldovich's earlier work on Comptonisation of the CMB spectrum. The first credible detections came in the late 1970s and early 1980s with single-dish radio telescopes pointed at Coma and Abell 2218, but the signal was so faint (microkelvin) that early measurements were contentious. Interferometric work in the 1990s produced clean detections, and the era of large catalogs arrived with Planck, the South Pole Telescope, and the Atacama Cosmology Telescope after 2008.

What does the SZ effect tell us about cosmology?

Because the SZ signal is redshift-independent and tightly correlated with cluster mass through the integrated pressure (the Y_SZ observable), SZ-selected cluster samples count the most massive halos across cosmic time. The abundance of clusters as a function of mass and redshift is sensitive to the amplitude of matter fluctuations σ8 and the matter density Ω_m, and constrains dark energy through the growth of structure. SZ surveys like Planck, SPT, and ACT have produced these constraints and famously sit in mild tension with primary-CMB values of σ8, a tension entangled with cluster mass calibration.

How hot is the gas that produces the SZ effect?

The intracluster medium that scatters the CMB is a diffuse plasma at temperatures of roughly 10⁷ to 10⁸ kelvin — tens of millions of degrees, corresponding to k_B T_e of a few keV up to about 15 keV in the most massive clusters. At these temperatures the electrons are mildly relativistic, electron densities are around 10⁻³ to 10⁻⁴ per cubic centimetre, and the gas radiates X-rays by thermal bremsstrahlung. The same hot electrons that emit X-rays up-scatter CMB photons to make the SZ effect, which is why X-ray and SZ observations are deeply complementary.