Gravitational Waves
Standard Siren
A merging black-hole or neutron-star binary whose gravitational-wave chirp reads out its own absolute distance — a cosmic ruler with no ladder to climb
A standard siren is a merging compact binary whose gravitational-wave chirp directly encodes its absolute luminosity distance — no cosmic distance ladder, no calibration. The chirp rate fixes the chirp mass, the chirp mass fixes the intrinsic amplitude, and the measured amplitude reads off the distance, giving a one-step, self-calibrating route to the Hubble constant.
- Method proposedSchutz, 1986
- Strainh ∝ ℳc5/3/dL
- First measurementGW170817
- Siren distance≈ 40 Mpc
- H₀ (one siren)70 (+12/−8)
Interactive visualization
Press play, or step through manually. The visualization is yours to drive — try it before reading on.
Watch the 60-second explainer
A condensed visual walkthrough — narrated, captioned, under a minute.
A siren you can hear the distance to
Imagine an ambulance whose siren always blares at exactly the same loudness — and you happen to know that loudness perfectly, with no guesswork. Then the volume you hear tells you, by itself, how far away the ambulance is. No map, no triangulation, no chain of assumptions. You hear faint, it's far; you hear loud, it's close. A standard siren is the gravitational-wave version of that ambulance, except the "loudness" is fixed not by a manufacturer's spec but by Einstein's field equations.
The source is a pair of compact objects — two black holes, two neutron stars, or one of each — spiralling toward a merger. As they whirl ever faster, they radiate gravitational waves whose frequency sweeps upward into an audible-range "chirp." Crucially, general relativity predicts the entire waveform from just the masses and spins of the two bodies. Once you know those, you know exactly how strong the wave should have been at the source. The strength you actually measure at Earth is weaker, diluted by the distance it travelled. The ratio is the distance. That is why the method, proposed by Bernard Schutz in 1986 (the term "standard siren," by analogy with the optical "standard candle," was coined later by Holz and Hughes in 2005), has become the cleanest distance probe in cosmology.
The inspiral waveform and the chirp mass
For two point masses m₁ and m₂ on a slowly decaying circular orbit, the leading-order ("Newtonian quadrupole") gravitational-wave strain measured at luminosity distance dL is
h(t) ∝ (1/d_L) · (G ℳ_c / c²)^(5/3) · (π f(t) / c)^(2/3)
Every feature of the binary's masses collapses into one combination, the chirp mass:
ℳ_c = (m1 m2)^(3/5) / (m1 + m2)^(1/5)
The chirp mass also governs how fast the frequency climbs. Differentiating the orbital energy-loss equation gives
df/dt = (96/5) π^(8/3) (G ℳ_c / c³)^(5/3) f^(11/3)
This is the linchpin. The frequency f and its sweep df/dt are both directly observable in the time series and do not depend on distance at all. Inverting the relation hands you ℳc with no cosmological assumptions. With ℳc in hand, the amplitude formula above contains only one remaining unknown on the right-hand side — dL. The measured strain therefore determines the luminosity distance. No ladder, no calibrator, no anchor galaxy: the source calibrates itself.
Why "self-calibrating" is the whole point
An optical standard candle assumes you know an object's intrinsic luminosity. You never observe that directly — you bootstrap it. Parallax calibrates nearby Cepheids; Cepheids calibrate Type Ia supernovae in galaxies that host both; supernovae reach into the Hubble flow. Each handoff is a "rung," and each rung carries an independent systematic uncertainty (metallicity dependence of the Cepheid period–luminosity relation, supernova standardisation residuals, photometric zero-points). The community spends enormous effort cross-checking rungs.
The siren skips the ladder entirely. Its absolute output is a theorem, not a measurement: plug the masses into general relativity and the strain amplitude is fixed up to known constants. There is no empirical anchor to drift: the waveform measures the mass directly in the gravitational units that set the amplitude (it is the combination G·ℳ_c that the chirp pins down, with c exact by definition), so no external standard candle ever enters. This is why sirens are described as an absolute, single-rung distance measure — the gravitational analogue of being handed the ambulance's true loudness on a certificate of authenticity from the manufacturer of the universe.
Standard siren vs standard candle vs standard ruler
| Property | Standard siren | Standard candle | Standard ruler |
|---|---|---|---|
| Observable | GW strain h(t) | Apparent magnitude m | Angular size θ |
| Yields | Luminosity distance dL | Luminosity distance dL | Angular-diameter distance dA |
| Calibration | None — set by GR + G, c | Empirical distance ladder | Sound horizon from CMB physics |
| Rungs / anchors | 0 | 3+ (parallax → Cepheid → SN Ia) | 1 (CMB sound horizon) |
| Canonical example | GW170817 | Type Ia supernova | Baryon acoustic oscillations |
| Dominant systematic | Distance–inclination degeneracy | Dust, metallicity, standardisation | Early-universe physics, galaxy bias |
| Redshift from | EM counterpart or galaxy catalogue | Host-galaxy spectrum | Galaxy survey spectra |
| Universe regime probed | Late (z ≲ 0.1 currently) | Late (z ≲ 2) | Early sound horizon, all z |
The takeaway: sirens and candles both deliver dL, but the siren's chain of inference is one link long instead of three or more. That independence is precisely what makes a siren valuable as an arbiter — its errors do not correlate with the errors that plague the ladder.
The missing half: getting the redshift
Here is the catch that surprises people. The gravitational waveform gives you a beautifully clean distance — but it tells you almost nothing about redshift. To measure the Hubble constant you need both, because H₀ relates recession velocity (∝ redshift) to distance:
H_0 = c z / d_L (low redshift, z ≪ 1)
Worse, the chirp itself is redshift-degenerate: a heavier binary nearby and a lighter binary far away, redshifted, can produce the same observed frequency track. What the waveform actually constrains is the redshifted chirp mass (1+z)ℳc, so you cannot disentangle mass from redshift using the waves alone. The redshift must come from somewhere electromagnetic — either an optical counterpart that pins down a host galaxy, or a statistical match to galaxies in the localisation volume. This is the entire reason sirens split into "bright" and "dark" varieties.
GW170817 — the textbook bright siren
On 17 August 2017 the LIGO–Virgo network recorded GW170817, the inspiral of two neutron stars of about 1.46 and 1.27 solar masses (chirp mass ≈ 1.188 M☉), the loudest and longest gravitational-wave signal seen to that date. Roughly 1.7 seconds after merger, the Fermi and INTEGRAL satellites caught a short gamma-ray burst, GRB 170817A, from the same patch of sky. Within 11 hours, optical telescopes localised the fading kilonova AT2017gfo to the galaxy NGC 4993.
That coincidence let cosmologists do the siren measurement for real:
| Quantity | Value | Source |
|---|---|---|
| Luminosity distance dL | 43.8 (+2.9/−6.9) Mpc | GW waveform (siren) |
| Host galaxy | NGC 4993 | Kilonova localisation |
| Recession velocity (Hubble-flow corrected) | 3017 ± 166 km/s | Galaxy spectroscopy |
| Derived H₀ | 70.0 (+12.0/−8.0) km/s/Mpc | v / dL |
| Sky localisation | ≈ 28 deg² (GW only); arcsec with counterpart | 3-detector network + kilonova |
A single event delivered the Hubble constant to about 15 percent — comfortably straddling both the early-universe (≈67) and late-universe (≈73) camps, so it could not yet adjudicate, but it proved the method works end-to-end. The error was dominated by the distance–inclination degeneracy; the gamma-ray jet geometry and later radio-VLBI superluminal motion of the jet eventually constrained the inclination to roughly 19° from face-on, tightening the distance.
Quantitative scales and a worked example
Let's sanity-check the GW170817 distance from its loudness. The peak strain at Earth was h ≈ 4 × 10⁻²² — a fractional length change so tiny that LIGO's 4 km arms wobbled by about 1.6 × 10⁻¹⁸ m, a thousandth the width of a proton. Working the amplitude relation the other way, a chirp mass of 1.188 M☉ radiating at the ~few hundred Hz where LIGO is most sensitive implies an intrinsic amplitude that, diluted to h ≈ 4 × 10⁻²², places the source at tens of megaparsecs — consistent with the ~40 Mpc fit.
For the cosmology step, plug the numbers into H₀ = cz/dL. With recession velocity v = cz = 3017 km/s and dL = 43.8 Mpc:
H_0 = 3017 km/s / 43.8 Mpc ≈ 68.9 km/s/Mpc
The official analysis marginalises properly over the inclination and the galaxy's peculiar velocity, landing at 70 (+12/−8). The precision of a single siren scales roughly as the inverse signal-to-noise ratio, so combining N comparable events shrinks the error like 1/√N. Forecasts: ~50 well-localised bright neutron-star sirens reach roughly 2 percent on H₀; reaching 1 percent — enough to settle the tension — needs a few hundred bright sirens or thousands of dark ones, plausible within a decade of the upgraded LIGO–Virgo–KAGRA network and future detectors.
Dark sirens, where there is no light
The overwhelming majority of detections are binary black hole mergers, which produce no electromagnetic flash, so there is no host galaxy to read a redshift from. These are dark sirens. The fix is statistical: the gravitational-wave sky map plus the distance defines a 3D localisation volume, and you assign the event a redshift probability distribution built from every galaxy catalogued inside that volume. Any single event is hopelessly broad, but stacking hundreds of them — each contributing a faint preference — converges on H₀. The well-localised binary black hole GW190814 and the catalogue-based analysis of the full GWTC transient set are the running examples. A variant uses features in the black-hole mass spectrum (such as the pair-instability mass gap) as a "spectral siren," an intrinsic mass scale that, observed at a redshift-stretched value, encodes redshift without any galaxy catalogue at all.
Common misconceptions and edge cases
- "The waveform gives the redshift too." It does not. Gravitational waves are exquisite distance meters and poor redshift meters; the mass–redshift degeneracy means you only ever measure the redshifted chirp mass. Redshift must be supplied electromagnetically or statistically.
- "Sirens give angular-diameter distance, like the BAO ruler." No — a siren measures luminosity distance dL, exactly like a standard candle. The two are related by dL = (1+z)² dA, but the siren observable is the dimming of amplitude with distance, which is a luminosity-distance quantity.
- "More detectors only help with sky position." They also break the distance–inclination degeneracy. A single detector cannot separate a faint, far, face-on binary from a nearby edge-on one; a network sampling both polarisations from different orientations resolves the inclination and tightens the distance — often the single biggest improvement.
- "Sirens are immune to all systematics." They are immune to ladder systematics, but not to detector calibration of the strain amplitude, to gravitational lensing magnifying or demagnifying distant events, to peculiar velocities at low redshift, and (for dark sirens) to galaxy-catalogue incompleteness. The systematics are simply different from — and uncorrelated with — the ladder's.
- "A siren tests cosmology only through H₀." At higher redshift, sirens trace the dL(z) relation and so constrain dark energy and even possible deviations from general relativity in how gravitational waves propagate (a modified-gravity friction term would make the GW luminosity distance differ from the EM one). Space-based detectors like LISA will see massive-black-hole-binary sirens out to z ≳ 1.
Frequently asked questions
Why is a standard siren self-calibrating when a standard candle is not?
A standard candle (a Type Ia supernova, a Cepheid) has an intrinsic brightness you have to calibrate empirically against nearer objects, building a multi-rung distance ladder where each rung carries its own systematic error. A standard siren needs no such calibration: general relativity predicts the gravitational waveform exactly from the binary's masses and spins. The frequency sweep of the chirp, df/dt, measures the chirp mass directly, and once you know the masses the intrinsic strain amplitude is fixed by theory. Comparing predicted to measured amplitude returns the luminosity distance in one step, anchored only to Newton's constant G and the speed of light c.
What is the chirp mass and why does it matter?
The chirp mass ℳ_c = (m1 m2)^(3/5) / (m1 + m2)^(1/5) is the single combination of the two component masses that governs the leading-order inspiral. The rate at which the gravitational-wave frequency sweeps upward obeys df/dt ∝ ℳ_c^(5/3) f^(11/3), so timing the chirp measures ℳ_c with no reference to distance. Since the waveform amplitude scales as ℳ_c^(5/3), knowing ℳ_c fixes the intrinsic loudness — which is exactly what makes the binary a siren of known absolute output.
How did GW170817 measure the Hubble constant?
GW170817 was a binary neutron star merger detected on 17 August 2017. Its gravitational-wave chirp gave a luminosity distance of about 40 megaparsecs directly from the waveform. Independently, its optical counterpart AT2017gfo was localised to the galaxy NGC 4993, whose recession velocity (corrected for peculiar motion) is about 3017 km/s. Dividing recession velocity by siren distance gives the Hubble constant: H₀ = 70 (+12/−8) km/s/Mpc — the first measurement made without any rung of the cosmic distance ladder.
What is a dark siren?
A dark siren is a binary black hole merger with no electromagnetic counterpart — most gravitational-wave events. Without a flash you cannot pin down a single host galaxy, so you statistically marginalise over every galaxy inside the gravitational-wave localisation volume, weighting by the redshifts in galaxy catalogues. Each event is weak on its own, but stacking many dark sirens converges on H₀. GW170817's bright counterpart made it a "bright siren"; the dark-siren method lets the far more numerous black-hole mergers contribute too.
Why is the distance from a siren degenerate with the inclination angle?
The strain a detector records depends on both the luminosity distance and the binary's orbital inclination: a face-on binary (looking straight down the orbital axis) is intrinsically louder than an edge-on one at the same distance. From a single detector the two effects trade off, inflating the distance error. Breaking the degeneracy needs the two gravitational-wave polarisations, which requires a network of detectors viewing from different orientations, or an independent inclination constraint — for GW170817, the relativistic jet geometry of the associated gamma-ray burst sharpened the distance.
Could standard sirens resolve the Hubble tension?
That is the goal. The early-universe value of H₀ from the cosmic microwave background (about 67 km/s/Mpc) disagrees with the late-universe distance-ladder value (about 73 km/s/Mpc) at more than 5σ. Sirens are a completely independent late-universe probe with different systematics, so they can arbitrate. A single siren gives roughly 15 percent precision; forecasts suggest a few hundred well-localised bright sirens, or a few thousand dark sirens, are needed to reach the ~1 percent precision that would decisively confirm or dissolve the tension.