Question 1

Who first proposed the molecular clock?

Accepted Answer

Emile Zuckerkandl and Linus Pauling first articulated the idea in a 1962 Festschrift paper for Albert Szent-Gyorgyi, then elaborated it in their 1965 essay Evolutionary Divergence and Convergence in Proteins. By comparing hemoglobin sequences across vertebrates they noticed that the number of amino-acid differences scaled almost linearly with the time since common ancestry estimated from fossils — alpha and beta globin chains differ by about the same amount in human-horse and human-cow comparisons. They coined the phrase chemical paleogenetics and conjectured that proteins were timekeepers. Motoo Kimura's 1968 neutral theory provided the mechanistic underpinning: if most substitutions are selectively neutral, fixation rate equals mutation rate, which is approximately constant per generation.

Question 2

How fast is the molecular clock?

Accepted Answer

Rates vary by orders of magnitude depending on the gene, the lineage, and the type of site. Synonymous sites in mammalian nuclear DNA tick at about 2 to 3 x 10^-9 substitutions per site per year; nonsynonymous sites are about ten times slower because purifying selection removes most amino-acid changes. Cytochrome c, a deeply conserved respiratory protein, diverges at roughly 1 substitution per site per 20 million years. RNA viruses evolve about a million times faster than vertebrate nuclear genes — HIV-1 substitutes about 2.5 x 10^-3 per site per year, accumulating intra-host diversity in weeks. Mitochondrial DNA in mammals ticks 5 to 10x faster than nuclear DNA. The clock is genome-region-specific, not universal.

Question 3

What is the difference between a strict and a relaxed clock?

Accepted Answer

A strict clock assumes one rate parameter for the whole tree — every branch ticks at the same speed. This is the original Zuckerkandl-Pauling formulation and is rejected by likelihood-ratio tests on most empirical datasets because lineages with shorter generation times (rodents) accumulate substitutions faster than lineages with longer generation times (great apes). A relaxed clock allows each branch its own rate drawn from a prior distribution — uncorrelated lognormal in the popular UCLN model, or autocorrelated Brownian motion in the older method. Bayesian programs like BEAST, MCMCTree, and RevBayes integrate over rate variation while constraining a few internal nodes with fossil calibrations, then output posterior age distributions for every node.

Question 4

How are clocks calibrated with fossils?

Accepted Answer

Fossils provide minimum age constraints — a 56-million-year-old early primate fossil tells you the primate crown group is at least that old, but not how much older. Clock calibration places a probability distribution on selected internal nodes (gamma, lognormal, or uniform with a hard minimum from the fossil and a soft maximum from a guess about the geologic context). The MCMC then samples branch lengths in time units rather than substitutions, with rate parameters as nuisance variables. A common pitfall: with a strict clock and a single calibration point, all node ages scale linearly with that one number — a 10% miscalibration shifts every estimate by 10%. Multiple well-spread calibrations are essential.

Question 5

Why does the molecular clock work at all?

Accepted Answer

Kimura's neutral theory provides the cleanest answer. If a fraction f of mutations are selectively neutral and the per-generation mutation rate at a given site is mu, then in a population of size N the per-generation fixation probability of any one new mutation is 1/(2N) for a diploid. The expected number of new neutral mutations per generation is 2N x mu x f, so the fixation rate is just mu x f — independent of population size. Population fluctuations and drift cancel out. The clock thus ticks at the per-site mutation rate times the neutral fraction, which biology keeps roughly stable for housekeeping genes. The clock breaks down when selection regimes shift, generation times change, or DNA repair efficiency varies.

Question 6

What is the generation-time effect and why does it matter?

Accepted Answer

Most mutations arise during DNA replication, so species that copy their genomes more often per unit time accumulate more mutations per unit time. Mice (1 generation per year) substitute roughly 2 to 4x faster per million years than humans (25 years per generation) at synonymous sites, and rate differences of 5x have been measured between rodents and primates. This violates the strict clock and was the central reason relaxed-clock methods were developed in the late 1990s. The hominoid slowdown — apes ticking slower than Old World monkeys — was first measured by Wen-Hsiung Li in 1987 and remains a textbook example of why a single clock rate cannot be applied across a vertebrate phylogeny.

Aspect	Strict clock	Relaxed clock (UCLN)
Rate parameter	Single rate for all branches	One rate per branch, drawn from a prior
Assumption	Constant substitution rate across the tree	Rate varies; mean and variance of variation are estimated
Fits short timescales (within-genus)	Often acceptable	Acceptable but over-parameterized
Fits deep phylogenies (vertebrates, plants)	Rejected by likelihood-ratio test	Standard choice
Statistical test	Felsenstein 1981 LR test against unconstrained tree	Pass through, no test needed (it is the unconstrained model)
Number of free parameters	1	2B + 2 where B is number of branches
Computation	Fast, closed-form for some priors	Slow MCMC, hours to days
Software example	r8s, MEGA's RelTime	BEAST 2, MCMCTree, RevBayes
Generation-time effect	Cannot accommodate	Captures via branch-specific rates

Molecular Clock

Interactive visualization

Watch the 60-second explainer

Why the molecular clock matters

Common misconceptions

How the molecular clock works

Strict vs relaxed clock

Famous case studies

Frequently asked questions