Molecular Biology
Nucleosome
147 base pairs wrapped 1.65 turns around an octamer of histones — the bead on the chromatin string
A nucleosome is the basic packaging unit of every eukaryotic genome. 147 base pairs of double-stranded DNA wrap 1.65 left-handed turns around an octamer of histone proteins — two copies each of H2A, H2B, H3, and H4 — held together by 14 specific contacts between histone arginines and the DNA minor groove. Chain millions of these together with short linker stretches and you get the "beads-on-a-string" 11-nm fibre; add histone H1 at the linkers and the array compacts into a 30-nm fibre and ultimately the metaphase chromosome. Two metres of DNA fit into a six-micron nucleus because of this scaffold, and every gene in the genome is gated by whether its nucleosomes are stable, slid, ejected, or post-translationally modified.
- DNA wrapped147 bp, 1.65 left-handed turns
- Core composition2× H2A, 2× H2B, 2× H3, 2× H4 (octamer)
- Linker DNA~50 bp average in mammals (variable)
- Linker histoneH1 → chromatosome (167 bp protected)
- First crystal structureLuger et al., Nature 1997 (2.8 Å)
- Compaction ratio~7× at the nucleosome; ~10,000× in metaphase
Interactive visualization
Press play, or step through manually. The visualization is yours to drive — try it before reading on.
Watch the 60-second explainer
A condensed visual walkthrough — narrated, captioned, under a minute.
A bead the cell uses two metres of
Each of the four core histones is small (102–135 residues) and shares the same fold — three alpha helices linked by two short loops, a motif called the histone fold. Two histone-fold pairs interlock head-to-tail to make a dimer; in the octamer, an H3-H4 pair handshakes another H3-H4 to form a central tetramer, and one H2A-H2B dimer docks onto each face. The four-helix bundles of H3 and H4 sit at the tetramer's core, and H2A-H2B dimers latch on the sides like wings. The whole assembly is symmetric — flip it and the structure looks identical.
The DNA wrap is left-handed and 1.65 turns long, covering 147 base pairs. Fourteen contacts hold it in place: at every full helical turn (about every 10 bp) the DNA minor groove faces the histone surface, where an arginine from one of the histones reaches in and bonds the phosphate backbone. There are no sequence-specific contacts — the histones grip whatever DNA is presented — but they prefer flexible sequences and reject stiff poly(dA:dT) tracts.
This wrap is the cell's first compaction step. 147 bp of B-form DNA is 50 nm long stretched out; wound around a 7-nm-tall octamer it spans about 11 nm. Multiply that across the 6 billion base pairs of a diploid genome and you have already shrunk DNA roughly seven-fold. Stack the nucleosomes into a 30-nm fibre with H1 and you reach about 50-fold; loop that fibre on the chromosome scaffold and you reach 10,000-fold compaction at metaphase.
Core composition: who is what, where
| Component | Stoichiometry | Length (residues) | Position in particle | Distinguishing feature |
|---|---|---|---|---|
| Histone H3 | 2 copies | 135 | Central tetramer core, contacts H4 | Long N-terminal tail (residues 1-40); CENP-A variant at centromeres |
| Histone H4 | 2 copies | 102 | Central tetramer core, contacts H3 | Most evolutionarily conserved protein known; H4K16ac is the global accessibility switch |
| Histone H2A | 2 copies | 129 | Lateral dimers, contacts H2B | Long C-terminal tail; H2A.X is phosphorylated at DSBs (γ-H2AX); H2A.Z marks promoters |
| Histone H2B | 2 copies | 126 | Lateral dimers, contacts H2A | Short tail; H2BK120ub on gene bodies activates transcription |
| DNA | 147 bp | — | Wraps 1.65 left-handed turns around octamer | Minor groove faces histone surface every ~10 bp |
| Histone H1 (linker) | 0 or 1 per nucleosome | ~220 | Caps entry/exit DNA | Forms the chromatosome (167 bp protected); 11 mammalian variants |
The numbers above are the textbook ones. The reality has a few wrinkles: H1 occupancy is sub-stoichiometric in active chromatin, the linker length varies (yeast averages ~20 bp, mammals ~50 bp, sea-urchin sperm over 100 bp), and many chromosomal regions use histone variants that change the rules.
Assembly behind the replication fork
New nucleosomes are not built spontaneously. After replication, the parental octamers split — roughly half stay with the leading-strand daughter and half with the lagging strand — and new histones (synthesised in S phase by an entire cluster of replication-coupled genes) fill the gaps. The chaperone CAF-1 binds PCNA at the fork and delivers the H3-H4 tetramer first; NAP1 and the FACT complex then add the H2A-H2B dimers. The whole process takes seconds and runs in lockstep with replication.
Two consequences follow. First, half the histones on a daughter chromosome are old (and therefore carry the parental modifications), which is the substrate that lets reader-writer enzymes copy the marks onto new histones in the hours after replication. Second, the cell needs vast quantities of new histones every S phase — histones are among the most abundant proteins synthesised during cell division, and disrupting their cluster (HIST1 on chromosome 6 in humans) is lethal.
How nucleosomes are positioned
If nucleosomes were placed randomly, the cell could not know in advance whether a given gene was accessible. In practice positioning is highly non-random and determined by three forces. Sequence preferences favour flexible GC-rich DNA and reject stiff poly(dA:dT) tracts — these tracts at promoters create nucleosome-free regions a few hundred base pairs wide. Statistical positioning packs subsequent nucleosomes against any barrier; a transcription factor or a stable nucleosome anchors a phased array of nucleosomes downstream. ATP-dependent remodellers — SWI/SNF, ISWI, CHD, INO80 — burn ATP to slide, eject, or load nucleosomes, often against the local thermodynamic preference.
The result is a stereotyped layout at most genes: a nucleosome-free region (NFR) ~150 bp wide at the promoter, flanked by a +1 nucleosome positioned over the transcription start site and a -1 nucleosome upstream. RNA polymerase II then has to displace the +1 nucleosome to begin transcription — which is one reason the +1 is enriched for the variant H2A.Z and H3.3, both of which destabilise the wrap.
Eukaryotic nucleosome vs bacterial nucleoid
| Eukaryotic nucleosome | Bacterial nucleoid | |
|---|---|---|
| Packaging proteins | Histones (H2A, H2B, H3, H4) + H1 | HU, IHF, H-NS, Fis (NAPs) |
| Wrapping geometry | 147 bp, 1.65 left-handed turns | Variable; HU bends DNA but does not wrap fully |
| Compaction | ~10,000× at metaphase | ~1,000× in the nucleoid |
| Regulation | Post-translational marks on histone tails | Supercoiling, NAP binding, growth-phase control |
| Replication-coupled assembly | CAF-1 deposits new H3-H4 at the fork | None; nucleoid is dynamic, not modular |
| Variants | CENP-A, H3.3, H2A.Z, macroH2A, H2A.X | Different NAPs at different growth phases |
Archaea blur this distinction. They use proteins related to eukaryotic histones (HMfA, HMfB) and form roughly 60-bp wraps in tetramer particles — looking like an evolutionary midpoint between bacterial nucleoids and eukaryotic nucleosomes.
Why this structure matters
- Cancer. Mutations in histones themselves are oncogenic. H3K27M (lysine-to-methionine) in pediatric diffuse intrinsic pontine glioma poisons EZH2 and globally erases H3K27me3; H3G34V in giant-cell tumour of bone disrupts H3K36 methylation locally. Both are gain-of-function, dominant-negative oncohistones.
- Centromeres. The kinetochore — the platform that grabs spindle microtubules during mitosis — assembles only on nucleosomes containing the H3 variant CENP-A. Without CENP-A loading at G1, sister chromatids cannot segregate.
- Pioneer factors. Most transcription factors cannot bind DNA wrapped on a nucleosome. A small set — FOXA1, OCT4, GATA, KLF4 — can, and they recruit remodellers to evict the nucleosome, opening chromatin for the next round of factors. They are why developmental cell-fate decisions can override pre-existing chromatin.
- Drug action. Topoisomerase poisons (etoposide, doxorubicin), HDAC inhibitors (vorinostat), BET-bromodomain inhibitors (JQ1) and EZH2 inhibitors (tazemetostat) all act through the nucleosome's contacts and modifications.
- Sperm chromatin. Sperm package most of their DNA on protamines instead of histones, achieving denser compaction; the small fraction (~5–15% in humans) that remains on nucleosomes is at developmentally important genes — an inheritable signal beyond the genetic code.
Histone variants and their niches
- CENP-A (CenH3). Replaces H3 at every functional centromere. Required for kinetochore assembly. Loaded by HJURP in early G1.
- H3.3. Replication-independent variant deposited by HIRA at active genes and by ATRX/DAXX at telomeres and pericentric heterochromatin. Differs from canonical H3 by only four residues but is functionally distinct.
- H2A.Z. Marks promoters, enhancers, and the +1 nucleosome of active genes; destabilises the wrap and primes the locus for activation.
- H2A.X. Phosphorylated at S139 to γ-H2AX within minutes of a DNA double-strand break, marking megabases around the lesion for repair-factor recruitment.
- macroH2A. Carries an extra C-terminal macrodomain; coats the inactive X chromosome and senescent heterochromatin.
- TH2B and protamines. Replace H2B and the entire octamer respectively during spermiogenesis, achieving dense, transcriptionally silent packaging in sperm.
Pitfalls and easy misreadings
- "DNA is just stored in nucleosomes." Nucleosomes do not store DNA, they regulate access to it. Every step of transcription, replication, and repair has to negotiate them.
- "The 30-nm fibre is the in-vivo state." Recent ChromEM and cryo-electron tomography work suggests the 30-nm fibre may be a salt-induced artefact and that interphase chromatin is more like a disordered liquid than a regular helical fibre. The textbook picture is still useful as a teaching device but not literal.
- "All four core histones are equivalent." H4 is one of the most conserved proteins in biology — only two residue differences between yeast and human — because H4K16 acetylation is the master internucleosomal contact. H3 has the most variants. H2A and H2B are the most readily exchanged in and out of nucleosomes.
- "Nucleosomes block transcription completely." RNA polymerase II elongates through nucleosomes routinely, with the help of FACT and elongation factors that disassemble and reassemble the wrap behind the polymerase. The block is at initiation, not elongation, in most cases.
- "H1 is just a clip." H1 has 11 variants in humans, each with different tissue expression and chromatin preferences; H1 depletion in mouse embryonic stem cells causes specific transcription defects, not just global decompaction.
Frequently asked questions
Why exactly 147 base pairs?
The number is fixed by geometry. A 1.65-turn wrap of B-form DNA around the histone octamer covers 147 bp, with the entry and exit points sitting roughly 80° apart on the same face. Take it down to 146 or up to 148 and the helical phasing of the contact points between histone arginines and the minor groove no longer aligns. This is why micrococcal nuclease, which cleaves linker DNA, releases mononucleosomes that protect almost exactly 147 bp regardless of organism.
How is the histone octamer assembled?
In two stages, on chaperones. CAF-1 deposits a tetramer of (H3-H4)2 onto newly replicated DNA; NAP1 and FACT then add two H2A-H2B dimers, one on each face. The octamer is unstable on its own — it falls apart at physiological salt — but stable once wrapped by 147 bp of DNA. Removing nucleosomes during transcription is essentially the inverse: FACT lifts a dimer, the tetramer briefly dissociates, then re-forms behind the polymerase.
What are the histone tails for?
The N-terminal tails of H3, H4, H2A, and H2B (and the H2A C-terminal tail) are unstructured and protrude from the wrap. They are the substrate for the post-translational marks that make up the histone code: acetylation, methylation, phosphorylation, ubiquitination. Removing the tails reduces internucleosomal interactions and weakens 30-nm fibre formation, which is one reason hyper-acetylated chromatin is more accessible to transcription factors.
What is histone H1 and where does it sit?
H1 is the linker histone. It binds the entry/exit DNA of a nucleosome and an additional 20 bp of linker, producing a particle called a chromatosome (about 167 bp protected). H1 stabilises the wrap, restricts DNA breathing, and is essential for compacting nucleosomes into higher-order chromatin fibres. Cells running active transcription tend to be H1-depleted at expressed genes.
How are nucleosomes positioned along the genome?
Three forces. (1) DNA sequence — nucleosomes prefer flexible, GC-rich stretches and avoid stiff poly(dA:dT) tracts. (2) Statistical positioning — barriers like promoter-bound transcription factors create nucleosome-free regions, and the next nucleosome packs against the barrier, then the next against that, in a phased array. (3) ATP-dependent remodellers — SWI/SNF, ISWI, CHD, INO80 chew up ATP to slide, eject, or load nucleosomes against thermodynamics.
What replaces canonical histones at special places?
Histone variants. CENP-A (CenH3) replaces H3 at centromeres and is essential for kinetochore assembly. H3.3 replaces H3 at active genes and telomeres and is deposited replication-independently by HIRA or DAXX. H2A.Z marks promoters and enhancers; macroH2A marks the inactive X chromosome; H2A.X is phosphorylated to gamma-H2AX at double-strand breaks. The chromatin state of a region is the variant plus its modifications, not the modifications alone.