Molecular Biology

mRNA Capping

The m7G hat that lets the ribosome find your message

The 5' cap is a 7-methylguanosine (m7G) attached to the first nucleotide of nascent eukaryotic mRNA through an unusual 5'-5' triphosphate bridge — m7GpppN. The cap is added co-transcriptionally in three enzymatic steps: a triphosphatase removes the gamma phosphate, guanylyltransferase adds GMP, and N7-methyltransferase methylates the new guanine. The eukaryotic initiation factor eIF4E binds the cap and recruits eIF4G, eIF4A, and the 40S ribosome to begin translation. Capping protects mRNA from 5'→3' exonucleases, marks transcripts as self for the innate immune system, and routes them to the ribosome. Vaccinia virus carries its own multifunctional capping enzyme; influenza steals caps from host transcripts. Capped synthetic mRNA pioneered the COVID-19 vaccines.

  • Cap structurem7GpppN (5'-5' triphosphate)
  • Added whenCo-transcriptionally, ~25 nt into transcript
  • Cap-binding factoreIF4E
  • Cap0 / cap1 / cap2Increasing 2'-O-methylation on first 1–2 nt
  • DiscoveryFuruichi, Shatkin, Moss (1975)
  • mRNA vaccinesUse CleanCap AG (cap1) analog

Interactive visualization

Press play, or step through manually. The visualization is yours to drive — try it before reading on.

Open visualization fullscreen ↗

Watch the 60-second explainer

A condensed visual walkthrough — narrated, captioned, under a minute.

Capping in three enzymatic steps

Capping is the very first post-transcriptional modification. It occurs while the mRNA is still tethered to RNA polymerase II — the enzymes that build the cap ride along on the phosphorylated CTD of Pol II so they cap the first transcript they see, not later ones.

  1. Triphosphatase removes the γ phosphate. The nascent 5'-pppN end is hydrolyzed to 5'-ppN. In humans, RNGTT (RNA guanylyltransferase and 5'-phosphatase) carries this activity in its N-terminal domain.
  2. Guanylyltransferase transfers GMP from GTP. The enzyme first forms a covalent enzyme-GMP intermediate (lysine-bound), then attacks the 5'-diphosphate, joining GMP via a 5'-5' triphosphate bridge: GpppN.
  3. N7-methyltransferase methylates the new guanine. RNMT, with its activator RAM, transfers a methyl group from S-adenosylmethionine (SAM) onto the N7 nitrogen, completing m7GpppN — the cap0.

Higher eukaryotes go further. CMTR1 methylates the 2'-OH of the first transcribed ribose (cap1); CMTR2 methylates the second ribose (cap2). These additional methylations are critical for self/non-self discrimination by IFIT1 and RIG-I in the cytoplasm.

Capping pathway and cap structure

  Pol II nascent transcript:  pppN-N-N-N-...
       │
       ▼  RNA triphosphatase  (γ-PO₄ removed)
       │
       ppN-N-N-...
       │
       ▼  guanylyltransferase  + GTP
       │
       GpppN-N-N-...        (5'-5' triphosphate bridge!)
       │
       ▼  N7-methyltransferase  + SAM
       │
       m7GpppN-N-N-...       (cap0)
       │
       ▼  CMTR1 (2'-O-methylates 1st ribose)
       │
       m7GpppNm-N-N-...      (cap1, mammalian standard)
       │
       ▼  CMTR2 (2'-O-methylates 2nd ribose)
       │
       m7GpppNm-Nm-N-...     (cap2)


  Cap structure (head-on view):

       7-methylguanosine
              │
        5'-5'│triphosphate
              │
       first transcribed N (often A or G)
              │
              ▼
       rest of mRNA  →  AAA(A)n  3'

Cap-dependent vs cap-independent translation

Cap-dependent (canonical)IRES-drivenm6A-drivenCap-snatching (viral)Vaccinia-style (viral)uORF reinitiation
5' end requirementm7GpppN capStructured RNA >100 ntm6A in 5' UTRStolen capped fragmentm7GpppN (made by virus)m7GpppN (host)
Initiation factoreIF4E + eIF4G + eIF4A + eIF3eIF4G fragment + eIF3 (varies)YTHDF1/2/3 / eIF3eIF4E (cap is real)eIF4E (cap is real)eIF4E (rebinds)
Used in stress / apoptosisSuppressedActive (XIAP, BiP, c-myc)Active (HSP70 under heat)Viral takeoverViral takeoverATF4, GCN4
40S recruitment mode5' end scanningDirect internal bindingDirect via YTHDFStandard scanning of capStandard scanningRe-scan after stop
Drug targetseIF4E (4EGI-1), eIF4A (silvestrol)HCV IRES (miravirsen)METTL3 (STM2457)PA endonuclease (baloxavir)Vaccinia D1
Cellular example~95% of mRNAs at restBiP, XIAP, p53, c-mycHSP70 under heat shockATF4 stress response
Viral exemplarPolio, FMDV, HCVSome flavivirusesInfluenza, hantavirusVaccinia, reovirus

Real-world relevance

  • mRNA vaccines. BioNTech/Pfizer and Moderna COVID-19 vaccines use cap1 analogs (CleanCap AG). Cap1 alone, not cap0, evades IFIT1 sensing and prevents premature interferon response — the difference between a tolerable shot and a pyrogenic one. Karikó and Weissman (Nobel 2023) showed that pseudouridine plus cap1 makes synthetic mRNA non-immunogenic enough for therapy.
  • Cap-snatching antivirals. Baloxavir marboxil (Xofluza, 2018) inhibits influenza PA endonuclease, the cap-snatching subunit. A single oral dose shortens flu by about a day. Resistance via PA I38T arose almost immediately and is now monitored.
  • eIF4E in cancer. eIF4E is overexpressed in many tumors; it preferentially translates 5'-UTR-structured mRNAs encoding cyclin D1, c-Myc, VEGF, and survivin. 4EGI-1 and ribavirin disrupt eIF4E-eIF4G binding and have been tested clinically.
  • 4E-BP signaling. mTORC1 phosphorylates 4E-BP1, releasing it from eIF4E and switching on cap-dependent translation. Rapamycin and its analogs (everolimus, temsirolimus) inhibit this and dampen translation in tumors and immune cells.
  • Decapping in disease. Loss-of-function mutations in DCPS (cap m7GpppN scavenger decapping) cause Al-Raqad syndrome with intellectual disability — capping pathway integrity matters for neurons.
  • P-bodies and stress granules. Decapping factors concentrate in cytoplasmic P-bodies; stalled preinitiation complexes accumulate in stress granules. The dynamics of these condensates regulate which transcripts get translated under stress.

Cap variants and unusual structures

  • Cap0, cap1, cap2. Differ in 2'-O-methylation status. Mammals require cap1 to evade RIG-I/IFIT1.
  • m2,2,7G trimethylguanosine cap. Found on snRNAs (U1, U2, U4, U5) and some snoRNAs. Made by TGS1.
  • NAD caps. NAD+ can prime transcription, leaving an NAD-cap; removed by DXO and Nudt12.
  • FAD and CoA caps. Discovered in bacteria; possibly more widespread.
  • Cap analogs in vitro. ARCA forces correct orientation; CleanCap AG produces cap1 in one step.

Pitfalls and design traps

  • Reverse-orientation cap analogs. Older mCAP could go either direction during T7 transcription; 30–50% of transcripts ended up with reversed caps. ARCA fixed this with a 3'-O-methyl that blocks reverse incorporation.
  • Uncapped 5'-triphosphate. Unreacted starting material is recognized by RIG-I as viral; therapeutic IVT products must be CIP-treated or HPLC-purified.
  • Cap0 immunogenicity. Older mRNA platforms used cap0 and triggered interferon. Cap1 (CleanCap or vaccinia 2'-O-MTase) is essential for in vivo use.
  • Decapping during prep. Some IVT kits contaminate with phosphatase. Validate by LC-MS.
  • NAD-cap miscounting. Standard RNA-seq is biased toward 5'-monophosphate ends and misses NAD caps. CapZyme-seq detects them.
  • eIF4E oncogenicity. Overexpressing eIF4E in mice drives lymphoma (Ruggero, Nat Med 2004) — translation initiation as a primary oncogene.

Frequently asked questions

How is the cap structure built chemically?

Capping happens co-transcriptionally on the nascent mRNA when it reaches ~25 nucleotides. Three enzymatic activities act in sequence on the 5' triphosphate end. (1) RNA triphosphatase hydrolyzes the gamma phosphate, leaving 5'-diphosphate. (2) Guanylyltransferase forms a covalent enzyme-GMP intermediate from GTP, then transfers GMP onto the 5' diphosphate, creating the 5'-5' triphosphate linkage. (3) RNA (guanine-N7) methyltransferase methylates the new guanine at N7 using S-adenosyl-methionine. The result is m7GpppN. In higher eukaryotes, additional methylations on the first (cap1) and second (cap2) transcribed nucleotide riboses follow.

Why does eIF4E binding matter?

Eukaryotic initiation factor 4E recognizes the m7G cap through stacking of the methylated guanine between two tryptophan residues — a sandwich first seen in Marcotrigiano's 1997 crystal structure. eIF4E recruits eIF4G as a scaffold, which in turn brings eIF4A (RNA helicase) and eIF3 / 40S ribosomal subunit. The 43S preinitiation complex scans 5' to 3' for an AUG. Without a cap, none of this happens — uncapped mRNA in the cytoplasm is degraded by Xrn1. eIF4E is the rate-limiting factor for cap-dependent translation; cells regulate its activity with 4E-BPs.

What's cap-independent translation?

Internal ribosome entry sites (IRESes) recruit ribosomes without a cap. Picornaviruses (poliovirus, hepatitis A, FMDV, EMCV) shut off host cap-dependent translation by cleaving eIF4G, then translate their own mRNAs from 400+ nt structured 5' UTRs that bind 40S directly. Cellular IRESes also exist in stress-response transcripts (XIAP, c-myc, BiP), letting them be made when overall cap-dependent translation is suppressed during apoptosis, heat shock, or hypoxia.

How do viruses handle caps?

Three strategies. (1) Make your own — vaccinia virus, reoviruses, and many large DNA viruses encode multifunctional capping enzymes. Vaccinia D1 has triphosphatase, guanylyltransferase, and N7-methyltransferase activities in one polypeptide. (2) Steal caps — influenza, hantavirus, and bunyavirus polymerases bind a host capped transcript and cleave 10–14 nt downstream, using that fragment as a primer. PB2 binds the cap; PA cleaves; baloxavir targets PA. (3) Skip caps — picornaviruses use IRESes.

Are mRNA vaccine caps natural?

mRNA vaccines (Pfizer-BioNTech, Moderna) use a synthetic cap analog. CleanCap AG (Trilink), which forms cap1 in a single step during in vitro transcription, replaced the older anti-reverse cap analog (ARCA, cap0) in commercial vaccines. The cap1 structure (with 2'-O-methylation on the first transcribed nucleotide) is essential — without it, RIG-I and IFIT proteins detect the RNA as foreign and trigger interferon response, reducing vaccine potency.

How is the cap removed?

Decapping is an irreversible commitment to mRNA decay. The decapping holoenzyme is centered on Dcp2, an Nudix hydrolase that cleaves the cap to release m7GDP, leaving a 5'-monophosphate end vulnerable to the 5'→3' exonuclease Xrn1. Dcp1 stimulates Dcp2; Edc1, Edc3, and Pat1 enhance the reaction. Decapping is regulated by deadenylation, NMD, and miRNA-mediated silencing. P-bodies are cytoplasmic granules where decapping concentrates.