Biochemistry

Protein Folding

From linear sequence to functional 3D structure — sometimes elegant, sometimes pathological

Protein folding is the process by which a linear amino acid chain folds into its functional 3D structure. Driven by: hydrophobic effect (nonpolar residues bury inside), hydrogen bonds (between backbone and side chains), ionic interactions, disulfide bonds, van der Waals. Folding hierarchy: primary (sequence), secondary (α-helix, β-sheet), tertiary (overall 3D), quaternary (multi-subunit). Levinthal's paradox: random search would take forever; actual folding takes microseconds-seconds via funnel-like pathways. Misfolding causes: prion diseases (mad cow, CJD), Alzheimer's, Parkinson's. AlphaFold (2020-) revolutionized prediction.

  • LevelsPrimary, secondary, tertiary, quaternary
  • Driving forceHydrophobic effect (most important)
  • Secondary structureα-helix, β-sheet from H-bonds
  • Misfolding diseasesPrion diseases, Alzheimer's, Parkinson's
  • Levinthal's paradoxFolding too fast for random search
  • AlphaFoldAI prediction of structure (2020+)

Interactive visualization

Press play, or step through manually. The visualization is yours to drive — try it before reading on.

Open visualization fullscreen ↗

Watch the 60-second explainer

A condensed visual walkthrough — narrated, captioned, under a minute.

Why folding matters

  • Function. Misfolded protein = non-functional.
  • Drug design. Need 3D structure for design.
  • Disease. Many diseases from misfolding.
  • Biotech. Recombinant proteins must fold correctly.
  • Evolution. Folding constrains sequence evolution.
  • Structure prediction. AlphaFold transforming biology.
  • Origins of life. First proteins must have folded.

Common misconceptions

  • Sequence determines structure trivially. Folding is complex.
  • All proteins fold same way. Variety of mechanisms.
  • Folding random. Highly directed.
  • Folded structure is rigid. Dynamic; multiple states.
  • Cells fold proteins easily. Often need chaperones.
  • Misfolding always = disease. Some misfolded proteins functional.

Frequently asked questions

What drives protein folding?

Hydrophobic effect dominates. Nonpolar amino acids bury inside (away from water); polar amino acids on outside (interact with water). Plus: H-bonds (backbone + side chains), ionic interactions (salt bridges between charged residues), disulfide bonds (between cysteines), van der Waals (close packing). Result: specific 3D structure where energy is minimized.

What are secondary structures?

Local arrangements stabilized by H-bonds between backbone atoms. (1) α-helix: 3.6 residues per turn; H-bonds between i and i+4; common (~30% of residues). (2) β-sheet: extended strands H-bonded laterally; parallel or antiparallel. (3) β-turn: short reversal of direction. (4) Random coil: less ordered. Combined: structural framework of proteins.

What's tertiary structure?

Overall 3D arrangement of single polypeptide chain. Combines secondary structures into specific shape. Stabilized by: hydrophobic packing, H-bonds, salt bridges, disulfide bonds. Functional shape — active site of enzyme, binding pocket of receptor, etc. Each protein has unique tertiary structure determined by sequence.

What's quaternary structure?

Multi-subunit proteins. Multiple polypeptide chains assembled. Examples: hemoglobin (4 subunits — 2 α + 2 β), antibodies (4 chains), DNA polymerase (multiple subunits). Subunits: same (homo-oligomer) or different (hetero-oligomer). Binding sites between subunits; sometimes regulated allosterically (cooperativity in hemoglobin).

What's Levinthal's paradox?

Levinthal (1968): if protein folded by random search through all conformations, would take ~10⁵⁰ years. Actual folding: microseconds to seconds. Conclusion: folding is directed. Modern explanation: energy landscape is funnel-shaped; folded state is at bottom. Many paths to same final state. Folding is rapid optimization, not random search.

What's a chaperone?

Helper protein that assists folding. (1) Heat shock proteins (HSP60, HSP70) — bind nascent or partially folded proteins; prevent aggregation; release for proper folding. (2) GroEL/GroES (bacterial): "folding cage" where proteins fold isolated. Critical: folding crowded cellular environment; preventing misfolding under stress.

What's protein misfolding?

Improper structure. Causes diseases. (1) Prion diseases: PrP misfolds → β-rich form; aggregates; transmissible. Examples: mad cow (BSE), CJD, scrapie. (2) Alzheimer's: amyloid β plaques + tau tangles. (3) Parkinson's: α-synuclein. (4) Huntington's: polyglutamine aggregates. Misfolded proteins often form amyloid fibrils — β-sheet structures.

What's AlphaFold?

AI model from DeepMind (2020-2021). Predicts protein 3D structure from sequence with accuracy approaching experimental methods. CASP14: nearly solved problem. Open access database: most known proteins now have predicted structures. Revolutionizing structural biology, drug discovery. Demis Hassabis, John Jumper Nobel Prize 2024 (Chemistry).