Phonology
Syllable Onset, Nucleus, Coda
A universal three-part architecture, governed by sonority — and tightly constrained by every language
Every syllable everywhere splits into three slots: Onset (consonants before the vowel), Nucleus (the vowel itself, the only obligatory slot), and Coda (consonants after the vowel). The legal fillers obey the Sonority Sequencing Principle — loudness must rise into the nucleus and fall away from it. English permits up to three onset consonants (str- in street) and four in the coda (-lpts in sculpts); Mandarin permits only simple CV with a nasal coda; Hawaiian permits no coda at all. The architecture is universal; the limits set by each language are what create distinctive accents and force loanword adaptation.
- SlotsOnset · Nucleus · Coda
- Only obligatory slotNucleus
- Sonority hierarchystop < fricative < nasal < liquid < glide < vowel
- English max onsetCCC (str-, spr-, skr-, spl-)
- Mandarin syllable(C)(G)V(N) — onset, glide, vowel, nasal coda only
- Hawaiian syllable(C)V — no coda permitted
Interactive visualization
Press play, or step through manually. The visualization is yours to drive — try it before reading on.
Watch the 60-second explainer
A condensed visual walkthrough — narrated, captioned, under a minute.
The three slots
The classical decomposition (Pike 1947, Hyman 1975) treats every syllable as a hierarchical structure with two main constituents: the Onset and the Rhyme, where the Rhyme further splits into Nucleus and Coda. Diagrammatically:
σ (syllable)
/ \
Onset Rhyme
/ \
Nucleus Coda
c a t → Onset = /k/, Nucleus = /æ/, Coda = /t/
eye → Onset = ∅, Nucleus = /aɪ/, Coda = ∅
strengths → Onset = /str/, Nucleus = /ɛ/, Coda = /ŋkθs/
Why is the Rhyme one constituent? Because rhymes pattern as a unit in poetry, in cross-linguistic stress assignment, and in psycholinguistic experiments. Word-pair memory tasks treat cat/bat as more similar than cat/cap — the shared -at rhyme cluster is mentally chunked. Onsets do not show the same cohesion.
The Sonority Sequencing Principle
Sonority is roughly the perceptual loudness or vowel-likeness of a sound. From least to most sonorous, sounds rank as:
| Class | Examples | Sonority rank | Vowel-like? |
|---|---|---|---|
| Voiceless stops | /p t k/ | 1 (lowest) | No — full closure, no voicing |
| Voiced stops | /b d g/ | 2 | Slightly — voicing through closure |
| Voiceless fricatives | /f s ʃ θ/ | 3 | Some — turbulent airflow |
| Voiced fricatives | /v z ʒ ð/ | 4 | Yes — turbulence + voicing |
| Nasals | /m n ŋ/ | 5 | Yes — full voicing, oral closure but nasal flow |
| Liquids | /l r/ | 6 | Very — open lateral or rhotic flow |
| Glides | /j w/ | 7 | Almost vowels — ʃʷ /w/ ≈ rapid /u/ |
| Vowels | /a i u e o/ | 8 (highest) | The category itself |
The Sonority Sequencing Principle (SSP) states: sonority rises from the edge of the syllable toward the nucleus and falls away from it. So the onset cluster /pl-/ in plant rises (stop=1 → liquid=6 → vowel=8); the coda cluster /-nt/ falls (vowel=8 → nasal=5 → stop=1). Reverse the cluster — try to start a syllable with /lp-/ or end one with /tn/ — and most languages reject it as ill-formed.
The /s/ exception
English (along with German, Greek, Russian) violates SSP at one point: clusters of /s/ + stop. Words like spit /spɪt/, stop /stɑp/, ski /ski/ start with fricative=3 → stop=1, a sonority drop right at the syllable edge. Languages handle this differently:
- English: Tolerates the violation, treating /s/ as an "appendix" outside the core onset.
- Spanish: Repairs by epenthesising /e/ at the start. spanish → español, school → escuela, stress → estrés, stadium → estadio. The repair is so regular that Spanish-speakers learning English over-produce it.
- Modern Greek: Permits /s/+stop natively — stalo "I send", spazo "I break".
- Mandarin: Bans the cluster outright; loanwords break it up with epenthetic vowels.
The /s/ cluster is the cleanest example of phonotactic parameter setting. English-speaking children acquire str- early; Spanish-speaking children resist it and prepend a vowel even in invented words.
Maximum syllable templates across languages
| Language | Max template | Example | Onset | Nucleus | Coda |
|---|---|---|---|---|---|
| Hawaiian | (C)V | aloha | 0–1 | 1 | 0 |
| Japanese | (C)(j)V(N) | shimbun "newspaper" | 0–1 + glide | 1 | only /n/ |
| Mandarin | (C)(G)V(N) | chuáng "bed" | 0–1 + glide | 1 | /n/, /ŋ/ |
| Spanish | (C)(C)V(C)(C) | tras- "across" | 0–2 | 1 | 0–2 |
| English | (C)(C)(C)V(C)(C)(C)(C) | strengths /strɛŋkθs/ | 0–3 | 1 | 0–4 |
| Russian | (C)(C)(C)(C)V(C)(C)(C) | vstrečа "meeting" | 0–4 | 1 | 0–3 |
| Georgian | (C)(C)(C)(C)(C)(C)V… | brts'q'inavs "shines" | 0–6 (notorious) | 1 | 0–4 |
Hawaiian sits at the simple end — every syllable is V or CV, every coda is forbidden, and the inventory of consonants is among the smallest documented (8 consonants, 5 vowels). At the other extreme, Georgian permits onset clusters of up to six consonants — far past anything SSP would predict, and one of the genuinely hard cases for syllable theory.
English's complicated clusters
English allows the densest onsets and codas of any major Indo-European language. The systematic patterns:
- Onset CC: stop + liquid (play, brain, glow, true, dry, crow, fry); fricative + liquid (fly, throw, slow); /s/ + stop (spy, sty, sky); /s/ + nasal (smoke, snow).
- Onset CCC: /s/ + stop + liquid only. str- (street), spr- (spring), skr- (scream), spl- (split), skw- (square if /w/ counts).
- Coda CC: liquid/nasal + stop (belt, bend, ant, lamp); stop + /s/ (cats, hops, picks); fricative + stop (raft, ask).
- Coda CCC: nasal/liquid + stop + /s/ or /θ/ (warmth, sixths, prompts).
- Coda CCCC: only with morphological additions. twelfths /twɛlfθs/, sculpts /skʌlpts/, texts /tɛksts/.
The four-consonant codas almost always involve plural -s, third-person -s, or ordinal -th stacking on top of an already heavy stem. Pure morpheme-internal English codas top out at three: angst, midst, glimpse.
Worked example: Mandarin's tight template
Mandarin Chinese famously restricts every syllable to the schema (C)(G)V(N)/(R) — initial consonant, optional medial glide, vowel nucleus, optional final nasal /n/ /ŋ/ or rhotic /ɻ/. There are no consonant clusters in either onset or coda. Every Mandarin syllable also carries one of four lexical tones, written here with a tone number:
chuáng2 "bed" ch-u-a-ng onset=ch, glide=u, nuc=a, coda=ng
xiǎng3 "to think" x-i-a-ng onset=x, glide=i, nuc=a, coda=ng
mā1 "mother" m-a onset=m, nuc=a
ér2 "child" ∅-e-r no onset, nuc=e, coda=r
lüè4 "approx" l-üe onset=l, nuc=üe (diphthong)
That single template generates exactly ~400 syllable shapes; multiply by 4 tones and you get ~1300 distinct Mandarin syllables — vastly fewer than English's roughly 15,000. The trade-off is dense homophony: shi (4th tone alone) covers 是 "to be", 事 "matter", 市 "market", 试 "test", and dozens more.
Loanwords get hammered into shape. McDonald's becomes màidāngláo (麦当劳, 3 syllables); Trump becomes tèlǎngpǔ (3 syllables); strawberry as a sound transcription becomes shī-tè-lè-bèi-lì, with every cluster blown up into separate CV syllables.
Syllabic consonants: when the nucleus isn't a vowel
The nucleus is overwhelmingly a vowel, but a few languages allow sonorant consonants — usually nasals or liquids — to carry the syllabic peak. The classic case is Czech:
krk /kr̩k/ "neck" onset=/k/, nucleus=syllabic /r̩/, coda=/k/
prst /pr̩st/ "finger" onset=/p/, nucleus=/r̩/, coda=/st/
strč prst skrz krk — Czech tongue-twister, no vowels at all
English does this allophonically in unstressed syllables: button [ˈbʌʔn̩] (final /n/ syllabic), bottle [ˈbɒtl̩], rhythm [ˈrɪðm̩]. Native speakers don't perceive these as missing-vowel words because the syllabic consonant carries the peak. In each case the SSP-critical fact is that the syllabic segment is more sonorous than its neighbours.
Why this matters in acquisition
- Children's first words are CV-shaped. "mama", "papa", "baba", "dada" — the unmarked syllable template. Codas come later; complex onsets later still.
- Cluster reduction in child speech. 2-year-olds say "tar" for "star", "poon" for "spoon" — they're following SSP by deleting the less sonorous member of the cluster.
- L2 phonology recapitulates the schema. A Japanese learner of English may insert epenthetic vowels in clusters (strike → sutoraiku) just as Japanese loanwords do.
- Aphasia diagnostics. Patients with apraxia of speech show selective collapse of complex syllable structures while preserving CV — direct evidence for the structural hierarchy.
Adjacent phenomena
- Maximal Onset Principle. When syllabifying a word, consonants are assigned to the onset of the following syllable wherever possible. petrol divides as pe-trol, not pet-rol, because /tr-/ is a legal onset.
- Ambisyllabicity. A consonant between a stressed lax vowel and an unstressed vowel may belong to both syllables simultaneously (happy [ˈhæp.pi] vs [ˈhæ.pi]).
- Heavy/light syllables. Heavy syllables have a branching rhyme (long vowel or coda); light syllables have just a short vowel. The mora is the formal unit underlying this.
- Extrametricality. Some final consonants (especially /s/) are treated as outside the syllable for stress-assignment purposes — a way to keep the mainstream system simple while explaining edge effects.
- Resyllabification at word boundaries. French les amis [le.za.mi] re-binds the /z/ to the following onset. English casual speech does the same in fast tempo: not at all [nɑ.tə.ɾɔl].
Why syllable structure matters
- Universal grammar. The Onset–Nucleus–Coda hierarchy is among the strongest candidates for a true cross-linguistic universal in phonology.
- Loanword phonology. Borrowing strategies (epenthesis, deletion, restructuring) all reflect the borrower's syllable template.
- Speech recognition. ASR engines model syllabic and sub-syllabic structure to handle coarticulation.
- Reading and literacy. Onset-rhyme awareness predicts phonological reading skill in young children better than phoneme awareness alone.
- Poetic metre. Rhyme requires shared rhymes (nucleus + coda); alliteration requires shared onsets — separate phonological constituents do separate poetic work.
- Sound change. Diachronic loss of codas (Latin noctem → Italian notte → French nuit) is a primary engine of phonological history.
Common pitfalls
- Counting letters instead of phonemes. English orthography is misleading. strengths has 9 letters but only 7 phonemes /strɛŋkθs/, with /ŋk/ as a single phonological cluster.
- Treating vowels and nuclei as equivalent. A nucleus is usually a vowel, but can be a syllabic consonant. The functional role (sonority peak) is the constant.
- Assuming all CV languages are simple. Mandarin's tight CV-template hides extreme tonal complexity. Hawaiian's looks simple but adds long-vowel and diphthong contrasts.
- Forgetting /s/-clusters. The English /s/+stop violates SSP but is fully native. Treating SSP as inviolable predicts the wrong English data.
- Reading orthographic clusters as phonological clusters. English knife begins with /n/, not /kn/. thought begins with /θ/, a single phoneme, not /th/.
- Mistaking loanword forms for the donor pronunciation. Japanese sutoraiku is not a "mispronounced" /strajk/ but a fully Japanese-shaped output of an underlying loan.
Frequently asked questions
What is the Sonority Sequencing Principle?
Sonority — roughly, how loud or vowel-like a sound is — should rise from the start of an onset to the nucleus, then fall to the end of the coda. The hierarchy from least to most sonorous is: stops < fricatives < nasals < liquids < glides < vowels. English plant is a perfect ramp: /p/ (stop) → /l/ (liquid) → /æ/ (vowel) → /n/ (nasal) → /t/ (stop). Violations are repaired by epenthesis (Spanish spanish becomes español) or borrowing failure.
Why does English allow /str-/ but Mandarin doesn't?
Each language sets a maximum onset size. English permits up to three consonants in the onset, with /s/ + stop + liquid being the densest pattern (str-, spr-, skl-). Mandarin permits only one consonant in the onset and bans every coda except /n/ and /ŋ/. So street /striːt/ is impossible in Mandarin — speakers borrowing it must epenthesise vowels (sī-tè-lì-tè) or simplify radically. The difference isn't about articulatory ability; it's about the language's syllable template.
Why is the nucleus the only required part?
Because the nucleus carries the syllable's sonority peak — the part you can sing, the part that bears stress and tone. Onsets and codas are optional in many languages. English eye is V (no onset, no coda), go is CV, an is VC, cat is CVC, strengths is CCCVCCC. Across the world's languages, every syllable type with a vowel exists; types without a vowel are vanishingly rare and only occur with sonorant consonants (Czech krk [kr̩k] uses syllabic /r/ as the nucleus).
What's the densest English syllable?
Strengths /strɛŋkθs/ packs CCCVCCCC — three onset consonants and four coda consonants around a single vowel. Others: sculpts /skʌlpts/ is CCVCCCC; twelfths /twɛlfθs/ is CCVCCCC. English tolerates these heavy clusters partly because of complex coda restrictions on /-s/ (which can attach to almost any preceding sequence) and partly because final /θ/ and /s/ are themselves morphological.
How do loanwords reveal syllable templates?
Borrowed words must be re-fitted to the borrower's syllable structure. Japanese has strict CV syllables (with moraic /n/ as the only coda), so baseball becomes beesubooru /be:subo:ɾu/ — every consonant cluster broken up, every coda either /n/ or vowel-padded. Spanish requires a vowel before initial /s/+consonant clusters, hence esquí, estrés, estudiante. Mandarin renders strawberry as cǎoméi (substitution) or shī-tè-lè-bèi-lì (syllabified). Loanword phonology is always a window into native phonotactics.
Are CV syllables the unmarked default?
Yes. CV is the simplest syllable that is also fully formed (it has both onset and nucleus, no coda). Every language has CV syllables; many languages (Hawaiian, Mandarin, Japanese) allow only CV or close to it. Children acquire CV first (mama, papa, baba). Sound changes often simplify toward CV by deleting codas (English walk lost final /k/ in some dialects) or splitting clusters. Optimality Theory codifies CV as the universal preferred output.