Phonetics
Phonetic Vowel Space
The IPA vowel chart — front/back, high/low, rounded/unrounded
The vowel space is the two-dimensional acoustic-articulatory map on which all human vowels are plotted. Two primary axes: tongue height (close/high to open/low) and tongue backness (front to back). A third dimension is lip rounding (rounded vs unrounded). The IPA vowel chart, formalized by Daniel Jones (1917) using his "cardinal vowels" system, plots these on a trapezoidal diagram. The four corners are [i] (high front unrounded), [a] (low front), [u] (high back rounded), [ɑ] (low back). English five-vowel system in writing maps to ~14-20 vowel phonemes including diphthongs (varies by dialect). Acoustically, vowel quality correlates with the first two formants: F1 inverse to height, F2 with frontness. The vowel space is universally constrained by anatomy and used by all languages, with subsets selected per language.
- Two main dimensionsHeight (close-open), backness (front-back)
- Third dimensionRounding (rounded/unrounded)
- Cardinal vowel proposerDaniel Jones (1917)
- Acoustic correlate of heightFirst formant F1 (inverse)
- Acoustic correlate of backnessSecond formant F2
- Universal corners[i, a, u] (i.e., the most common vowel triangle)
Interactive visualization
Press play, or step through manually. The visualization is yours to drive — try it before reading on.
Watch the 60-second explainer
A condensed visual walkthrough — narrated, captioned, under a minute.
Why the vowel space matters
- Phonetic transcription. IPA depends on vowel-space coordinates.
- L2 pronunciation. Helps learners locate target vowels.
- Speech synthesis. Formant synthesizers operate in F1-F2 space.
- Dialectology. Vowel mergers and shifts plotted on the space.
- Phonological theory. Distinctive features map to space dimensions.
- Speech recognition. Acoustic features include F1-F2 trajectories.
- Speech pathology. Vowel space measurements reveal articulatory disorders.
Common misconceptions
- English has 5 vowels. Five letters; 14-20 vowel phonemes.
- The chart is articulatory. It's primarily auditory; articulatory match is approximate.
- All languages share the same vowel space. Inventory is selected from the universal space, but choice varies.
- Rounding and backness always co-occur. Front rounded vowels (French [y]) break this.
- Schwa is not a vowel. [ə] is a central mid vowel, very common in English unstressed syllables.
- Vowel space is two-dimensional only. Height, backness, rounding, plus length, nasalization, tone.
Frequently asked questions
What are the two axes?
Height: position of the highest part of the tongue. High/close ([i, u]) vs low/open ([a, ɑ]). Mid vowels include [e, ɛ, o, ɔ]. Backness: front-back position of the tongue body. Front ([i, e, æ]) vs central ([ə, ɐ]) vs back ([u, o, ɑ]). Articulatory descriptions roughly match acoustic reality but aren't identical — some "high" vowels involve tongue body fronting more than vertical raising.
What is the cardinal vowel system?
Daniel Jones's reference framework. Eight primary cardinal vowels at roughly equidistant points on the trapezoid: [i, e, ɛ, a, ɑ, ɔ, o, u]. They serve as auditory landmarks for transcribing real-language vowels. Secondary cardinals reverse the rounding: [y, ø, œ, ɶ, ɒ, ʌ, ɤ, ɯ]. Cardinal [i] is the highest, frontest, unrounded vowel possible without becoming a fricative; cardinal [a] is the lowest, frontest possible.
How do English vowels map?
General American has roughly 11-15 monophthong vowels. Front: [i] (beat), [ɪ] (bit), [ɛ] (bet), [æ] (bat). Central: [ə] (about, schwa), [ɜ] (bird in non-rhotic varieties), [ʌ] (but). Back: [u] (boot), [ʊ] (book), [o] (boat — diphthongized), [ɑ] (cot in many dialects), [ɔ] (caught). Plus diphthongs: [aɪ] (buy), [aʊ] (cow), [ɔɪ] (boy). British RP differs notably; California, Boston, Inland North differ further.
What is the role of formants?
Vowels are produced by exciting harmonics of voicing and amplifying selected frequency bands by vocal tract resonance — these are formants. F1 (first formant): inversely related to tongue height. High vowels [i, u] have low F1 (~300 Hz); low vowels [a] have high F1 (~700 Hz). F2 (second formant): correlates with tongue backness. Front vowels have high F2 (~2200 Hz for [i]); back vowels have low F2 (~800 Hz for [u]). F1-F2 plots reproduce the IPA vowel space acoustically.
What is rounding?
Lip rounding shifts formants downward. English has rounded back vowels [u, ʊ, o, ɔ] and unrounded front [i, ɪ, e, ɛ, æ]. Many languages have front rounded vowels: French [y] (tu), German [ø] (schön), Mandarin [y] (女, nǚ). Hungarian, Turkish, Finnish, Swedish all have rich front-rounded inventories. The IPA secondary cardinals systematically reverse rounding for each primary.
How many vowels do languages typically have?
A typical inventory has 5-7 vowels. The most common system is /a, e, i, o, u/ (Spanish, Japanese, Modern Greek, Hawaiian — five-vowel triangle). English is atypically rich (15+). Some Caucasian languages (Ubykh) historically had 80+ consonants and only 2-3 vowels. Tonal languages often have moderate vowel counts (Mandarin ~5-6 monophthongs but 4 tones multiplying contrasts).
What are diphthongs?
Vowels with quality change during the syllable. English [aɪ] (buy) starts low and ends high front. [aʊ] (cow) starts low and ends high back. [ɔɪ] (boy) starts mid back rounded, ends high front. The starting and ending points are vowel space targets the tongue moves between. Pure monophthongs hold quality steady; diphthongs sweep. Triphthongs ([aɪə] in "fire" for some speakers) are vanishingly rare cross-linguistically.