Phonology
Phoneme vs Allophone
Same sound, different status — depending on the language
A phoneme is the smallest sound unit that can distinguish one word from another in a language. An allophone is a predictable, automatic variant of a phoneme that never changes a word's meaning. The very same articulations — aspirated [pʰ] versus unaspirated [p] — are allophones of a single phoneme /p/ in English, but two separate phonemes /pʰ/ and /p/ in Hindi. Whether a sound difference is phonemic or allophonic is a property of the language, not of the acoustics.
- Phoneme notationSlashes /p/
- Allophone notationSquare brackets [pʰ]
- Diagnostic for phonemesMinimal-pair test
- Diagnostic for allophonesComplementary distribution
- FoundersBaudouin de Courtenay, Sapir, Trubetzkoy
- StatusLanguage-specific, not universal
Interactive visualization
Press play, or step through manually. The visualization is yours to drive — try it before reading on.
Watch the 60-second explainer
A condensed visual walkthrough — narrated, captioned, under a minute.
The core idea
Take any consonant — say, the /p/ in English. Pronounce pin aloud while holding a sheet of paper an inch from your lips. The paper jumps: there is a strong puff of air. Now pronounce spin. The paper barely moves. Same letter, same phoneme, but two physically different sounds: aspirated [pʰ] and plain [p]. English speakers almost never notice the difference, because in English the difference is automatic — predictable from position — and never used to distinguish words.
Now switch languages. In Hindi, the words pal /pəl/ "moment" and phal /pʰəl/ "fruit" differ only in aspiration. Hindi speakers hear them as obviously different words, the way English speakers hear "pin" and "bin." The same articulation that English ignores, Hindi treats as a meaning-bearing contrast.
This is the foundational insight of twentieth-century phonology. Sound differences come in two kinds, and the kinds are language-specific:
- Phonemic (contrastive) differences distinguish words. Replacing one with the other yields a different word, or a non-word. Marked with slashes: /p/ vs /b/.
- Allophonic (non-contrastive) differences are predictable variants of a single phoneme. Replacing one with the other leaves the word identifiable, even if it sounds accented. Marked with square brackets: [pʰ] vs [p].
The phoneme is an abstract category in the speaker's mental lexicon; allophones are the concrete realizations the mouth produces in context.
How to tell them apart
Phonologists use three diagnostics, applied in sequence.
- Minimal-pair test. Look for two words that differ in exactly one segment, in the same position, and that mean different things. Pin/bin proves /p/ and /b/ contrast in English. If a minimal pair exists, the sounds are different phonemes. Done.
- Complementary distribution. If you cannot find a minimal pair, list every environment in which each sound appears. If the environments never overlap — sound A only here, sound B only there — they are probably allophones of one phoneme. English [pʰ] only appears at the start of a stressed syllable; English [p] only appears elsewhere. The environments are mutually exclusive.
- Phonetic similarity. Complementary distribution alone is not sufficient. English [h] only appears syllable-initially and [ŋ] only appears syllable-finally — so technically complementary — but the two sounds are too phonetically dissimilar (a glottal fricative vs a velar nasal) to be plausibly grouped. They remain separate phonemes by convention.
A fourth case, free variation, complicates the picture: two sounds appear in the same environment but neither contrast nor distribute predictably. English word-final /t/ can be released [t] or unreleased [t̚] in the same word ("cat") with no meaning change and no rule predicting which. They are allophones in free variation.
Cross-linguistic data: aspiration
The cleanest cross-linguistic illustration is the status of voiceless aspirated and unaspirated stops. The same articulations are organized into different phoneme inventories.
| Language | Sounds present | Status | Evidence |
|---|---|---|---|
| English | [pʰ], [p], [tʰ], [t], [kʰ], [k] | Allophones of /p/, /t/, /k/ | No minimal pairs; aspiration predictable from stress and position |
| Hindi | [pʰ], [p], [tʰ], [t], [kʰ], [k] | Distinct phonemes | pal /pəl/ "moment" vs phal /pʰəl/ "fruit" |
| Mandarin | [pʰ], [p], [tʰ], [t], [kʰ], [k] | Distinct phonemes | 八 bā /pa/ "eight" vs 趴 pā /pʰa/ "lie prone" |
| Korean | [pʰ], [p], [p͈] (and tense) | Three-way phoneme contrast | pal /pal/ "foot" vs phal /pʰal/ "arm" vs ppal /p͈al/ "suck" |
| French | [p] only (no [pʰ]) | Single phoneme /p/ | French speakers don't aspirate; English /p/ sounds "puffy" to them |
| Spanish | [p] only (no [pʰ]) | Single phoneme /p/ | Same as French; English-sounding aspiration is L1 transfer |
| Thai | [pʰ], [p], [b] | Three phonemes | /paː/ "forest" vs /pʰaː/ "to split" vs /baː/ "shoulder" |
Acoustically, the [pʰ] in an English speaker's pin and the [pʰ] in a Hindi speaker's phal are essentially identical. Phonologically, one is a context-conditioned variant of /p/ and the other is a phoneme on its own. Phonemic status is a fact about the language's lexicon, not the speaker's vocal tract.
English /t/: an allophone factory
Each English phoneme typically has several allophones, distributed by position. The phoneme /t/ is a particularly rich example.
| Allophone | Environment | Example word | Approximate sound |
|---|---|---|---|
| [tʰ] | Onset of stressed syllable | top, attack | Aspirated alveolar stop |
| [t] | After /s/, in clusters | stop, still | Plain alveolar stop |
| [ɾ] (flap) | Between vowels, after stress | butter, city | Quick tongue-tap, like Spanish r |
| [ʔ] (glottal stop) | Before syllabic /n/ | button, kitten | Throat closure |
| [t̚] (unreleased) | Word-finally, optionally | cat, hot | Stop closure, no release burst |
| [t̪] (dental) | Before /θ/, /ð/ | eighth, hit them | Tongue contacts teeth, not alveolar ridge |
An English speaker hears all six as "the same sound." A learner tracking the spelling will produce them all as [tʰ] and sound foreign. The phoneme /t/ is a single mental category that surfaces as six (or more) physical realizations, governed by phonological rules the speaker has internalized without ever being taught.
Worked example: is [ɹ] a phoneme of English?
Suppose a field linguist arrives knowing nothing about English. They record a corpus and notice two rhotic-like sounds: a postalveolar approximant [ɹ] (as in red) and a labiodental approximant [ʋ] (as in some London speakers' red). Are these one phoneme or two?
- Minimal pairs? None. No English words distinguish [ɹ] from [ʋ].
- Complementary distribution? No. Both can appear word-initially. They overlap.
- Free variation? Yes — speaker- and dialect-dependent, with no rule predicting which.
Conclusion: [ɹ] and [ʋ] are allophones of /ɹ/ in free variation. The phoneme is /ɹ/; both surface forms realize it. Now compare to read /ɹiːd/ vs lead /liːd/: minimal pair, so /ɹ/ and /l/ are separate phonemes. Same diagnostic, different conclusion — because the data is different.
Competing theories
The phoneme/allophone distinction has been formalized in several frameworks, each with consequences for how phonologies are written.
| Theory | Key claim | Phoneme status | Strengths | Weaknesses |
|---|---|---|---|---|
| American structuralism (Bloomfield, Pike, Hockett) | Phonemes are inferred bottom-up from distribution | An emic unit defined by contrast | Empirical, replicable | Cannot represent abstract underlying forms |
| Generative phonology (Chomsky & Halle 1968) | Phonemes are underlying representations; allophones derived by rule | Underlying form in lexicon | Captures alternations and rules | Choice of underlying form can be arbitrary |
| Natural phonology (Stampe 1973) | Allophones reflect universal innate processes; phonemes are what survives suppression | Phonemic = unsuppressed natural form | Connects child language to adult phonology | "Naturalness" is hard to falsify |
| Optimality Theory (Prince & Smolensky 1993) | No derivational rules; surface forms emerge from ranked constraints | Phonemes as input; allophony from constraint conflict | Universal constraint set, language-specific ranking | "Input" can be opaque; phoneme inventory partly stipulated |
| Exemplar/usage-based models | Speakers store phonetic detail directly; categories emerge from clustering | Phoneme is a probabilistic cloud, not a discrete unit | Handles gradience, social variation, lexical effects | Risk of losing the phoneme/allophone abstraction altogether |
For practical phonological description and L2 teaching, the structuralist phoneme/allophone analysis remains the lingua franca. For theoretical work on opacity, alternations, and morphophonemics, generative or OT frameworks add expressive power — but they all retain a notion of contrastive vs predictable variation.
Variants and edge cases
- Neutralization. Phonemic contrasts can be lost in specific environments. German final-obstruent devoicing turns /d/ into [t] word-finally: Rad /raːd/ "wheel" surfaces as [raːt], homophonous with Rat "advice." The /d/–/t/ contrast is neutralized.
- Archiphoneme. When a contrast is neutralized, some traditions write the result with a capital letter (e.g., /T/) to indicate the neutralized cover phoneme.
- Phonemic split. Historical sound change can convert allophones into phonemes. Old English [f] and [v] were allophones of /f/; loanwords from French and the loss of conditioning environments turned them into separate phonemes /f/ and /v/, giving modern fan vs van.
- Phonemic merger. The reverse: two phonemes collapse into one. The Mary–marry–merry merger in much of American English collapses /eɪ/, /æ/, /ɛ/ before /ɹ/ into a single vowel.
- Marginal phonemes. Some phonemes appear only in loanwords or interjections. English /ʒ/ (in measure, azure) is restricted to French loans; some analysts treat it as a marginal phoneme.
Common pitfalls
- Treating spelling as phonology. English knight and night are written differently but pronounced identically. Phonemes are about sounds, not letters.
- Assuming allophones don't matter. They do — they reveal the phonological rules a speaker has internalized, and they are the source of foreign accents.
- Forgetting phonetic similarity. Complementary distribution alone does not establish allophony; the sounds must also be plausibly related (phonetically similar). [h] and [ŋ] fail this requirement.
- Cross-linguistic transfer. Native English speakers learning Hindi often fail to produce or perceive the aspiration contrast — because their native phonemes group the relevant sounds together.
- Overgeneralizing one-to-one mapping. Phoneme-to-allophone mapping is many-to-many. American flap [ɾ] is an allophone of both /t/ and /d/; the underlying contrast is neutralized in that environment.
- Confusing phonemes with letters of the IPA. The IPA is a phonetic alphabet; phonemic transcription uses IPA symbols but selects only the contrastive ones for the language being described.
Frequently asked questions
What is the difference between a phoneme and an allophone?
A phoneme is a contrastive sound: replacing it with another phoneme in the same word produces a different word ("pin" → "bin"). An allophone is a predictable, non-contrastive variant of a phoneme: replacing it with another allophone of the same phoneme leaves the word unchanged in meaning, even if the result sounds odd or foreign-accented. English [pʰ] in "pin" and [p] in "spin" are allophones of /p/.
How do linguists tell phonemes and allophones apart?
Three diagnostics: (1) the minimal-pair test — if swapping the two sounds yields a meaning change, they are separate phonemes; (2) complementary distribution — if the two sounds never occur in the same environment, they are likely allophones of one phoneme; (3) phonetic similarity — sounds in complementary distribution must also be phonetically similar to be analyzed as one phoneme. English [h] and [ŋ] are in complementary distribution but are not phonetically similar enough, so they remain separate phonemes.
Why is aspiration phonemic in Hindi but allophonic in English?
Hindi has minimal pairs distinguished only by aspiration: pal /pəl/ "moment" vs phal /pʰəl/ "fruit," and similar /tə-tʰə/, /kə-kʰə/ contrasts. So Hindi treats /p/ and /pʰ/ as different phonemes. English never uses aspiration to distinguish words; instead, [pʰ] appears predictably word-initially in stressed syllables, and [p] elsewhere — for example after /s/ in "spin." Same physical sounds, different phonological status.
What is complementary distribution?
Two sounds are in complementary distribution when each appears only in environments where the other never appears. English aspirated [pʰ] occurs at the start of a stressed syllable; unaspirated [p] occurs everywhere else (after /s/, in unstressed syllables, syllable-finally). Their environments do not overlap, so neither can ever form a minimal pair with the other. That is the textbook signature of allophones.
Can a single sound belong to two phonemes at once?
Yes — this is called overlapping distribution or neutralization. In American English, the flapped [ɾ] in "latter" and "ladder" is the same surface sound, but it is the allophone of /t/ in one word and /d/ in the other. The phonemic contrast /t/ vs /d/ is neutralized in this position. The mapping from phoneme to allophone is not always one-to-one.
Are phonemes psychologically real?
Most evidence says yes. Native speakers report that allophones "sound the same" but recognize phoneme swaps instantly as different words. Cross-linguistic perception experiments (Werker & Tees 1984) show infants begin life able to discriminate any phonetic contrast, then lose discrimination of contrasts not used phonemically in their input by ~10 months. Adults often cannot reliably hear non-native phoneme contrasts (Japanese /l/ vs /r/; Spanish /b/ vs /v/).