Phonology

Phoneme vs Allophone

Same sound, different status — depending on the language

A phoneme is the smallest sound unit that can distinguish one word from another in a language. An allophone is a predictable, automatic variant of a phoneme that never changes a word's meaning. The very same articulations — aspirated [pʰ] versus unaspirated [p] — are allophones of a single phoneme /p/ in English, but two separate phonemes /pʰ/ and /p/ in Hindi. Whether a sound difference is phonemic or allophonic is a property of the language, not of the acoustics.

  • Phoneme notationSlashes /p/
  • Allophone notationSquare brackets [pʰ]
  • Diagnostic for phonemesMinimal-pair test
  • Diagnostic for allophonesComplementary distribution
  • FoundersBaudouin de Courtenay, Sapir, Trubetzkoy
  • StatusLanguage-specific, not universal

Interactive visualization

Press play, or step through manually. The visualization is yours to drive — try it before reading on.

Open visualization fullscreen ↗

Watch the 60-second explainer

A condensed visual walkthrough — narrated, captioned, under a minute.

The core idea

Take any consonant — say, the /p/ in English. Pronounce pin aloud while holding a sheet of paper an inch from your lips. The paper jumps: there is a strong puff of air. Now pronounce spin. The paper barely moves. Same letter, same phoneme, but two physically different sounds: aspirated [pʰ] and plain [p]. English speakers almost never notice the difference, because in English the difference is automatic — predictable from position — and never used to distinguish words.

Now switch languages. In Hindi, the words pal /pəl/ "moment" and phal /pʰəl/ "fruit" differ only in aspiration. Hindi speakers hear them as obviously different words, the way English speakers hear "pin" and "bin." The same articulation that English ignores, Hindi treats as a meaning-bearing contrast.

This is the foundational insight of twentieth-century phonology. Sound differences come in two kinds, and the kinds are language-specific:

  • Phonemic (contrastive) differences distinguish words. Replacing one with the other yields a different word, or a non-word. Marked with slashes: /p/ vs /b/.
  • Allophonic (non-contrastive) differences are predictable variants of a single phoneme. Replacing one with the other leaves the word identifiable, even if it sounds accented. Marked with square brackets: [pʰ] vs [p].

The phoneme is an abstract category in the speaker's mental lexicon; allophones are the concrete realizations the mouth produces in context.

How to tell them apart

Phonologists use three diagnostics, applied in sequence.

  1. Minimal-pair test. Look for two words that differ in exactly one segment, in the same position, and that mean different things. Pin/bin proves /p/ and /b/ contrast in English. If a minimal pair exists, the sounds are different phonemes. Done.
  2. Complementary distribution. If you cannot find a minimal pair, list every environment in which each sound appears. If the environments never overlap — sound A only here, sound B only there — they are probably allophones of one phoneme. English [pʰ] only appears at the start of a stressed syllable; English [p] only appears elsewhere. The environments are mutually exclusive.
  3. Phonetic similarity. Complementary distribution alone is not sufficient. English [h] only appears syllable-initially and [ŋ] only appears syllable-finally — so technically complementary — but the two sounds are too phonetically dissimilar (a glottal fricative vs a velar nasal) to be plausibly grouped. They remain separate phonemes by convention.

A fourth case, free variation, complicates the picture: two sounds appear in the same environment but neither contrast nor distribute predictably. English word-final /t/ can be released [t] or unreleased [t̚] in the same word ("cat") with no meaning change and no rule predicting which. They are allophones in free variation.

Cross-linguistic data: aspiration

The cleanest cross-linguistic illustration is the status of voiceless aspirated and unaspirated stops. The same articulations are organized into different phoneme inventories.

LanguageSounds presentStatusEvidence
English[pʰ], [p], [tʰ], [t], [kʰ], [k]Allophones of /p/, /t/, /k/No minimal pairs; aspiration predictable from stress and position
Hindi[pʰ], [p], [tʰ], [t], [kʰ], [k]Distinct phonemespal /pəl/ "moment" vs phal /pʰəl/ "fruit"
Mandarin[pʰ], [p], [tʰ], [t], [kʰ], [k]Distinct phonemes /pa/ "eight" vs 趴 /pʰa/ "lie prone"
Korean[pʰ], [p], [p͈] (and tense)Three-way phoneme contrastpal /pal/ "foot" vs phal /pʰal/ "arm" vs ppal /p͈al/ "suck"
French[p] only (no [pʰ])Single phoneme /p/French speakers don't aspirate; English /p/ sounds "puffy" to them
Spanish[p] only (no [pʰ])Single phoneme /p/Same as French; English-sounding aspiration is L1 transfer
Thai[pʰ], [p], [b]Three phonemes/paː/ "forest" vs /pʰaː/ "to split" vs /baː/ "shoulder"

Acoustically, the [pʰ] in an English speaker's pin and the [pʰ] in a Hindi speaker's phal are essentially identical. Phonologically, one is a context-conditioned variant of /p/ and the other is a phoneme on its own. Phonemic status is a fact about the language's lexicon, not the speaker's vocal tract.

English /t/: an allophone factory

Each English phoneme typically has several allophones, distributed by position. The phoneme /t/ is a particularly rich example.

AllophoneEnvironmentExample wordApproximate sound
[tʰ]Onset of stressed syllabletop, attackAspirated alveolar stop
[t]After /s/, in clustersstop, stillPlain alveolar stop
[ɾ] (flap)Between vowels, after stressbutter, cityQuick tongue-tap, like Spanish r
[ʔ] (glottal stop)Before syllabic /n/button, kittenThroat closure
[t̚] (unreleased)Word-finally, optionallycat, hotStop closure, no release burst
[t̪] (dental)Before /θ/, /ð/eighth, hit themTongue contacts teeth, not alveolar ridge

An English speaker hears all six as "the same sound." A learner tracking the spelling will produce them all as [tʰ] and sound foreign. The phoneme /t/ is a single mental category that surfaces as six (or more) physical realizations, governed by phonological rules the speaker has internalized without ever being taught.

Worked example: is [ɹ] a phoneme of English?

Suppose a field linguist arrives knowing nothing about English. They record a corpus and notice two rhotic-like sounds: a postalveolar approximant [ɹ] (as in red) and a labiodental approximant [ʋ] (as in some London speakers' red). Are these one phoneme or two?

  1. Minimal pairs? None. No English words distinguish [ɹ] from [ʋ].
  2. Complementary distribution? No. Both can appear word-initially. They overlap.
  3. Free variation? Yes — speaker- and dialect-dependent, with no rule predicting which.

Conclusion: [ɹ] and [ʋ] are allophones of /ɹ/ in free variation. The phoneme is /ɹ/; both surface forms realize it. Now compare to read /ɹiːd/ vs lead /liːd/: minimal pair, so /ɹ/ and /l/ are separate phonemes. Same diagnostic, different conclusion — because the data is different.

Competing theories

The phoneme/allophone distinction has been formalized in several frameworks, each with consequences for how phonologies are written.

TheoryKey claimPhoneme statusStrengthsWeaknesses
American structuralism (Bloomfield, Pike, Hockett)Phonemes are inferred bottom-up from distributionAn emic unit defined by contrastEmpirical, replicableCannot represent abstract underlying forms
Generative phonology (Chomsky & Halle 1968)Phonemes are underlying representations; allophones derived by ruleUnderlying form in lexiconCaptures alternations and rulesChoice of underlying form can be arbitrary
Natural phonology (Stampe 1973)Allophones reflect universal innate processes; phonemes are what survives suppressionPhonemic = unsuppressed natural formConnects child language to adult phonology"Naturalness" is hard to falsify
Optimality Theory (Prince & Smolensky 1993)No derivational rules; surface forms emerge from ranked constraintsPhonemes as input; allophony from constraint conflictUniversal constraint set, language-specific ranking"Input" can be opaque; phoneme inventory partly stipulated
Exemplar/usage-based modelsSpeakers store phonetic detail directly; categories emerge from clusteringPhoneme is a probabilistic cloud, not a discrete unitHandles gradience, social variation, lexical effectsRisk of losing the phoneme/allophone abstraction altogether

For practical phonological description and L2 teaching, the structuralist phoneme/allophone analysis remains the lingua franca. For theoretical work on opacity, alternations, and morphophonemics, generative or OT frameworks add expressive power — but they all retain a notion of contrastive vs predictable variation.

Variants and edge cases

  • Neutralization. Phonemic contrasts can be lost in specific environments. German final-obstruent devoicing turns /d/ into [t] word-finally: Rad /raːd/ "wheel" surfaces as [raːt], homophonous with Rat "advice." The /d/–/t/ contrast is neutralized.
  • Archiphoneme. When a contrast is neutralized, some traditions write the result with a capital letter (e.g., /T/) to indicate the neutralized cover phoneme.
  • Phonemic split. Historical sound change can convert allophones into phonemes. Old English [f] and [v] were allophones of /f/; loanwords from French and the loss of conditioning environments turned them into separate phonemes /f/ and /v/, giving modern fan vs van.
  • Phonemic merger. The reverse: two phonemes collapse into one. The Mary–marry–merry merger in much of American English collapses /eɪ/, /æ/, /ɛ/ before /ɹ/ into a single vowel.
  • Marginal phonemes. Some phonemes appear only in loanwords or interjections. English /ʒ/ (in measure, azure) is restricted to French loans; some analysts treat it as a marginal phoneme.

Common pitfalls

  • Treating spelling as phonology. English knight and night are written differently but pronounced identically. Phonemes are about sounds, not letters.
  • Assuming allophones don't matter. They do — they reveal the phonological rules a speaker has internalized, and they are the source of foreign accents.
  • Forgetting phonetic similarity. Complementary distribution alone does not establish allophony; the sounds must also be plausibly related (phonetically similar). [h] and [ŋ] fail this requirement.
  • Cross-linguistic transfer. Native English speakers learning Hindi often fail to produce or perceive the aspiration contrast — because their native phonemes group the relevant sounds together.
  • Overgeneralizing one-to-one mapping. Phoneme-to-allophone mapping is many-to-many. American flap [ɾ] is an allophone of both /t/ and /d/; the underlying contrast is neutralized in that environment.
  • Confusing phonemes with letters of the IPA. The IPA is a phonetic alphabet; phonemic transcription uses IPA symbols but selects only the contrastive ones for the language being described.

Frequently asked questions

What is the difference between a phoneme and an allophone?

A phoneme is a contrastive sound: replacing it with another phoneme in the same word produces a different word ("pin" → "bin"). An allophone is a predictable, non-contrastive variant of a phoneme: replacing it with another allophone of the same phoneme leaves the word unchanged in meaning, even if the result sounds odd or foreign-accented. English [pʰ] in "pin" and [p] in "spin" are allophones of /p/.

How do linguists tell phonemes and allophones apart?

Three diagnostics: (1) the minimal-pair test — if swapping the two sounds yields a meaning change, they are separate phonemes; (2) complementary distribution — if the two sounds never occur in the same environment, they are likely allophones of one phoneme; (3) phonetic similarity — sounds in complementary distribution must also be phonetically similar to be analyzed as one phoneme. English [h] and [ŋ] are in complementary distribution but are not phonetically similar enough, so they remain separate phonemes.

Why is aspiration phonemic in Hindi but allophonic in English?

Hindi has minimal pairs distinguished only by aspiration: pal /pəl/ "moment" vs phal /pʰəl/ "fruit," and similar /tə-tʰə/, /kə-kʰə/ contrasts. So Hindi treats /p/ and /pʰ/ as different phonemes. English never uses aspiration to distinguish words; instead, [pʰ] appears predictably word-initially in stressed syllables, and [p] elsewhere — for example after /s/ in "spin." Same physical sounds, different phonological status.

What is complementary distribution?

Two sounds are in complementary distribution when each appears only in environments where the other never appears. English aspirated [pʰ] occurs at the start of a stressed syllable; unaspirated [p] occurs everywhere else (after /s/, in unstressed syllables, syllable-finally). Their environments do not overlap, so neither can ever form a minimal pair with the other. That is the textbook signature of allophones.

Can a single sound belong to two phonemes at once?

Yes — this is called overlapping distribution or neutralization. In American English, the flapped [ɾ] in "latter" and "ladder" is the same surface sound, but it is the allophone of /t/ in one word and /d/ in the other. The phonemic contrast /t/ vs /d/ is neutralized in this position. The mapping from phoneme to allophone is not always one-to-one.

Are phonemes psychologically real?

Most evidence says yes. Native speakers report that allophones "sound the same" but recognize phoneme swaps instantly as different words. Cross-linguistic perception experiments (Werker & Tees 1984) show infants begin life able to discriminate any phonetic contrast, then lose discrimination of contrasts not used phonemically in their input by ~10 months. Adults often cannot reliably hear non-native phoneme contrasts (Japanese /l/ vs /r/; Spanish /b/ vs /v/).