Syntax
Island Constraints
Why some sentences can't be questioned, no matter how you try
Island constraints are syntactic configurations from which extraction — wh-movement, topicalisation, relativisation — is blocked. The term comes from John Robert Ross's 1967 MIT dissertation Constraints on Variables in Syntax, which catalogued the configurations as "islands" that wh-elements cannot escape: the Complex NP Constraint, the Wh-Island Constraint, the Sentential Subject Constraint, the Coordinate Structure Constraint, and the Left Branch Condition. Later work — Chomsky's Subjacency (1973) and Phase Impenetrability (2000), Cinque's weak/strong island distinction (1990), and Huang's 1982 dissertation on Mandarin — reduced and unified the inventory.
- Foundational referenceRoss, Constraints on Variables in Syntax (MIT, 1967)
- Original five islandsComplex NP, Wh, Sentential Subject, Coordinate Structure, Left Branch
- Unifying principleSubjacency (Chomsky 1973); Phase Impenetrability (2000)
- Strong vs weakCinque, Types of A-bar Dependencies (1990)
- Cross-linguistic robustnessStrong islands universal; weak islands parameterised
- Resumptive escapeHebrew, Irish, Welsh use a pronoun instead of a gap
Interactive visualization
Press play, or step through manually. The visualization is yours to drive — try it before reading on.
Watch the 60-second explainer
A condensed visual walkthrough — narrated, captioned, under a minute.
How an island blocks extraction
Wh-movement is normally permissive. You can ask "What did Mary buy?" from a one-clause sentence; you can ask "What does John think Mary bought?" from a two-clause one; you can ask "What does John think Bill said Mary bought?" from a three-clause one. Long-distance extraction is fine, even across many clause boundaries, as long as each step is local enough.
But certain configurations refuse to let the wh-element escape. Compare the grammatical (1) with the ungrammatical (2):
(1) What does John know that Mary bought ___? OK
(2) *What does John know the claim that Mary bought ___? *
In (1), "that Mary bought" is a finite clause complement of "know". Wh-movement can target the embedded SpecCP as an intermediate stop and then continue up. In (2), the same string is wrapped in the noun "the claim", making the whole thing a complex NP. Now the wh-word is trapped: the noun phrase is an island. No matter how the listener tries to interpret the question, the sentence falls apart.
This is the core puzzle Ross identified. The intuition that "long extraction is hard" doesn't explain it — the ungrammatical (2) is no longer than the grammatical (1) — and a purely semantic explanation fails because the meaning is recoverable. The block must be syntactic: certain phrase-structure configurations seal their interior against movement.
Ross's original islands (1967)
Ross's dissertation is a 600-page catalogue of the configurations. The four most cited:
| Island | Configuration | Grammatical baseline | Island violation |
|---|---|---|---|
| Complex NP (relative clause) | NP containing a relative clause | What did John meet the man who bought? | *What did John meet [the man who bought ___]? |
| Complex NP (noun complement) | NP containing a finite clause complement | What did John believe that Mary bought? | *What did John believe [the claim that Mary bought ___]? |
| Wh-island | Embedded clause with filled SpecCP | What did Mary say John bought? | ?What did Mary wonder [whether John bought ___]? |
| Sentential subject | Clause functioning as a subject | — | *What is [that John bought ___] surprising? |
| Coordinate structure | One conjunct of a coordination | — | *What did John buy [a hat and ___]? |
| Adjunct (Huang 1982) | Adjunct clause | — | *What did John leave [before buying ___]? |
| Subject (Huang 1982) | Subject phrase, including derived subjects | — | *Who did [a picture of ___] surprise Mary? |
| Left Branch (Ross 1967) | Pre-nominal modifier | — | *Whose did you read ___ book? |
The Coordinate Structure Constraint has a famous exception, the "across-the-board" movement: What did John buy and Mary sell? is grammatical because the gap appears in both conjuncts. Williams (1978) gave the canonical analysis.
Worked example: "What does John know that Mary bought?" vs the violation
Take the legal example first:
What does John know that Mary bought ___?
The derivation:
- Base: John knows that Mary bought what.
- Move "what" to embedded SpecCP: John knows [CP what (that) Mary bought ___].
- Move "what" from embedded SpecCP to matrix SpecCP: [CP What does John know [CP ___ that Mary bought ___]].
Each step is local — between adjacent CP edges, no more than one bounding node crossed. Subjacency is satisfied, and the sentence is fine.
Now the violation:
*What does John know the claim that Mary bought ___?
The same wh-element starts in the same place, but its containing structure is now the claim that Mary bought what — a noun phrase with a clause-complement modifier. To leave the embedded clause, the wh-word must escape both the inner CP and the dominating NP simultaneously. There is no intermediate landing site within the NP for it to stop at. Subjacency (Chomsky 1973) is violated: the movement crosses both an S and an NP node in a single step. The result is sharply ungrammatical for English speakers — the canonical complex NP island violation.
Strong vs weak islands
Cinque (1990, Types of A-bar Dependencies) refined the picture. Some islands block all extraction; others only block certain kinds:
| Island type | Argument extraction (who/what) | Adjunct extraction (how/why/how-many) | Class |
|---|---|---|---|
| Complex NP | Blocked | Blocked | Strong |
| Adjunct | Blocked | Blocked | Strong |
| Subject | Blocked | Blocked | Strong |
| Wh-island | Marginal: ?Who do you wonder whether Mary saw? | Blocked: *Why do you wonder whether Mary left? | Weak |
| Factive island | Marginal | Blocked | Weak |
| Negative island | Marginal | Blocked | Weak |
The argument/adjunct asymmetry is the diagnostic for weakness. Adjuncts (how, why, how-much) carry less referential content and are more easily blocked by intervening operators; arguments (who, what) carry more, and can sometimes squeeze past.
Parametric variation
Most strong islands hold across languages, but the details vary:
- Italian (Rizzi 1982). Allows extraction from some wh-islands that English blocks: Tuo fratello, a cui mi domando che storie abbiano raccontato — "Your brother, to whom I wonder what stories they told" is fine. Rizzi attributed this to Italian taking IP rather than CP as a bounding node.
- Swedish (Engdahl 1985). Allows extraction from some relative clauses: Det huset känner jag en man som har målat — "That house I know a man who has painted". The acceptable cases involve definite, presuppositional structures.
- Hebrew, Irish, Welsh. Use resumptive pronouns to bypass islands: The man that Mary saw him — the pronoun fills the gap and avoids the violation. Resumption is the most common cross-linguistic island-rescue strategy.
- Mandarin and Japanese. Show island effects on in-situ wh-phrases too, evidence for covert wh-movement at LF (Huang 1982, Watanabe 1992).
Theoretical accounts
| Account | Author / year | Key idea |
|---|---|---|
| Subjacency | Chomsky 1973 | Movement cannot cross two bounding nodes (NP, S) in one step |
| ECP | Chomsky 1981 | Empty Category Principle: traces must be properly governed |
| Barriers | Chomsky 1986 | Movement-blocking phrases (barriers) defined relative to L-marking |
| Relativised Minimality | Rizzi 1990 | Movement of α blocked by intervening β of the same feature class |
| Phases | Chomsky 2000, 2001 | vP and CP are phases; their interiors become opaque after Spell-Out |
| Processing | Hofmeister & Sag 2010 | Some islands reduce to memory load and integration cost, not grammar |
The grammar-vs-processing debate has been particularly active since the 2000s. Sprouse, Wagers, and Phillips (2012) used factorial designs to argue that at least some islands (whether-islands, complex-NP) cannot be explained by working-memory load alone — extraction is harder than predicted by the additive cost of length plus complexity.
Variants and edge cases
- Parasitic gaps. A second gap inside an adjunct island becomes acceptable when there is a primary gap outside it: Which paper did you file ___ without reading ___? The licit gap "parasitises" on the legitimate one (Engdahl 1983).
- Across-the-board movement. Coordinate structures permit extraction if all conjuncts share the gap: What did John buy ___ and Mary sell ___?
- Sluicing as island repair. Ross (1969) noted that sluicing — John bought something but I don't know what — can repair some island violations, suggesting that the violation is a PF rather than LF effect.
- Relativisation tracks wh-movement. The same islands constrain relative-clause formation; resumptive pronouns are the main escape hatch in languages that allow them.
- Negative islands. "How didn't John fix it?" is degraded compared with the positive counterpart; negation creates a weak island for adjuncts.
Common pitfalls
- Treating every long extraction as island-violating. Long-distance wh-movement across multiple complement clauses is fine; islands are specific configurations, not a function of length.
- Confusing complex NP with relative clause. The Complex NP Constraint covers two distinct cases: noun-complement clauses ("the claim that ...") and relative clauses ("the man who ..."). Both are islands, but the explanations sometimes diverge.
- Assuming Italian-style wh-island violations exist in English. Italian and other Romance languages allow some wh-island extraction; English does not. Translation problems arise from the difference.
- Ignoring resumption. Hebrew and Irish embed pronouns where English would have a gap, and the result is grammatical. This is not an English-style violation, but a different repair strategy.
- Reducing all island effects to processing. Hofmeister and Sag (2010) make a real point, but Sprouse and colleagues have shown that working memory alone undergenerates the data. The grammar-only and processing-only positions are both extreme.
Frequently asked questions
Why are they called islands?
John Robert Ross's metaphor: a constituent that blocks extraction is like an island — material inside cannot escape to the mainland. The metaphor stuck because it captures the asymmetry. The wh-element can move freely through ordinary clauses, but once it tries to leave a relative clause, an adjunct, or a complex NP, the bridge collapses. Ross's 1967 MIT dissertation Constraints on Variables in Syntax, published as Infinite Syntax! in 1986, cataloged the configurations and gave each one its name: complex NP island, wh-island, sentential subject island, coordinate structure island.
What is the Complex NP Constraint?
Ross's first island: no element may be moved out of an NP that contains an S (clause) modifier. "What does John know that Mary bought?" is fine — the embedded clause is a verb's complement. But "*What did John believe the claim that Mary bought?" is ungrammatical — "the claim that Mary bought what" is a complex NP, and you cannot extract "what" out of it. The same applies to relative clauses: "*What did John meet the man who bought?" is bad because "who bought what" is a relative-clause modifier inside the NP "the man".
What is a wh-island?
A wh-island is an embedded clause already introduced by a wh-element. "*What do you wonder whether Mary bought?" is degraded because "whether" occupies the embedded SpecCP, leaving no room for "what" to stop there on its way up. Subjacency (Chomsky 1973) explained the violation: movement cannot cross two bounding nodes in one step, and a filled SpecCP forces the second step to traverse two clauses at once. Wh-islands are weaker than complex NP islands — many speakers find them mildly degraded rather than fully ungrammatical.
What are weak vs strong islands?
Cinque (1990, Types of A-bar Dependencies) divided islands into strong (complete blocks: complex NP, adjunct, subject) and weak (block argument extraction less than adjunct extraction: wh-islands, factive islands, negative islands). Test: strong islands forbid "who" and "what" alike; weak islands let "who/what" out but block "how", "why", "how many". "Who do you wonder whether Mary saw?" is marginal but possible; "*Why do you wonder whether Mary left?" is sharply worse. The argument/adjunct asymmetry is the core diagnostic for weak-island status.
Do all languages have the same islands?
Most island effects are robust cross-linguistically — every well-studied language shows complex-NP and adjunct effects in some form. But the boundaries and severities vary. Italian (Rizzi 1982) allows extraction from some wh-islands that English blocks, suggesting a parametrised choice of bounding node. Japanese in-situ wh-phrases obey island constraints (Huang 1982 on Mandarin, Watanabe 1992 on Japanese). Resumptive-pronoun languages (Hebrew, Irish, Welsh) avoid islands by leaving an overt pronoun in the gap site. Swedish allows extraction from some relative clauses (Engdahl 1985) that English does not.
Are islands purely syntactic or partly processing?
Hotly debated. The classical view (Chomsky 1973, Cinque 1990) treats islands as syntactic constraints encoded in the grammar. Hofmeister and Sag (2010), drawing on experimental work, argue that many island effects reduce to processing difficulty — long, complex extractions are simply hard to compute, and the perceived ungrammaticality is a memory effect rather than a grammatical violation. Sprouse, Wagers, and Phillips (2012) used factorial designs to argue that at least some islands cannot be reduced to processing alone. The grammar-vs-processing debate is live.