Syntax

Ellipsis (Linguistic)

The grammar of what isn't said — silent structure that listeners reconstruct effortlessly

Linguistic ellipsis is the omission of words whose meaning is recoverable from context. "John can swim and Mary can too" deletes the second VP; "John ate something but I don't know what" deletes everything except the wh-remnant. The phenomenon was systematized by John Robert Ross's 1969 dissertation work, which named sluicing, and elaborated by Ivan Sag's 1976 MIT thesis "Deletion and Logical Form". Ellipsis violates the usual one-to-one mapping between sound and meaning — speakers retrieve unspoken material from a salient antecedent. Standard subtypes include VP-ellipsis, gapping, sluicing, pseudogapping, NP-ellipsis, stripping, and answer fragments. The central debate is whether the unpronounced material is syntactically present (PF-deletion, Merchant 2001) or semantically reconstructed from a discourse antecedent (LF-copying or pro-form).

  • First systematic studyRoss (1969); Sag (1976)
  • Modern referenceMerchant, The Syntax of Silence (2001)
  • SubtypesVPE, gapping, sluicing, NP-ellipsis, stripping, pseudogapping, fragment answers
  • Identity conditione-GIVENness (Merchant); focus identity (Rooth)
  • Key ambiguitySloppy vs. strict pronoun readings
  • Cross-linguisticArgument ellipsis (Japanese, Korean); null subjects (Spanish, Italian)

Interactive visualization

Press play, or step through manually. The visualization is yours to drive — try it before reading on.

Open visualization fullscreen ↗

Watch the 60-second explainer

A condensed visual walkthrough — narrated, captioned, under a minute.

How ellipsis works

Open any ordinary conversation transcript and you will find sentences that are, on the surface, ungrammatical. "Yes, I can." "No, she didn't." "A coffee, please." "The big one." Each is a fragment, but each communicates a full proposition because the listener fills in unpronounced material from the surrounding discourse. Linguistic ellipsis is the formal study of when this filling-in is licensed and how the missing structure relates to the visible structure.

The central question is what occupies the silent slot. Three positions dominate the literature:

  1. PF-deletion. Full syntactic structure is built and interpreted; phonology then fails to pronounce a designated subtree. Sag (1976) introduced the position; Merchant (2001) gave it the modern formulation.
  2. LF-copying. The ellipsis site contains an empty pro-form that is filled at logical form by copying semantic structure from the antecedent. Williams (1977), Hardt (1993).
  3. Pro-form / null anaphor. The ellipsis site is a designated empty category — a null counterpart to do so or it — interpreted directly without internal syntax. Lobeck (1995).

Each view captures different data. Movement out of an elided VP ("I know which book John read but not which one Mary did") argues for PF-deletion: the wh-phrase escapes from internal structure that must be syntactically real. Discourse-only antecedents (a doorbell rings; you say "I'll get it" — no linguistic antecedent for "it" exists) argue for semantic recovery. Modern hybrid theories (Elbourne 2008) accept that different elliptical constructions sit at different points on the syntax-semantics gradient.

The major types of ellipsis

TypeExampleWhat is elidedLicensor / remnant
VP-ellipsis (VPE)John can swim, and Mary can too.Verb phrase after auxiliaryAuxiliary, modal, infinitival to, do-support
SluicingJohn ate something, but I don't know what.Embedded IP/CP under a wh-remnantWh-phrase in Spec,CP
GappingJohn ate fish, Mary chicken.Finite verb (and sometimes more) in a coordinate clauseCoordination; two contrastive remnants
PseudogappingJohn gave Bill a book, and Mary will Sue.Verb only; auxiliary and object remainAuxiliary plus contrastive XP remnant
NP-ellipsisJohn's car is faster than Mary's.Head noun and modifiers after a possessor or determinerPossessive marker or strong determiner
Stripping (bare argument ellipsis)John left, and Mary too.Everything but a single XP and a polarity elementCoordinator plus too, also, not
Fragment answerQ: What did Mary eat? — A: Fish.Everything but the focused constituentQuestion-answer pair; focus on remnant
Antecedent-contained deletionJohn read every book that Mary did.VP whose antecedent properly contains the ellipsis siteQuantifier raising of object out of VP

The diagnostics distinguishing these are subtle. Pseudogapping looks like VPE but leaves an object remnant; gapping refuses to nest under embedding ("*John said that he ate fish, and Bill that chicken" is bad). Sluicing tolerates island violations that overt wh-movement would not — the "island repair" effect (Ross 1969; Merchant 2008). NP-ellipsis is licensed in English only by certain determiners; possessor 's licenses it ("Mary's"), demonstrative "this" does not ("*this" alone). These language-specific licensing facts are part of why the typology runs deep.

Worked example: sluicing in detail

Take the sentence John ate something, but I don't know what. The second clause looks like an isolated wh-word, but it is interpreted as "I don't know what John ate." Merchant's (2001) PF-deletion analysis works as follows:

  1. The embedded clause is built fully: [CP what [C' [IP John ate t]]].
  2. The wh-phrase "what" moves to Spec,CP as in any embedded question.
  3. The IP "John ate t" satisfies the e-GIVENness identity condition relative to the antecedent IP "John ate something" (modulo focus on the wh-existential pair).
  4. An [E] feature on C licenses non-pronunciation of its IP complement at PF.
  5. The result: a single remnant "what" is heard, but the meaning is the full embedded question.

This account predicts that sluicing should support overt connectivity effects — case marking, preposition stranding, complementizer choice — that match the elided structure rather than the surface form. Russian sluiced wh-phrases bear the case the elided verb assigns; German sluicing reflects whether the antecedent verb takes accusative or dative. These "form-identity" effects (Merchant 2008) are central evidence for syntactic structure inside the silent IP.

Counterarguments and alternative views

Not all syntacticians accept that ellipsis sites contain full hidden syntax. Three classes of counterargument shape the modern debate:

  • Sag-style mismatches. Voice mismatches in pseudogapping ("The information was leaked by Mary, just as John had hoped to") and binding-condition mismatches show that some elliptical relations are looser than full syntactic identity would predict. Critics argue this favors a semantic identity condition.
  • Construction grammar. Goldberg and others treat fragments and elliptical replies as constructions in their own right — pairings of form and meaning learned independently — rather than derivatives of full clauses. This sidesteps the problem of recovering hidden structure by denying it exists.
  • Discourse-driven views. Hardt (1993, 1999) and Hankamer-Sag (1976) distinguish "surface anaphora" (requiring syntactic antecedent, e.g. VPE) from "deep anaphora" (allowing discourse antecedents, e.g. do it). They argue ellipsis straddles both — discourse alone licenses some null sites, undermining a uniformly syntactic theory.

The debate is unresolved. Most working syntacticians treat PF-deletion as the default for VPE, sluicing, gapping, and fragments, while reserving semantic or pro-form analyses for cases that resist syntactic identity.

Variants and edge cases

  • Antecedent-contained deletion. "John read every book that Mary did." Solved by quantifier raising the object at LF (May 1985; Sag 1976) so the antecedent VP shrinks below the ellipsis site.
  • Comparative ellipsis. "Mary read more books than John did." A close cousin of ACD; Kennedy (1997) extends the QR analysis.
  • Backward ellipsis. "Whoever can ___ should bring the dessert" in cleft-like structures with cataphoric resolution. Rare in English, more common in head-final languages.
  • Right-node raising. "John likes, and Mary hates, garlic." Often grouped with ellipsis but typically analyzed as across-the-board movement (Postal 1974).
  • Argument ellipsis (Japanese). A bare null object that licenses sloppy readings. Otani and Whitman (1991) argued this is a covert VPE; Oku (1998) and Saito (2007) argued for a separate argument-ellipsis mechanism.
  • Multiple sluicing. "John gave something to someone, but I don't know what to whom" — multiple wh-remnants, attested in Bulgarian, Romanian, and Japanese.

Common pitfalls in analysis

  • Confusing ellipsis with pro-forms. "John left, and Bill did too" (VPE) versus "John left, and Bill did so too" (do-so anaphora). The latter has internal pro-form structure; the former has hidden full structure. Diagnostics differ — extraction is allowed under VPE but not under do so.
  • Treating gapping as VPE. Gapping requires coordination and contrastive remnants; VPE does not. Gapping cannot embed; VPE can.
  • Ignoring focus structure. Sloppy readings, fragment answers, and contrastive remnants are governed by focus, not bare syntax. Rooth's (1992) focus identity and Merchant's e-GIVENness both encode this.
  • Forgetting recoverability. A deletion that is not recoverable from context is not ellipsis — it's ungrammatical or a fragment that fails to communicate. The Aspects (1965) recoverability principle remains baseline.
  • Overgeneralizing from English. English VPE is unusually well-licensed because of its rich auxiliary system. Many languages license different cuts of the structure; do not assume English is universal.

Frequently asked questions

What is sluicing?

Sluicing is ellipsis of a clausal complement under a wh-remnant. "John ate something, but I don't know what" leaves only "what"; the elided material is interpreted as "John ate." The construction was named in John Robert Ross's 1969 paper "Guess Who?" — the title is itself a sluice. Jason Merchant's 2001 book The Syntax of Silence systematized the modern theory: sluicing requires syntactic identity between the ellipsis site and its antecedent (his e-GIVENness condition), the wh-remnant moves out of the elided IP, and ellipsis at PF deletes the remainder. Sluicing licenses long-distance dependencies that would otherwise be ungrammatical — so-called "sluicing repair" of island violations, e.g. "They want to hire someone who speaks a Balkan language, but I don't know which."

What is the difference between VP-ellipsis and gapping?

VP-ellipsis (VPE) deletes a verb phrase but leaves auxiliaries: "John can swim and Mary can too." Gapping deletes a finite verb (and sometimes more) in a coordinated clause: "John ate fish, Mary chicken." The diagnostics differ. VPE is licensed by an auxiliary or modal in English ("can," "will," "to," do-support); gapping requires coordination, no auxiliary remains. VPE permits sloppy and strict readings of pronouns; gapping is restricted to a flatter parallelism. VPE is found across English and many Germanic languages; gapping is widely available cross-linguistically including in languages with no equivalent of VPE. Theorists treat VPE as PF-deletion of a fully syntactic VP (Merchant 2001), and gapping variously as across-the-board movement (Johnson 2009) or as ellipsis of a higher constituent.

What is antecedent-contained deletion?

Antecedent-contained deletion (ACD) is VP-ellipsis whose ellipsis site is contained inside its own antecedent: "John read every book that Mary did." The antecedent VP "read every book that Mary did" contains the ellipsis site "did". Naïvely substituting the antecedent into the gap creates infinite regress. May (1985) and Sag (1976) solved this with quantifier raising: the object "every book that Mary did" raises out of VP at LF, leaving a smaller antecedent VP whose copy fits the ellipsis site. ACD is a major argument for covert movement at the syntax-semantics interface. Kennedy (1997) extended the analysis to comparatives: "more books than Mary did."

Is ellipsis a syntactic or semantic phenomenon?

Both camps exist. The PF-deletion view (Sag 1976; Merchant 2001) holds that ellipsis sites contain full syntactic structure that is unpronounced at phonological form. Evidence: extraction out of ellipsis ("I know which book John read but not which Mary did") suggests internal syntax; voice and binding mismatches sometimes block ellipsis even when paraphrasable. The LF-copying or pro-form view (Hardt 1993; Lobeck 1995) holds that ellipsis sites are pro-forms reconstructed semantically. Evidence: discourse-only antecedents ("A: Are you bringing wine? B: I will"), free relatives, and split antecedents support a discourse-driven recovery. Modern hybrid accounts (Elbourne 2008) treat some ellipsis as syntactic and some as semantic.

What is sloppy versus strict identity?

Pronouns inside an elided VP can be interpreted strictly (referring to the same entity as in the antecedent) or sloppily (rebound to the new subject). "John loves his mother and Bill does too" is ambiguous: strict — Bill loves John's mother; sloppy — Bill loves Bill's mother. The ambiguity was first noted by Ross (1967) and is a flagship test for ellipsis theories. The PF-deletion view derives both readings from one structure with two binding options on "his". The semantic-copy view treats sloppy as a bound-variable reading and strict as a coreference reading. Sloppy readings are sometimes blocked by structural mismatches, motivating finer parallelism conditions like Rooth's (1992) focus condition or Merchant's e-GIVENness.

Do all languages have ellipsis?

All languages permit some recoverable omission, but the inventory varies. Japanese, Korean, and Chinese allow "argument ellipsis" — bare null subjects and objects with a discourse antecedent. Japanese null objects (Otani and Whitman 1991) license sloppy readings, similar to English VPE. Spanish and Italian have rich agreement that licenses null subjects without ellipsis proper. Hebrew has a productive VP-ellipsis-like construction (Doron 1990). Languages with strict isolating morphology like Mandarin tend to favor pro-forms over deletion. Cross-linguistic surveys (Aelbrecht, Haegeman, Nye 2012) document a typology of licensing conditions, identity requirements, and remnant possibilities.

What is the recoverability condition?

Chomsky's Aspects of the Theory of Syntax (1965) introduced the principle that deletion must be "recoverable" — the missing material must be reconstructible from context. The principle has been refined as semantic identity (Sag 1976), e-GIVENness (Merchant 2001), and Rooth-style focus matching (Rooth 1992). Modern formulations require that the elided constituent be mutually entailing (or focus-equivalent) with its antecedent up to focus-marked positions. "John bought a Honda and Mary did too" — the second clause's elided VP must be semantically equivalent to "bought a Honda." Recoverability rules out random deletion; it is what makes ellipsis interpretable rather than gibberish.