Syntax

Evidentiality

Grammar that forces you to say how you know

Evidentiality is the grammatical encoding of how a speaker came to know what they assert — by seeing it, hearing it, inferring it, or being told. About a quarter of the world's languages mark evidentiality obligatorily, with systems ranging from a two-way witnessed/non-witnessed split (Turkish, Bulgarian) to the five-way visual/non-visual/inferred/assumed/reported distinction of Tariana (Arawakan, Amazonia). English expresses the same notions only optionally and lexically, with adverbs like "reportedly" or hedges like "I heard".

  • Foundational referenceAikhenvald, Evidentiality (OUP, 2004)
  • Coverage~25% of world's languages mark grammatically
  • Smallest system2-way: firsthand vs non-firsthand (Turkish, Bulgarian)
  • Largest system5-way (Tariana); 6-way reported in some Tucanoan
  • English equivalentOptional adverbs: "reportedly", "apparently", "I heard"
  • Geographic hotspotsAndes, Amazonia, Caucasus, Balkans, Pacific NW

Interactive visualization

Press play, or step through manually. The visualization is yours to drive — try it before reading on.

Open visualization fullscreen ↗

Watch the 60-second explainer

A condensed visual walkthrough — narrated, captioned, under a minute.

How evidentiality works

An evidential is a grammatical morpheme — a suffix, clitic, particle, or auxiliary — that obligatorily marks the source of the speaker's information. The key word is obligatorily. Languages without grammatical evidentiality, like English, can express the same notions optionally with adverbs and parentheticals; languages with grammatical evidentiality cannot leave the source unspecified in the relevant clause types. If you tell someone in Tariana that it rained last night, the morphology of the verb forces you to commit to whether you saw the rain falling, heard it on the roof, inferred it from a wet street in the morning, assumed it because clouds had been gathering, or were told by a neighbour.

The Quechua family illustrates a typical three-way system. The suffixes -mi, -si, and -cha attach to the focused element of the clause:

  • Para-sha-n-mi — "It is raining" (direct evidence: I see the rain).
  • Para-sha-n-si — "It is raining (they say)" (reportative: someone told me).
  • Para-sha-n-cha — "It must be raining" (inferential: I see wet umbrellas).

Martina Faller's dissertation on Cuzco Quechua (2002) shows that these three suffixes are not interchangeable: a speaker who says -mi is committed to having direct evidence, and the hearer can challenge the assertion on those grounds. This is the diagnostic for genuine grammaticalisation — failing to use the right evidential is treated as a category error, not a stylistic lapse.

Tariana — the canonical five-way system

Alexandra Aikhenvald's fieldwork on Tariana, an Arawakan language with roughly 100 fluent speakers in the Vaupés region of Brazil, produced the most detailed grammar of evidentiality ever written (Aikhenvald 2003, A Grammar of Tariana from Northwest Amazonia). Tariana fuses tense and evidential into a single portmanteau suffix on the verb. For the simple past of na-keta "he found it":

SuffixTypeWhat the speaker is committing to
-kaVisualI saw him find it
-mahkaNon-visual sensoryI heard / smelled / felt the finding
-nihkaInferentialI see the result and infer the finding (he is holding the object)
-sikaAssumedI have general reason to believe it (he was searching all day)
-pidakaReportedSomeone told me he found it
(none)UngrammaticalYou cannot omit the suffix in past-tense statements

Tariana children acquire the system early. By age three they distinguish at least visual versus reported. Aikhenvald notes that mismatches — using -ka for events one did not actually witness — are treated as a moral failure on a level with overt lying.

Grammatical evidentiality vs English hearsay adverbs

Grammatical evidentiality (Tariana, Quechua)English lexical hedges
Obligatory?Yes, in the relevant tense/moodNo, always optional
FormBound morpheme on the verb or focusFree adverbs, parentheticals, "I heard" complements
Number of distinctions2 to 6 fixed grammatical categoriesOpen lexical class — "allegedly", "reportedly", "they say", "apparently"
Default if omittedSentence is ungrammaticalSentence is grammatical, source unspecified
Mismatch (wrong source)Treated as lyingTreated as imprecise wording
Acquisition orderBy age 3-4 (Papafragou et al. 2007 on Korean)Continues into adolescence
Interaction with questionMarks the source the questioner expects in the answerMostly absent; rises in indirect questions only

The functional load is different too. In English, "allegedly" cues distance and is mainly a journalistic register marker; in Tariana, the evidential is as routine as tense in English and carries no special stylistic colour.

Worked examples

Bulgarian (two-way: indicative vs renarrated). The renarrated mood (sometimes called the preizkazno naklonenie) is built from old perfect forms. Compare:

  • Ivan dojde. "Ivan came." (Indicative; I witnessed it.)
  • Ivan dosul. "Ivan came (they say)." (Renarrated; reported information.)

Newspapers and historical narrative use the renarrated form heavily; using the indicative for unwitnessed events sounds presumptuous (Friedman 1986).

Turkish (two-way: -DI direct vs -mIş indirect). The suffix -mIş covers reported, inferred, and surprise meanings:

  • Ali geldi. "Ali came." (Witnessed.)
  • Ali gelmiş. "Ali came, apparently / they say / it turns out." (Indirect — speaker did not directly observe arrival.)

Lars Johanson (2003) treats -mIş as an indirective rather than a pure evidential, because the cluster of meanings (reported, inferred, mirative surprise) tracks "the speaker is not the immediate source" rather than picking out one source type.

Eastern Pomo (Pacific Northwest, four-way). Sally McLendon (2003) describes four suffixes: -ink'e (non-visual sensory), -ine (inferential), -le (hearsay), and -ya (direct/visual). A statement about a person burning themselves picks a different suffix depending on whether the speaker saw it, smelled the burned skin, deduced it from a bandage, or was told by someone else.

Comparing evidential systems cross-linguistically

LanguageFamilyDistinctionsMarker typeObligatory?
TarianaArawakan5 (visual / non-visual / inferred / assumed / reported)Verbal suffix (fused with tense)Past + present
TucanoTucanoan5Verbal suffixPast + present
Cuzco QuechuaQuechuan3 (direct / reported / inferential)Enclitic on focused constituentMost assertions
AymaraAymaran3Suffix on verbMost assertions
TurkishTurkic2 (direct -DI / indirect -mIş)Verbal suffixPast tense
BulgarianSlavic (Balkan)2 (indicative / renarrated)Periphrastic moodOften, in narrative
KoreanKoreanic4-5 (direct, reported, inferential, retrospective)Sentence-final endingMany speech levels
Eastern PomoPomoan4 (visual / non-visual / inferred / hearsay)Verbal suffixPast, present
EnglishGermanic0 grammatical (lexical hedges only)Adverbs, "I heard", "reportedly"No

Variants and edge cases

  • Mirativity. A related category marking surprise or new information; the Tibeto-Burman mirative (DeLancey 1997) often shares morphology with non-visual evidentials. Turkish -mIş doubles as a mirative.
  • Egophoric systems. Tibetan and many other Sino-Tibetan languages mark whether the speaker is an "intentional source" of the action — a related but distinct category (Tournadre 2008).
  • Evidential strategies. Languages without dedicated evidential morphology may co-opt other categories — perfect tenses, passives, complement-taking verbs — for evidential-like work. German sollen ("is said to") and wollen ("claims to") are evidential strategies, not grammatical evidentials.
  • Reported speech as quasi-evidential. Direct quotation is universal; what matters for grammatical evidentiality is whether bare assertion can be reportative without a quotative frame.

Common pitfalls in describing evidentiality

  • Confusing evidentiality with epistemic modality. Source is not the same as confidence. A speaker can be highly confident about a reported event (the news is reliable) and uncertain about a witnessed one (it was foggy). Aikhenvald (2004) is firm that the two categories are formally distinct in languages where they are independently expressed.
  • Treating English "reportedly" as evidential. It is an evidential strategy, not a grammatical evidential — it is optional, free-standing, and lexical. Authors who call English an evidential language are using the term loosely.
  • Assuming evidentials always mark uncertainty. The Tariana visual evidential signals high confidence; non-visual and inferred can be associated with lower confidence, but reported is sometimes used precisely because the source is authoritative.
  • Glossing -mIş as "past tense". A common error in Turkish pedagogy. The -DI/-mIş contrast is not primarily temporal; both can be past, but they encode different sources.
  • Ignoring the question paradigm. In Quechua, the evidential in a question marks the source the questioner expects in the answer (conjunct/disjunct alignment). This is a major Aikhenvald-Faller diagnostic.

Frequently asked questions

Is English evidentiality grammatical?

No. English expresses information source lexically — "reportedly", "apparently", "I heard", "it seems" — and these elements are optional. Aikhenvald's typological criterion is that an evidential system is grammatical when marking the source is obligatory in some clause type: in Tariana you cannot tell someone that a man went unless you also tell them whether you saw him go, heard him go, inferred it from tracks, assumed it from habit, or were told. English speakers can simply say "a man went" and leave the source unspecified.

How does Tariana's five-way system work?

Tariana (Arawakan, north-west Amazonia, ~100 speakers) marks evidentiality with a portmanteau suffix that fuses tense and source. For the past tense of "José play football": -ka (visual: I saw him), -mahka (non-visual: I heard the kick or the shouting), -nihka (inferential: I saw the muddy boots), -sika (assumed: he plays every Sunday so probably did), -pidaka (reported: someone told me). Aikhenvald (2003, A Grammar of Tariana) documents that omitting the suffix is ungrammatical for past events, and choosing the wrong suffix is treated as lying.

What is the difference between evidentiality and epistemic modality?

Epistemic modality marks the speaker's degree of certainty ("must", "might", "probably"). Evidentiality marks the source. They overlap because non-firsthand sources usually correlate with lower confidence, but they are formally distinct. Aikhenvald (2004) argues evidentiality is its own grammatical category in languages like Tariana and Tucano; in others (German sollen, English "must have"), the same form does double duty. Diane Boye's Epistemic Meaning (2012) and Frank Palmer's Mood and Modality (2001) cover the boundary cases.

Do speakers of evidential languages think differently?

There is suggestive but limited evidence. Aikhenvald (2004) reports that Tariana speakers are unusually attentive to the source of every report and react with strong moral disapproval to source-mismatches. Anna Papafragou and colleagues (2007) showed Korean-speaking children outperform English-speaking peers on source-monitoring tasks at age 4. Stronger Whorfian claims — that grammatical evidentiality reshapes general epistemic cognition — remain contested; the experimental literature is small.

Which language families have grammatical evidentiality?

It is geographically clustered. Quechuan and Aymaran in the Andes, Tucanoan and Arawakan in the Amazon, Pomoan and many other families of the Pacific Northwest of North America, the Caucasus (Georgian, Lezgian), Turkic and Mongolic, Tibeto-Burman, the Balkan Sprachbund (Bulgarian, Macedonian, Albanian, Turkish), Korean, Japanese, and Iroquoian. Aikhenvald's WALS chapter 77 (Dryer and Haspelmath, 2013) maps about 240 languages with grammatical evidentiality of various depths, roughly one quarter of the surveyed sample.

What happens when you lie in a language with evidentials?

You commit two offences at once — saying something false, and using the wrong evidential. Aikhenvald reports that Tariana speakers consider it more shocking to use a visual evidential for an event you did not witness than to be wrong about whether the event happened. Children acquire the morality of evidential use early; choosing the reported evidential when one actually witnessed something is a common politeness or hedging strategy, while choosing the visual when one did not witness is a serious accusation of dishonesty.