Question 1

What was the original procedure?

Accepted Answer

A child sat alone in a quiet room with a single marshmallow (or pretzel, or cookie). The experimenter said: "I'll be back in fifteen minutes. If the marshmallow is still here, you can have two. If not, you can eat this one now." A bell let the child summon the experimenter early. About a third of children waited the full fifteen minutes. The dependent measure was wait time, with strategies — covering the eyes, singing, distracting attention — recorded.

Question 2

What did the longitudinal follow-ups find?

Accepted Answer

Mischel and Shoda's 1988 and 1990 follow-ups linked preschool wait time to adolescent SAT scores (correlation around 0.4 in the high-status sample), parent-rated competence, and resistance to temptation. Casey et al. (2011) scanned now-adult participants and found ventromedial prefrontal differences correlated with original wait time. The study became a touchstone for grit and self-control narratives in popular psychology.

Question 3

How does the 2018 replication change the picture?

Accepted Answer

Watts, Duncan, and Quan (2018) used the Eunice Kennedy Shriver National Institute of Child Health and Human Development data — 918 children sampled across SES strata. They replicated a smaller predictive correlation (r ≈ 0.28 unadjusted, dropping to r ≈ 0.10 after controlling for family income, parent education, and home environment). The wait was still informative but explained much less unique variance than the Stanford sample suggested.

Question 4

Was the Stanford sample biased?

Accepted Answer

Yes — overwhelmingly children of Stanford faculty and staff. Mischel acknowledged the sample's homogeneity. The original cohort grew up with stable food access, educated parents, and reliable caregivers. In such environments, willpower may be the rate-limiting factor for life outcomes. In samples where adults are unreliable or food is scarce, eating the marshmallow immediately can be the rational choice.

Question 5

What's the trust-environment interpretation?

Accepted Answer

Kidd, Palmeri, and Aslin (2013) primed children with reliable or unreliable adults before the test. Children in the unreliable condition waited a third as long. The result implies wait time partly measures whether the child trusts the experimenter to deliver the second marshmallow. Children from low-trust environments rationally take the certain first treat — a strategy, not a deficit.

Question 6

What strategies helped children wait?

Accepted Answer

Mischel and Mischel (1983) showed cognitive restructuring matters most. Children told to think of the marshmallow as a "puffy cloud" waited longer than those told to think of its taste. Distraction (singing, hiding the eyes) outperformed willpower. Mischel called this "cooling" — converting hot, tempting stimuli into cool, abstract representations. The technique generalizes to adult self-control interventions.

Question 7

Should parents train delay of gratification?

Accepted Answer

The training case is weaker after the 2018 replication. Predictive power comes partly from underlying SES and trust. Direct interventions to improve self-control (Diamond's tools-of-the-mind curriculum, executive function training) show modest, often non-transferring gains. Building reliable, predictable home environments and teaching specific cooling strategies is better supported than generic willpower drills.

Stanford Marshmallow Experiment

Interactive visualization

Watch the 60-second explainer

Why the marshmallow studies matter

Common misconceptions

Frequently asked questions

Interactive visualization

Watch the 60-second explainer

Why the marshmallow studies matter

Common misconceptions

Frequently asked questions

Related concepts