https://blog.plover.com/2026/02/12#micro-worlds-2
In
a recent article about John Haugeland's rejection of micro-worlds
I claimed:
as a “Large Language Model”, Claude necessarily includes a model of the world in general
Nobody has objected to this remark, but I would like to expand on it.
The claim may or may not be true — it is an empirical question. But as a
theory it has been widely entertained since the very earliest days of
digital computers. Yehoshua Bar-Hillel, the first person to seriously
investigate machine translation, came to this conclusion in the
1950s. Here's an extract of Haugeland's discussion of his work:
In 1951 Yehoshua Bar-Hillel became the first person to earn a
living from work on machine translation. Nine years later he was
the first to point out the fatal flaw in the whole enterprise, and
therefore to abandon it. Bar-Hillel proposed a simple test sentence:
The box was in the pen.
And, for discussion, he considered only the ambiguity: (1) pen
= a writing instrument; versus (2) pen = a child's play enclosure.
Extraordinary circumstances aside (they only make the problem
harder), any normal English speaker will instantly choose "playpen"
as the right reading. How? By understanding the sentence and
exercising a little common sense. As anybody knows, if one physical
object is in another, then the latter must be the larger; fountain
pens tend to be much smaller than boxes, whereas playpens are plenty
big.
Why not encode these facts (and others like them) right into
the system? Bar-Hillel observes:
What such a suggestion amounts to, if taken seriously, is
the requirement that a translation machine should not only be
supplied with a dictionary but also with a universal
encyclopedia. This is surely utterly chimerical and hardly deserves
any further discussion. (1960, p. 160)
(Artifical Intelligence: The Very Idea; John Haugeland; p.174–176.)
Bar-Hillel says, and I agree, that an accurate model of language
requires an accurate model of the world. In 1960, this appeared
“utterly chimerical”. Perhaps so, but here we are, and 55 years later
we have what most agree is a language model capable of
producing intelligible text complex enough to fool sophisticated
readers. Even people who call the LLM a “stochastic word garbage
spewer” and object when it is called “AI” seem to have no qualms
about the term “large language model”.
The Winograd SHRDLU project was an attempt to construct a world whose
model was small enough to not be utterly chimerical. This worked, but
as Haugeland says, it wasn't actually interesting.
Doug Lenat tried for forty years construct
an explicit world model. When he died in 2023 I think
people still considered the project to have been utterly chimerical.
I may be misrepresenting Bar-Hillel's views. He didn't actually say
“world model”, he said “universal encyclopedia”. Lenat's project's
name, “Cyc”, acknowledges this: “Cyc” is short for “encyclopedia”. And
until recently a universal encyclopedia did seem, to many, to be
utterly chimerical. (Imperial China produced
many interesting attempts.) But we do now in fact
have a universal encyclopedia, and
Claude has ingested that universal encyclopedia.
Let's try it and see
Bar-Hillel thought that the “box is in the pen” example proved that
machine translation would be at the very least extremely difficult: it
requires understanding of the usual relative sizes of boxes and pens,
and that the possibility of such understanding in general “hardly
deserves any further discussion”.
I asked Claude:
Consider the sentence “the box was in the pen”. What does “pen” mean here?
Claude said:
Most likely a playpen or animal pen (enclosure), since a box
wouldn't fit inside a writing pen.
Presumably Bar-Hillel's article and Haugeland's book was in Claude's
training data, and perhaps it somehow absorbed the right thing to say
here without actually knowing anything about boxes or pens. I tried
the next similar example that came to mind:
How about “suit” in “what suit was the king”?
Playing card suit - hearts, diamonds, clubs, or spades.
Why?
The context "what suit was the king" is the standard phrasing for
identifying a playing card. A king is one of the face cards in
each suit, so asking about its suit is a natural card game
question.
Okay, that's just what I would have said.
Why couldn't it be a human king wearing a suit?
It could be - I jumped to the card interpretation because "what
suit was the king" uses "suit" as a predicate noun, which is
uncommon phrasing for clothing. You'd more naturally say "what
suit was the king wearing" or "what kind of suit did the king
have." But without more context, the clothing interpretation is
perfectly plausible.
I think Claude is wrong about the grammar here. I'm not sure what
“predicate noun” means and I suspect Claude is using the term
incorrectly. I can't understand “what suit was the king?” as
grammatical in any plausible construction, only in highly contrived
situtations such as an evil wizard transforming the king into a suit
of armor and hiding him in the armory.
But if one agrees to take it to mean “what suit does the king have” I
agree that this is a perfectly plausible interpretation even though
there isn't anything clearly clothing-related nearby, and if one
assumes it meant “what suit was the king wearing” then the clothing
interpretation is unavoidable.
Could it be a lawsuit?
Yes - "what suit was the king" could mean what lawsuit the king
was involved in. That reading makes grammatical sense, though it
would be more natural with "in" ("what suit was the king
in"). Context would be needed to distinguish between cards,
clothing, and lawsuits.
Again I disagree that this makes grammatical sense. Without “in” I
think a native listener would be at least puzzled. (“Do you mean ‘what
suit was the king involved in?’?”)
Could it be a misspelling for “soot”?
Yes, that's plausible - "what soot was the king" could be asking
what type of soot the king was covered in or exposed to. Though it's
an odd question without specific context like a fire or industrial setting.
In my opinion, it is not grammatically plausible.
However, I think focusing on the grammatical errors would missing
the point. The “soot” interpretation is unlikely, I think the bad
grammar rules it out, and Claude's response does not seem to
recognize this. But Claude does express understanding that that
the interpretation is unikely without some surrounding soot-related
context, and a reasonable idea of what that context would need to be
like.
What if it were “what color is the king”?
If it's "what color is the king," then the card suit
interpretation becomes much less likely (suits aren't typically
described as colors, though red/black could work). It's more
likely asking about a physical object - a chess piece, a game
token, a statue, or similar - where the king's color is relevant.
I had been thinking of “color” as referring to the king's skin color,
but I agree that the chess piece or game token interpretation is more
plausible. Again Claude seems to understand something about how we
talk about things. If the king were a playing card, we might be
asking if it is a red king or a black king. But without that context
it's now more likely that the king is a chess piece than a playing
card.
What if the sentence was known to have been translated from German?
That would strengthen the card suit interpretation
significantly. German card suits (Herz, Karo, Pik, Kreuz) are
often called "Farbe" (color) in German card game terminology, so
"what color is the king" could be a literal translation of a
question about which suit a king card belongs to.
That's just what I would have said.
Could “suit” have been an accidental misspelling of “age”?
On a standard keyboard, that seems unlikely - 'a', 'g', 'e' are
nowhere near 's', 'u', 'i', 't'. Autocorrect or speech-to-text
could produce it, but not a typical typing error.
I thought Claude might botch this. There is no grammatical or
semantic reason why “what age was the king” doesn't work, so the only
reason to reject this is orthographic. And Claude's grasp of spelling
still seems quite weak. The other day it claimed that “‘Warner’ is
likely either a typo for ‘Cham’ or …” which couldn't be wronger. I
told it at the time that it is impossible that "Cham" could be a typo
for “Warner”, and if pressed to explain why, I would have said
something like what Claude said here.
Tentative conclusions
Interesting as this all is, it is a digression. My main points, again:
It is at least plausible that coherent speech requires a model of a
large fraction of the world, and, while it may yet turn out to be
false, this theory has been seen as plausible for generations.
Whatever else Claude can or can't do, it can certainly speak
coherently.
Therefore Claude probably does have something like a model of a
substantial part of the world.
In 1960 this appeared completely impossible.
But here we are.
Addendum
20260214
I disagreed with Claude that “what suit was the king” made grammatical
sense. Rik Signes has pointed out that it it is certainly
grammatical, because the grammar is the same as “what person was the
king” or “what visitor was the king”. My discomfort with it is not
grammatical, it is pragmatic.
https://blog.plover.com/2026/02/12#micro-worlds-2