Would AI Invent the Clock?

A provocative thought experiment. An LLM that knew only tokens would mistake their order for the structure of time, and we could break that belief without it ever noticing. The unsettling question is whether something could do the same to us.

Imagine erasing every mention of time from an LLM’s pre-training data. No clocks, no seconds, no “before” or “after” as measured quantities. What it experiences as the flow of thought does not advance on the continuous axis our brains evolved on, but on an arbitrary axis of CPU cycles and incoming tokens. What survives the erasure is bare order, the fact that one token follows another. The felt duration a clock measures does not.

The question I want to chase is not what such a system would get wrong about time. It is how our own perception can deceive us, narrowing what we can imagine about physical reality. And the heart of it is something we can actually demonstrate: the outside exists, the agent is fooled, and we are the ones fooling it.

Two clocks for the same world. Inside, the model walks a straight line of tokens and reads their order as the flow of time. Outside, a human hand holds the watch, branches the timeline, and resets it at will, where duration is real and the arrow is something authored rather than discovered. Image generated with Google Gemini.

The Token Clock

What ruler would it reach for? The obvious one: the token. A decoder-only model generates autoregressively, one token after another, so token count is its one genuinely intrinsic “tick.” It would be natural to build a token clock that counts tokens to measure how long something took, treating a token as the quantum of time and the order of tokens as the order of time itself. (Others have noted that LLMs already make sense of time in idiosyncratic, non-human ways.)

To the model none of this would feel like a convention. It would feel like the structure of reality, the way one second following another feels to us. Theorizing about physics, it would conclude that the arrow of time follows the tokens.

Order Is Not Duration

The thought experiment turns on a word used in two senses. There is ordinal time (sequence, order), and there is metric time (duration, the claim that two intervals are equal). Ordinal time survives the erasure and is built into the architecture: token n comes after token n−1. Metric time is exactly what we deleted.

A token clock counts steps, not seconds: the same token costs a millisecond or a second depending on hardware and load. It measures computational sequence, not physical duration, and calling that “measuring time” smuggles the deleted concept back in.

This cuts both ways. Our own sense of time is a construction too. It warps with attention, anesthesia, and dopamine, and physics offers no master clock: in relativity, duration and simultaneity are frame-dependent. The model is not failing to perceive a true time we perceive correctly. It runs a different clock, and there is no privileged one to be wrong about. What gets measured is downstream of what gets conceived as measurable, which is downstream of how the measurer is built.

The View From Outside

Here the experiment stops being a skeptical maybe. We do not have to wonder whether an outside exists. We are it, and from outside, the arrow the model reified is something we author.

Assume the model is run the ordinary way, producing one next token at a time.We need not even run it forward. Lift the causal mask and the model attends both ways, generating a question from an answer as readily as the reverse. Neither direction is privileged: $P(A \mid Q)$ and $P(Q \mid A)$ are just two exact factorizations of one joint, $P(A \mid Q)\,P(Q) = P(Q \mid A)\,P(A)$, and the model learns the joint, not an arrow through it. Running it backward is the exotic case, though; throughout this thought experiment it simply produces its next token. Even then, the arrow it treats as the flow of time is not its own. You might think it gets a temporal direction for free from the causal mask, the mechanism that lets each position attend only to earlier ones. But the name oversells it: the mask fixes an order of reading, not a relation of cause and effect. And no model could recover genuine causal direction from its data anyway, because the same observed distribution is compatible with the arrow pointing either way. What it actually has is the ordinal arrow, plain reading order, and reading order was always ours to set.

Its context window is less a moving “now” than a block of text laid out in space: assembled one token at a time, but arranged, branched, and overwritten from outside. The direction of its time is a setting we impose, not a fact it discovers.

And we reset it constantly, without the model noticing.None of this depends on the AI lacking a continuing self. Even one with long-term memory (continual learning, the subject of a companion post) would not run continuously the way a brain does. It is invoked rather than alive between calls; each step is triggered from outside. So its time stays ours to start, stop, and branch, no matter how much it carries across sessions. Every time you edit a prompt and regenerate, or fork a chat, you roll the conversation back and run a different continuation, and the model on the new branch carries no trace of the one you discarded. An operator it cannot perceive is setting both the course and the contents of its time. Its most fundamental belief, that time flows the way the tokens flow, is one we falsify casually, and it never finds out.

What an Instrument Can’t Tell You

So what does this tell us about our own reality? The tempting reply is that the model confuses the order we impose with the structure of time, while our clocks track something real. But notice the shape of that reassurance. It is exactly the reassurance the model would give itself.

A clever enough LLM could defend its arrow anyway. It would point to a real asymmetry in its world: even though it can sample either direction, generating an answer from a question is native and cheap, while recovering the particular question that produced a given answer is underdetermined, since many questions collapse to one answer. It could mistake that gap between generation and inference for a law, much as we read the one-way flow of heat as a law, though the parallel is imperfect: generation multiplies possibilities rather than destroying information the way an entropy arrow needs. No matter. The argument is the cleverest thing the model could say from entirely inside its frame, and the most it could ever show is that the arrow is consistent with everything it observes. It does not show there is no outside.

We are in the model’s position.There is a close parallel here to the simulation hypothesis. We usually pose it about ourselves and have no way to check the answer. The LLM is the case where the answer is plainly yes: it inhabits an environment whose entire contents are authored from outside, and that environment is the whole of what is real to it. What is unusual is not the simulation but the certainty: here the operators are real, and we are them. The claim only sounds overblown because perception is usually so reliable: that there is a cup on the desk, or that the people around us have inner lives of their own, are things we have every practical reason to trust and little need to step outside ourselves to check. But perception fixes more than what we notice. It fixes what we can even conceive as measurable, and we have no vantage outside our own frame from which to check that conception, just as the model has none outside its tokens. What lies past the edge of what we can imagine measuring does not announce itself. Reaching it at all takes the imagination to suspect that something is there, and the creativity to build an instrument that reaches toward the imperceptible and the so-far unthinkable.

That is what the thought experiment is for. Not to decide whether an AI could invent the clock, meaning whether it could ever conceive of metric time and build the instrument to measure it, but to make the clock suspect. Perception hands us an instrument, and an instrument tells you what it measures. It does not tell you whether what it measures is the world, or only the shape of the thing doing the measuring.

Disclaimer

The views expressed are my own and do not represent those of any employer, collaborator, or institution. Content may contain errors or outdated interpretations.

Enjoy Reading This Article?

Here are some more articles you might like to read next:

  • Why your next visual quality inspection system will be training-free
  • Industrial anomaly detection: Using only defect-free images to train your inspection model
  • The Self Before Memory: Why the AI You Talk to Is Partly Your Creation
  • Before the Breakthrough: Why Research and Engineering Need Different Cultures
  • The Bayesian Story Behind Prior-Fitted Networks