How accurately does a Transformer store integer intervals?

by Ari Holtzman, 4 months ago

I wonder how we might study how LLMs 'count' in the residual stream. Ordinary arithmetic doesn't necessarily get at this, because the notation system lets the model offload capacity onto tokens: for example, 4903283094283 + 473834482378 can largely be solved through heuristics for digit manipulation. So instead, could we look at something like this: assume that the alphabet is a ring; what is N letters after C? Of course, one could convert the letters to numbers and then use digit manipulation. But if we ask the model to answer without thinking tokens, could we look at the hidden states and see counting happen from layer to layer in the residual stream?
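To make the task concrete, here is a minimal sketch of how the ring-alphabet questions and their ground-truth answers might be generated. The function names (`letter_after`, `make_prompt`, `make_dataset`), the exact prompt wording beyond the question itself, and the choice of the 26-letter uppercase English alphabet are all assumptions for illustration, not part of the original idea.

```python
import string

# Assumption: the "ring" is the 26-letter uppercase English alphabet.
ALPHABET = string.ascii_uppercase


def letter_after(start: str, n: int) -> str:
    """Return the letter n positions after `start`, wrapping from Z back to A."""
    idx = ALPHABET.index(start.upper())
    return ALPHABET[(idx + n) % len(ALPHABET)]


def make_prompt(start: str, n: int) -> str:
    """Build one question; the trailing instruction is a hypothetical way
    to discourage intermediate reasoning tokens."""
    return (
        "Assume that the alphabet is a ring. "
        f"What is {n} letters after {start}? Answer with a single letter."
    )


def make_dataset(max_n: int = 30):
    """Return (prompt, n, answer) triples for every start letter and offset 1..max_n."""
    return [
        (make_prompt(start, n), n, letter_after(start, n))
        for start in ALPHABET
        for n in range(1, max_n + 1)
    ]
```

Letting `max_n` exceed 26 includes offsets that wrap all the way around the ring, which might help separate genuine modular counting from memorized short offsets. Each triple keeps `n` alongside the answer so that hidden states collected per layer could later be compared against both the interval and the target letter.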

If you are inspired by this idea, you can reach out to the authors for collaboration or cite it:

@misc{holtzman-how-accurately-does-2026,
  author = {Holtzman, Ari},
  title = {How accurately does a Transformer store integer intervals?},
  year = {2026},
  url = {https://hypogenic.ai/ideahub/idea/0TTNtk6yYKcwu3VC8Y7Q}
}
