LGAICLOct 3, 2023

Language Models Represent Space and Time

arXiv:2310.02207v3327 citationsh-index: 10
Originality Incremental advance
AI Analysis

This addresses the debate on whether LLMs learn superficial statistics or grounded representations, providing evidence for the latter, though it is incremental in confirming existing hypotheses.

The study analyzed spatial and temporal datasets in Llama-2 models, finding that they learn linear representations of space and time, including individual neurons encoding coordinates, suggesting LLMs develop rich spatiotemporal world models.

The capabilities of large language models (LLMs) have sparked debate over whether such systems just learn an enormous collection of superficial statistics or a set of more coherent and grounded representations that reflect the real world. We find evidence for the latter by analyzing the learned representations of three spatial datasets (world, US, NYC places) and three temporal datasets (historical figures, artworks, news headlines) in the Llama-2 family of models. We discover that LLMs learn linear representations of space and time across multiple scales. These representations are robust to prompting variations and unified across different entity types (e.g. cities and landmarks). In addition, we identify individual "space neurons" and "time neurons" that reliably encode spatial and temporal coordinates. While further investigation is needed, our results suggest modern LLMs learn rich spatiotemporal representations of the real world and possess basic ingredients of a world model.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes