CLNENCMar 5, 2025

Three tiers of computation in transformers and in brain architectures

arXiv:2503.04848v21 citationsh-index: 28
Originality Incremental advance
AI Analysis

This work addresses the foundational problem of quantifying computational power in AI and neuroscience, offering a theoretically grounded framework to analyze and expand logic abilities in language models, though it is incremental in building on existing grammar-automata hierarchy.

The paper tackles the problem of understanding computational abilities in transformers and brains by identifying three hierarchical tiers and transitions, showing these correspond to specific capabilities in language models rather than just scaling, and provides a novel benchmark and empirical evaluations of humans and fifteen LMs to explain their abilities and shortfalls.

Human language and logic abilities are computationally quantified within the well-studied grammar-automata hierarchy. We identify three hierarchical tiers and two corresponding transitions and show their correspondence to specific abilities in transformer-based language models (LMs). These emergent abilities have often been described in terms of scaling; we show that it is the transition between tiers, rather than scaled size itself, that determines a system's capabilities. Specifically, humans effortlessly process language yet require critical training to perform arithmetic or logical reasoning tasks; and LMs possess language abilities absent from predecessor systems, yet still struggle with logical processing. We submit a novel benchmark of computational power, provide empirical evaluations of humans and fifteen LMs, and, most significantly, provide a theoretically grounded framework to promote careful thinking about these crucial topics. The resulting principled analyses provide explanatory accounts of the abilities and shortfalls of LMs, and suggest actionable insights into the expansion of their logic abilities.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes