LGAICLMar 13, 2025

Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More

arXiv:2503.10542v34 citationsh-index: 5ACL
Originality Incremental advance
AI Analysis

This addresses a fundamental limitation in language models for graph-based reasoning, with implications for improving supervision in next-token prediction tasks.

The paper tackles the path-star task, a minimal graph search problem where decoder-only language models fail beyond random chance due to a learned shortcut from excess supervision, and demonstrates solutions showing the task is solvable, revealing insights into LM training pathologies.

This work concerns the path-star task, a minimal example of searching over a graph. The graph, $G$, is star-shaped with $D$ arms radiating from a start node, $s$. A language model (LM) is given $G$, $s$, and a target node $t$, which ends one of the arms and is tasked with generating the arm containing $t$. The minimal nature of this task means only a single choice needs to be made: which of the $D$ arms contains $t$? Decoder-only LMs fail to solve this elementary task above $1/D$ chance due to a learned shortcut that absorbs training supervision. We show how this pathology is caused by excess supervision and we present a series of solutions demonstrating that the task is solvable via decoder-only LMs. We find that the task's minimal nature causes its difficulty, as it prevents task decomposition. Our solutions provide insight into the pathology and its implications for LMs trained via next-token prediction.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes