Are More Tokens Rational? Inference-Time Scaling in Language Models as Adaptive Resource Rationality

arXiv:2602.10329v1 | 0.6 | h-index: 22

Originality: Incremental advance

AI Analysis

This work examines whether resource rationality emerges in AI models, a question relevant to researchers in machine learning and cognitive science. Its contribution to understanding inference-time scaling effects is incremental.

The study investigates whether language models adapt their reasoning strategies to task complexity without explicit rewards tied to computational cost. It finds that both instruction-tuned models and large reasoning models shift from brute-force to analytic approaches as complexity increases, and that large reasoning models remain robust on XOR and XNOR functions, where instruction-tuned models degrade.

Human reasoning is shaped by resource rationality -- optimizing performance under constraints. Recently, inference-time scaling has emerged as a powerful paradigm to improve the reasoning performance of Large Language Models by expanding test-time computation. Specifically, instruction-tuned (IT) models explicitly generate long reasoning steps during inference, whereas Large Reasoning Models (LRMs) are trained by reinforcement learning to discover reasoning paths that maximize accuracy. However, it remains unclear whether resource rationality can emerge from such scaling without an explicit reward tied to computational cost. We introduce a Variable Attribution Task in which models infer which variables determine outcomes given candidate variables, input-output trials, and predefined logical functions. By varying the number of candidate variables and trials, we systematically manipulate task complexity. Both model types exhibit a transition from brute-force to analytic strategies as complexity increases. IT models degrade on XOR and XNOR functions, whereas LRMs remain robust. These findings suggest that models can adjust their reasoning behavior in response to task complexity, even without explicit cost-based reward. They provide compelling evidence that resource rationality is an emergent property of inference-time scaling itself.
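To make the brute-force end of that strategy spectrum concrete, the following is a minimal sketch of exhaustive variable attribution. It is an illustration under stated assumptions, not the paper's task format: the two-variable restriction, the four-function set in `FUNCTIONS`, the helper name `brute_force_attribution`, and the example trials are all hypothetical.

```python
from itertools import combinations

# Assumed function set for illustration; the paper's predefined
# logical functions are not listed in this summary.
FUNCTIONS = {
    "AND": lambda a, b: a and b,
    "OR": lambda a, b: a or b,
    "XOR": lambda a, b: a != b,
    "XNOR": lambda a, b: a == b,
}


def brute_force_attribution(trials, n_vars):
    """Return every (variable pair, function name) consistent with all trials.

    trials: list of (inputs, outcome) pairs, where inputs is a tuple of
    n_vars booleans and outcome is a boolean.
    """
    consistent = []
    # Enumerate every pair of candidate variables and every function,
    # keeping only hypotheses that reproduce every observed outcome.
    for i, j in combinations(range(n_vars), 2):
        for name, fn in FUNCTIONS.items():
            if all(fn(x[i], x[j]) == y for x, y in trials):
                consistent.append(((i, j), name))
    return consistent


# Made-up trials: the outcome is XOR of variables 0 and 2 among 3 candidates.
trials = [
    ((False, False, False), False),
    ((True, False, False), True),
    ((False, True, False), False),
    ((False, False, True), True),
    ((True, False, True), False),
    ((True, True, True), False),
]
result = brute_force_attribution(trials, n_vars=3)
print(result)
```

In this sketch the hypothesis space holds C(n, 2) x 4 candidates, so exhaustive checking grows quadratically with the number of candidate variables n. That growth is one plausible reason a solver, human or model, would move to analytic shortcuts as the task scales.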
