LG CE CLSep 17, 2025

PiERN: Token-Level Routing for Integrating High-Precision Computation and Reasoning

Hengbo Xiao, Jingyuan Fan, Xin Tong, Jingzhao Zhang, Chao Lu, Guannan He

arXiv:2509.18169v24.1h-index: 2

Originality Highly original

AI Analysis

It addresses the inefficiency and scalability issues in multi-agent systems for computation-reasoning tasks, offering a more efficient paradigm for scientific applications.

The paper tackles the problem of integrating high-precision numerical computation into large language models (LLMs) to support complex decision-making tasks, proposing PiERN, which achieves higher accuracy than finetuning LLMs and improves response latency, token usage, and GPU energy consumption compared to multi-agent approaches.

Tasks on complex systems require high-precision numerical computation to support decisions, but current large language models (LLMs) cannot integrate such computations as an intrinsic and interpretable capability with existing architectures. Multi-agent approaches can leverage external experts, but inevitably introduce communication overhead and suffer from inefficiency caused by limited scalability. To this end, we propose Physically-isolated Experts Routing Network (PiERN), an architecture for integrating computation and reasoning. Instead of the tool-use workflows or function-calling, PiERN endogenously integrates computational capabilities into neural networks after separately training experts, a text-to-computation module, and a router. At inference, the router directs computation and reasoning at the token level, thereby enabling iterative alternation within a single chain of thought. We evaluate PiERN on representative linear and nonlinear computation-reasoning tasks against LLM finetuning and the multi-agent system approaches. Results show that the PiERN architecture achieves not only higher accuracy than directly finetuning LLMs but also significant improvements in response latency, token usage, and GPU energy consumption compared with mainstream multi-agent approaches. PiERN offers an efficient, interpretable, and scalable paradigm for interfacing language models with scientific systems.

View on arXiv PDF

Similar