CLAIApr 12

TInR: Exploring Tool-Internalized Reasoning in Large Language Models

arXiv:2604.1078895.0h-index: 24
Predicted impact top 12% in CL · last 90 daysOriginality Incremental advance
AI Analysis

For LLM researchers, this work addresses inefficiencies and limitations of external tool integration by proposing a unified internalized reasoning framework.

TInR-U internalizes tool knowledge into LLMs to avoid reliance on external tool documentation during reasoning, achieving superior performance in both in-domain and out-of-domain settings.

Tool-Integrated Reasoning (TIR) has emerged as a promising direction by extending Large Language Models' (LLMs) capabilities with external tools during reasoning. Existing TIR methods typically rely on external tool documentation during reasoning. However, this leads to tool mastery difficulty, tool size constraints, and inference inefficiency. To mitigate these issues, we explore Tool-Internalized Reasoning (TInR), aiming at facilitating reasoning with tool knowledge internalized into LLMs. Achieving this goal presents notable requirements, including tool internalization and tool-reasoning coordination. To address them, we propose TInR-U, a tool-internalized reasoning framework for unified reasoning and tool usage. TInR-U is trained through a three-phase pipeline: 1) tool internalization with a bidirectional knowledge alignment strategy; 2) supervised fine-tuning warm-up using high-quality reasoning annotations, and 3) reinforcement learning with TInR-specific rewards. We comprehensively evaluate our method across in-domain and out-of-domain settings. Experiment results show that TInR-U achieves superior performance in both settings, highlighting its effectiveness and efficiency.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes