AIDec 16, 2025

Intention Chain-of-Thought Prompting with Dynamic Routing for Code Generation

Shen Li, Li Huang, Shaoxiong Zhan, Weifeng Sun, Tao Yin, Zhongxin Liu, Meng Yan

arXiv:2512.14048v15.81 citationsh-index: 17

Originality Highly original

AI Analysis

This addresses inefficiencies in code generation for developers and researchers, though it is incremental as it builds on existing prompting methods.

The paper tackles the problem of inefficient and surface-level reasoning in chain-of-thought prompting for code generation by proposing RoutingGen, a dynamic routing framework that adapts prompting strategies based on task difficulty, achieving state-of-the-art performance and reducing token usage by 46.37% on average.

Large language models (LLMs) exhibit strong generative capabilities and have shown great potential in code generation. Existing chain-of-thought (CoT) prompting methods enhance model reasoning by eliciting intermediate steps, but suffer from two major limitations: First, their uniform application tends to induce overthinking on simple tasks. Second, they lack intention abstraction in code generation, such as explicitly modeling core algorithmic design and efficiency, leading models to focus on surface-level structures while neglecting the global problem objective. Inspired by the cognitive economy principle of engaging structured reasoning only when necessary to conserve cognitive resources, we propose RoutingGen, a novel difficulty-aware routing framework that dynamically adapts prompting strategies for code generation. For simple tasks, it adopts few-shot prompting; for more complex ones, it invokes a structured reasoning strategy, termed Intention Chain-of-Thought (ICoT), which we introduce to guide the model in capturing task intention, such as the core algorithmic logic and its time complexity. Experiments across three models and six standard code generation benchmarks show that RoutingGen achieves state-of-the-art performance in most settings, while reducing total token usage by 46.37% on average across settings. Furthermore, ICoT outperforms six existing prompting baselines on challenging benchmarks.

View on arXiv PDF

Similar