AICLJul 13, 2025

Towards Concise and Adaptive Thinking in Large Reasoning Models: A Survey

arXiv:2507.09662v117 citationsh-index: 1
Originality Synthesis-oriented
AI Analysis

This is an incremental survey that helps researchers understand and advance methods for making large reasoning models more efficient in real-world applications.

The survey addresses the problem of large reasoning models generating unnecessarily lengthy reasoning chains for trivial questions, which wastes inference resources and increases response time, and it provides an overview of recent progress in concise and adaptive thinking methods to improve efficiency.

Large reasoning models (LRMs) like OpenAI o1 and DeepSeek R1 have demonstrated impressive performance on complex reasoning tasks like mathematics and programming with long Chain-of-Thought (CoT) reasoning sequences (slow-thinking), compared with traditional large language models (fast-thinking). However, these reasoning models also face a huge challenge that generating unnecessarily lengthy and redundant reasoning chains even for trivial questions. This phenomenon leads to a significant waste of inference resources, increases the response time for simple queries, and hinders the practical application of LRMs in real-world products. To this end, it is crucial to shorten lengthy reasoning chains and learn adaptive reasoning between fast and slow thinking based on input difficulty. In this survey, we provide a comprehensive overview of recent progress in concise and adaptive thinking for efficient reasoning of LRMs, including methodologies, benchmarks, and challenges for future exploration. We hope this survey can help researchers quickly understand the landscape of this field and inspire novel adaptive thinking ideas to facilitate better usage of LRMs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes