AIAug 25, 2025

Language Models Coupled with Metacognition Can Outperform Reasoning Models

arXiv:2508.17959v12 citationsh-index: 33
Originality Incremental advance
AI Analysis

This addresses the problem of balancing speed and accuracy in AI reasoning for tasks requiring strict logic, though it is incremental as it builds on existing SOFAI and metacognition concepts.

The authors tackled the trade-off between fast but less rigorous LLMs and slow but powerful LRMs by developing SOFAI-LM, a metacognitive architecture that coordinates them with iterative feedback, enabling LLMs to match or exceed LRM performance on tasks like graph coloring and code debugging while reducing inference time.

Large language models (LLMs) excel in speed and adaptability across various reasoning tasks, but they often struggle when strict logic or constraint enforcement is required. In contrast, Large Reasoning Models (LRMs) are specifically designed for complex, step-by-step reasoning, although they come with significant computational costs and slower inference times. To address these trade-offs, we employ and generalize the SOFAI (Slow and Fast AI) cognitive architecture into SOFAI-LM, which coordinates a fast LLM with a slower but more powerful LRM through metacognition. The metacognitive module actively monitors the LLM's performance and provides targeted, iterative feedback with relevant examples. This enables the LLM to progressively refine its solutions without requiring the need for additional model fine-tuning. Extensive experiments on graph coloring and code debugging problems demonstrate that our feedback-driven approach significantly enhances the problem-solving capabilities of the LLM. In many instances, it achieves performance levels that match or even exceed those of standalone LRMs while requiring considerably less time. Additionally, when the LLM and feedback mechanism alone are insufficient, we engage the LRM by providing appropriate information collected during the LLM's feedback loop, tailored to the specific characteristics of the problem domain and leads to improved overall performance. Evaluations on two contrasting domains: graph coloring, requiring globally consistent solutions, and code debugging, demanding localized fixes, demonstrate that SOFAI-LM enables LLMs to match or outperform standalone LRMs in accuracy while maintaining significantly lower inference time.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes