CLMay 30, 2025

ClueAnchor: Clue-Anchored Knowledge Reasoning Exploration and Optimization for Retrieval-Augmented Generation

Hao Chen, Yukun Yan, Sen Mei, Wanxiang Che, Zhenghao Liu, Qi Shi, Xinze Li, Yuchun Fan, Pengcheng Huang, Qiushi Xiong, Zhiyuan Liu, Maosong Sun

arXiv:2505.24388v23 citationsh-index: 21Has CodeEMNLP

Originality Highly original

AI Analysis

This work addresses the challenge of improving factuality and interpretability in RAG systems for users relying on LLMs, representing an incremental advancement through a novel method for a known bottleneck.

The paper tackles the problem of underutilization of retrieved documents in Retrieval-Augmented Generation (RAG) systems, which leads to poor reasoning, by proposing ClueAnchor, a framework that extracts key clues and optimizes reasoning paths, resulting in significant improvements in reasoning completeness and robustness over prior baselines.

Retrieval-Augmented Generation (RAG) augments Large Language Models (LLMs) with external knowledge to improve factuality. However, existing RAG systems frequently underutilize the retrieved documents, failing to extract and integrate the key clues needed to support faithful and interpretable reasoning, especially in cases where relevant evidence is implicit, scattered, or obscured by noise. To address this issue, we propose ClueAnchor, a novel framework for enhancing RAG via clue-anchored reasoning exploration and optimization. ClueAnchor extracts key clues from retrieved content and generates multiple reasoning paths based on different knowledge configurations, optimizing the model by selecting the most appropriate reasoning path for the given context through reward-based preference optimization. Experiments show that ClueAnchor significantly outperforms prior RAG baselines in the completeness and robustness of reasoning. Further analysis confirms its strong resilience to noisy or partially relevant retrieved content, as well as its capability to identify supporting evidence even in the absence of explicit clue supervision during inference. All codes are available at https://github.com/thunlp/ClueAnchor.

View on arXiv PDF Code

Similar