CLFeb 16, 2025

RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation

Amazon
arXiv:2502.10996v214 citationsh-index: 14Has Code
Originality Highly original
AI Analysis

This addresses the issue of brittle reasoning in retrieval-augmented generation for users of LLMs on knowledge-intensive tasks, representing a novel method for a known bottleneck.

The paper tackled the problem of LLMs struggling with multi-step reasoning due to unstructured retrieved context in knowledge-intensive tasks, and proposed RAS, a framework that dynamically constructs query-specific knowledge graphs, achieving up to 7.0% gains on benchmarks.

Large language models (LLMs) have achieved impressive performance on knowledge-intensive tasks, yet they often struggle with multi-step reasoning due to the unstructured nature of retrieved context. While retrieval-augmented generation (RAG) methods provide external information, the lack of explicit organization among retrieved passages limits their effectiveness, leading to brittle reasoning pathways. Recent interpretability studies highlighting the importance of structured intermediate reasoning further align with this perspective. We propose Retrieval-And-Structuring (RAS), a framework that dynamically constructs query-specific knowledge graphs through iterative retrieval and structured knowledge building. RAS interleaves targeted retrieval planning with incremental graph construction, enabling models to assemble and reason over evolving knowledge structures tailored to each query. On seven knowledge-intensive benchmarks, RAS consistently outperforms strong baselines, achieving up to 6.4% and 7.0% gains with open-source and proprietary LLMs, respectively. Our results demonstrate that dynamic, query-specific knowledge structuring offers a robust path to improving reasoning accuracy and robustness in language model generation. Our data and code can be found at https://github.com/pat-jj/RAS.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes