CLAIIR · Mar 2, 2025

Optimizing Multi-Hop Document Retrieval Through Intermediate Representations

arXiv:2503.04796v2 · 4 citations · h-index: 2 · Has Code · ACL
Originality: Incremental advance
AI Analysis

This paper addresses the high computational cost of multi-hop question answering in AI systems, offering an incremental improvement over prior RAG methods.

The paper tackles the computational expense of multi-hop document retrieval in retrieval-augmented generation (RAG). It proposes Layer-wise RAG (L-RAG), which uses intermediate-layer representations to achieve performance comparable to multi-step methods while keeping inference overhead similar to that of standard RAG. The authors report that L-RAG outperforms existing RAG methods on datasets including MuSiQue, HotpotQA, and 2WikiMultiHopQA.

Retrieval-augmented generation (RAG) encounters challenges when addressing complex queries, particularly multi-hop questions. While several methods tackle multi-hop queries by iteratively generating internal queries and retrieving external documents, these approaches are computationally expensive. In this paper, we identify a three-stage information processing pattern in LLMs during layer-by-layer reasoning, consisting of extraction, processing, and subsequent extraction steps. This observation suggests that the representations in intermediate layers contain richer information compared to those in other layers. Building on this insight, we propose Layer-wise RAG (L-RAG). Unlike prior methods that focus on generating new internal queries, L-RAG leverages intermediate representations from the middle layers, which capture next-hop information, to retrieve external knowledge. L-RAG achieves performance comparable to multi-step approaches while maintaining inference overhead similar to that of standard RAG. Experimental results show that L-RAG outperforms existing RAG methods on open-domain multi-hop question-answering datasets, including MuSiQue, HotpotQA, and 2WikiMultiHopQA. The code is available at https://anonymous.4open.science/r/L-RAG-ADD5/
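
The core idea in the abstract, retrieving with a middle-layer representation instead of a newly generated query, can be illustrated with a toy sketch. This is not the authors' implementation. The function names (`cosine`, `middle_layer_query`, `retrieve`), the hand-written vectors, the choice of the exact middle layer, and the assumption that hidden states and document embeddings already live in the same vector space are all hypothetical and serve only to show the shape of the computation.

```python
import math
from typing import List, Optional, Sequence, Tuple

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def middle_layer_query(layer_states: Sequence[Vector],
                       layer_index: Optional[int] = None) -> Vector:
    """Select one layer's representation to act as the retrieval query.

    Assumption: when no index is given, the middle of the stack is used.
    Which layer works best in practice is an empirical question.
    """
    if not layer_states:
        raise ValueError("layer_states must not be empty")
    if layer_index is None:
        layer_index = len(layer_states) // 2
    return layer_states[layer_index]


def retrieve(query: Vector, doc_embeddings: Sequence[Vector],
             k: int = 2) -> List[Tuple[int, float]]:
    """Return the k (document index, score) pairs with highest cosine score."""
    scored = [(i, cosine(query, d)) for i, d in enumerate(doc_embeddings)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy per-layer states for one query (5 layers, 3 dimensions), made up by hand.
layers = [
    [1.0, 0.0, 0.0],
    [0.8, 0.2, 0.0],
    [0.1, 0.9, 0.0],  # middle layer (index 2)
    [0.0, 0.5, 0.5],
    [0.0, 0.1, 0.9],
]
# Toy document embeddings, also made up by hand.
docs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

# A single forward pass supplies the query vector; no extra generation step.
print(retrieve(middle_layer_query(layers), docs, k=1))
```

In this toy data the middle-layer vector ranks document 1 first, while the final-layer vector would rank document 2 first, which shows how the choice of layer changes what is retrieved. In a real system the hidden states would come from an LLM forward pass and would likely need a learned mapping into the retriever's embedding space.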

