CLSep 3, 2024

In Defense of RAG in the Era of Long-Context Language Models

arXiv:2409.01666v154 citationsh-index: 3
Originality Incremental advance
AI Analysis

This work addresses the challenge of maintaining answer quality in long-context applications for users relying on retrieval-augmented systems, presenting an incremental improvement over existing RAG methods.

This paper tackles the problem of diminished focus on relevant information in long-context LLMs, showing that their proposed OP-RAG mechanism improves answer quality with fewer tokens, achieving higher performance at sweet points in an inverted U-shaped curve.

Overcoming the limited context limitations in early-generation LLMs, retrieval-augmented generation (RAG) has been a reliable solution for context-based answer generation in the past. Recently, the emergence of long-context LLMs allows the models to incorporate much longer text sequences, making RAG less attractive. Recent studies show that long-context LLMs significantly outperform RAG in long-context applications. Unlike the existing works favoring the long-context LLM over RAG, we argue that the extremely long context in LLMs suffers from a diminished focus on relevant information and leads to potential degradation in answer quality. This paper revisits the RAG in long-context answer generation. We propose an order-preserve retrieval-augmented generation (OP-RAG) mechanism, which significantly improves the performance of RAG for long-context question-answer applications. With OP-RAG, as the number of retrieved chunks increases, the answer quality initially rises, and then declines, forming an inverted U-shaped curve. There exist sweet points where OP-RAG could achieve higher answer quality with much less tokens than long-context LLM taking the whole context as input. Extensive experiments on public benchmark demonstrate the superiority of our OP-RAG.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes