CL · AI · Jun 20, 2024

QPaug: Question and Passage Augmentation for Open-Domain Question Answering of LLMs

arXiv:2406.14277v2 · 24 citations · Has Code
Originality: Incremental advance
AI Analysis

The work addresses retrieval failures in open-domain QA with LLMs, but the advance is incremental because it builds on existing RAG approaches.

Ambiguous or complex questions degrade retrieval quality in open-domain question answering. The paper proposes QPaug, which augments questions with LLM-generated sub-questions and augments retrieved passages with self-generated passages. The authors report that it outperforms previous state-of-the-art methods with significant performance gains.

Retrieval-augmented generation (RAG) has received much attention for open-domain question-answering (ODQA) tasks as a means to compensate for the limited parametric knowledge of large language models (LLMs). While previous approaches focused on processing retrieved passages to remove irrelevant context, they still rely heavily on the quality of the retrieved passages, which can degrade if the question is ambiguous or complex. In this paper, we propose a simple yet efficient method called question and passage augmentation (QPaug) via LLMs for open-domain QA. QPaug first decomposes the original question into multi-step sub-questions. By augmenting the original question with detailed sub-questions and planning, we make the query more specific about what needs to be retrieved, improving retrieval performance. In addition, to compensate for cases where the retrieved passages contain distracting information or divided opinions, we augment the retrieved passages with passages self-generated by LLMs to guide answer extraction. Experimental results show that QPaug outperforms the previous state of the art and achieves significant performance gains over existing RAG methods. The source code is available at https://github.com/kmswin1/QPaug.
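The two augmentation steps described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names, the prompt wording, and the toy word-overlap retriever are all assumptions made for this sketch, and `llm` stands for any user-supplied function that maps a prompt string to a completion string. The repository linked in the abstract is the reference for the actual method.

```python
import re
from typing import Callable, List

# Hypothetical type: any function mapping a prompt string to a completion string.
LLM = Callable[[str], str]


def decompose(question: str, llm: LLM) -> List[str]:
    """Question augmentation, step 1: ask the LLM for sub-questions, one per line."""
    reply = llm(
        "Break the question into step-by-step sub-questions, one per line.\n"
        f"Question: {question}"
    )
    return [line.strip() for line in reply.splitlines() if line.strip()]


def augment_question(question: str, sub_questions: List[str]) -> str:
    """Question augmentation, step 2: append the sub-questions to the original query."""
    return " ".join([question, *sub_questions])


def _tokens(text: str) -> set:
    """Lowercased word tokens, punctuation dropped."""
    return set(re.findall(r"\w+", text.lower()))


def retrieve(query: str, corpus: List[str], k: int = 2) -> List[str]:
    """Toy stand-in for a real retriever: rank passages by shared word count."""
    query_tokens = _tokens(query)
    ranked = sorted(
        corpus,
        key=lambda passage: len(query_tokens & _tokens(passage)),
        reverse=True,
    )
    return ranked[:k]


def build_answer_prompt(question: str, retrieved: List[str], generated: str) -> str:
    """Passage augmentation: put the self-generated passage next to the retrieved ones."""
    passages = "\n".join(f"- {p}" for p in [*retrieved, generated])
    return (
        "Answer the question using the passages below.\n"
        f"Passages:\n{passages}\n"
        f"Question: {question}\nAnswer:"
    )


def qpaug_answer(question: str, corpus: List[str], llm: LLM, k: int = 2) -> str:
    """End-to-end sketch: augment the question, retrieve, augment the passages, answer."""
    sub_questions = decompose(question, llm)
    query = augment_question(question, sub_questions)
    retrieved = retrieve(query, corpus, k)
    generated = llm(
        "Write a short passage that answers the question.\n"
        f"Question: {question}"
    )
    return llm(build_answer_prompt(question, retrieved, generated))
```

To try it, pass any callable as `llm` (for example, a thin wrapper around a chat-completion client) together with a list of passage strings as `corpus`. Swapping `retrieve` for a dense or BM25 retriever leaves the rest of the pipeline unchanged.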

Code Implementations: 1 repo
