CLAIJul 7, 2023

TRAQ: Trustworthy Retrieval Augmented Question Answering via Conformal Prediction

Georgia Tech
arXiv:2307.04642v239 citationsh-index: 37Has Code
Originality Incremental advance
AI Analysis

This addresses the issue of unreliable responses in open-domain QA for users needing trustworthy AI systems, though it is incremental as it builds on existing RAG and conformal prediction methods.

The paper tackles the problem of hallucinations in retrieval-augmented question answering by proposing TRAQ, which provides the first end-to-end statistical correctness guarantee using conformal prediction, reducing prediction set size by 16.2% on average compared to an ablation.

When applied to open-domain question answering, large language models (LLMs) frequently generate incorrect responses based on made-up facts, which are called $\textit{hallucinations}$. Retrieval augmented generation (RAG) is a promising strategy to avoid hallucinations, but it does not provide guarantees on its correctness. To address this challenge, we propose the Trustworthy Retrieval Augmented Question Answering, or $\textit{TRAQ}$, which provides the first end-to-end statistical correctness guarantee for RAG. TRAQ uses conformal prediction, a statistical technique for constructing prediction sets that are guaranteed to contain the semantically correct response with high probability. Additionally, TRAQ leverages Bayesian optimization to minimize the size of the constructed sets. In an extensive experimental evaluation, we demonstrate that TRAQ provides the desired correctness guarantee while reducing prediction set size by 16.2% on average compared to an ablation. The implementation is available at $\href{https://github.com/shuoli90/TRAQ.git}{TRAQ}$.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes