IRAIMay 1

"I Don't Know" -- Towards Appropriate Trust with Certainty-Aware Retrieval Augmented Generation

arXiv:2605.0095761.0h-index: 11
Predicted impact top 56% in IR · last 90 daysOriginality Incremental advance
AI Analysis

For users of LLMs, this work addresses the problem of overconfident and hallucinated responses by enabling systems to express uncertainty, thereby fostering appropriate trust.

The paper proposes CERTA, a certainty-aware RAG system that reflects uncertainty in answers to build appropriate user trust, and introduces a benchmark of 90 non-objective question-context pairs. Experiments show CERTA reduces over-agreement and improves cautious behavior in moral judgments compared to baseline RAG.

Achieving the right amount of trust in AI systems is important, but challenging. The problem is exacerbated with the rise of Large Language Models (LLMs) as they provide human-level communication capabilities, but potentially hallucinate in the content that they generate. Moreover, they express over-confidence in their answers, making it difficult for users to judge their truthfulness. An important human value that users seek is benevolence, which can be met by LLM's self-reflection leading to reliable and honest answers. Accordingly, this paper proposes conveying appropriate levels of self-reflected certainty to build appropriate trust. Our contributions are twofold: 1) We develop CERTA (Certainty Enhanced RAG for Trustworthy Answers), a specialized Retrieval Augmented Generation (RAG) system that incorporates the relevance between question, context, and answer to reflect its uncertainty in answering questions; 2) We create the Certainty Benchmark with 90 question-context pairs of non-objective questions, divided over four categories (factuality, preference, sycophancy, morality) and three types of contexts (relevant, incomplete, irrelevant). We run experiments with a baseline RAG system and three CERTA settings using two LLMs. Our evaluations indicate that CERTA helps identify answers that are uncertain, decreases the cases of over-agreeing, and provides cautious behavior when prompted for moral judgments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes