CLIROct 22, 2024

Enhancing Answer Attribution for Faithful Text Generation with Large Language Models

arXiv:2410.17112v11 citationsh-index: 10KDIR
Originality Incremental advance
AI Analysis

This work addresses the challenge of making LLM-generated answers more trustworthy by enhancing traceability to sources, though it is incremental in nature.

The paper tackled the problem of answer attribution in Large Language Models by analyzing existing methods and proposing new techniques for claim segmentation and evidence retrieval, which improved performance on attribution components.

The increasing popularity of Large Language Models (LLMs) in recent years has changed the way users interact with and pose questions to AI-based conversational systems. An essential aspect for increasing the trustworthiness of generated LLM answers is the ability to trace the individual claims from responses back to relevant sources that support them, the process known as answer attribution. While recent work has started exploring the task of answer attribution in LLMs, some challenges still remain. In this work, we first perform a case study analyzing the effectiveness of existing answer attribution methods, with a focus on subtasks of answer segmentation and evidence retrieval. Based on the observed shortcomings, we propose new methods for producing more independent and contextualized claims for better retrieval and attribution. The new methods are evaluated and shown to improve the performance of answer attribution components. We end with a discussion and outline of future directions for the task.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes