CL IR

Oct 22, 2024

Enhancing Answer Attribution for Faithful Text Generation with Large Language Models

Juraj Vladika, Luca Mülln, Florian Matthes

arXiv:2410.17112v1

1.91 citations

h-index: 10

KDIR

Originality: Incremental advance

AI Analysis

This work addresses the challenge of making LLM-generated answers more trustworthy by enhancing traceability to sources, though it is incremental in nature.

The paper tackles the problem of answer attribution in Large Language Models by analyzing existing methods and proposing new techniques for claim segmentation and evidence retrieval, which improve performance on attribution components.

The increasing popularity of Large Language Models (LLMs) in recent years has changed the way users interact with and pose questions to AI-based conversational systems. An essential aspect for increasing the trustworthiness of generated LLM answers is the ability to trace the individual claims from responses back to relevant sources that support them, the process known as answer attribution. While recent work has started exploring the task of answer attribution in LLMs, some challenges still remain. In this work, we first perform a case study analyzing the effectiveness of existing answer attribution methods, with a focus on subtasks of answer segmentation and evidence retrieval. Based on the observed shortcomings, we propose new methods for producing more independent and contextualized claims for better retrieval and attribution. The new methods are evaluated and shown to improve the performance of answer attribution components. We end with a discussion and outline of future directions for the task.
