CLLGJun 8, 2024

FacLens: Transferable Probe for Foreseeing Non-Factuality in Fact-Seeking Question Answering of Large Language Models

arXiv:2406.05328v46 citations
Originality Incremental advance
AI Analysis

This addresses the issue of non-factual responses in LLMs for users relying on accurate information, but it is incremental as it builds on prior non-factuality prediction methods.

The paper tackles the problem of predicting non-factual responses in fact-seeking question answering by large language models (LLMs) before generation, proposing FacLens, a lightweight probe that achieves superior effectiveness and efficiency in experiments.

Despite advancements in large language models (LLMs), non-factual responses still persist in fact-seeking question answering. Unlike extensive studies on post-hoc detection of these responses, this work studies non-factuality prediction (NFP), predicting whether an LLM will generate a non-factual response prior to the response generation. Previous NFP methods have shown LLMs' awareness of their knowledge, but they face challenges in terms of efficiency and transferability. In this work, we propose a lightweight model named Factuality Lens (FacLens), which effectively probes hidden representations of fact-seeking questions for the NFP task. Moreover, we discover that hidden question representations sourced from different LLMs exhibit similar NFP patterns, enabling the transferability of FacLens across different LLMs to reduce development costs. Extensive experiments highlight FacLens's superiority in both effectiveness and efficiency.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes