CLAIJul 25, 2023

FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios

CMU
arXiv:2307.13528v2304 citationsh-index: 91Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of factuality in generative AI for users relying on automated text generation, though it is incremental as it builds on existing tool-augmented methods.

The paper tackles the problem of detecting factual errors in text generated by large language models across multiple tasks and domains, proposing FacTool, a framework that shows efficacy in experiments on knowledge-based QA, code generation, mathematical reasoning, and scientific literature review.

The emergence of generative pre-trained models has facilitated the synthesis of high-quality text, but it has also posed challenges in identifying factual errors in the generated text. In particular: (1) A wider range of tasks now face an increasing risk of containing factual errors when handled by generative models. (2) Generated texts tend to be lengthy and lack a clearly defined granularity for individual facts. (3) There is a scarcity of explicit evidence available during the process of fact checking. With the above challenges in mind, in this paper, we propose FacTool, a task and domain agnostic framework for detecting factual errors of texts generated by large language models (e.g., ChatGPT). Experiments on four different tasks (knowledge-based QA, code generation, mathematical reasoning, and scientific literature review) show the efficacy of the proposed method. We release the code of FacTool associated with ChatGPT plugin interface at https://github.com/GAIR-NLP/factool .

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes