CLAILGOct 13, 2023

KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection

Tencent
arXiv:2310.09044v1153 citationsh-index: 18
Originality Incremental advance
AI Analysis

This addresses the risk of misinformation in LLMs for users relying on accurate text generation, offering a plug-and-play solution to reduce hallucinations without costly fine-tuning.

The paper tackles the hallucination problem in large language models by proposing KCTS, a knowledge-constrained decoding method that uses a knowledge classifier and Monte-Carlo Tree Search to guide text generation, reducing hallucinations in knowledge-grounded dialogue and abstractive summarization without retraining.

Large Language Models (LLMs) have demonstrated remarkable human-level natural language generation capabilities. However, their potential to generate misinformation, often called the hallucination problem, poses a significant risk to their deployment. A common approach to address this issue is to retrieve relevant knowledge and fine-tune the LLM with the knowledge in its input. Unfortunately, this method incurs high training costs and may cause catastrophic forgetting for multi-tasking models. To overcome these limitations, we propose a knowledge-constrained decoding method called KCTS (Knowledge-Constrained Tree Search), which guides a frozen LM to generate text aligned with the reference knowledge at each decoding step using a knowledge classifier score and MCTS (Monte-Carlo Tree Search). To adapt the sequence-level knowledge classifier to token-level guidance, we also propose a novel token-level hallucination detection method called RIPA (Reward Inflection Point Approximation). Our empirical results on knowledge-grounded dialogue and abstractive summarization demonstrate the strength of KCTS as a plug-and-play, model-agnostic decoding method that can effectively reduce hallucinations in natural language generation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes