CLApr 16, 2024

Enhancing Confidence Expression in Large Language Models Through Learning from Past Experience

arXiv:2404.10315v117 citationsh-index: 22
Originality Incremental advance
AI Analysis

This addresses the issue of unreliable confidence in LLMs for users relying on their outputs, though it is an incremental improvement over existing methods.

The paper tackles the problem of large language models (LLMs) generating inaccurate information with high confidence by proposing a method to enhance their confidence expression capability, resulting in improved alignment between expressed confidence and true correctness probabilities across four datasets using Llama models.

Large Language Models (LLMs) have exhibited remarkable performance across various downstream tasks, but they may generate inaccurate or false information with a confident tone. One of the possible solutions is to empower the LLM confidence expression capability, in which the confidence expressed can be well-aligned with the true probability of the generated answer being correct. However, leveraging the intrinsic ability of LLMs or the signals from the output logits of answers proves challenging in accurately capturing the response uncertainty in LLMs. Therefore, drawing inspiration from cognitive diagnostics, we propose a method of Learning from Past experience (LePe) to enhance the capability for confidence expression. Specifically, we first identify three key problems: (1) How to capture the inherent confidence of the LLM? (2) How to teach the LLM to express confidence? (3) How to evaluate the confidence expression of the LLM? Then we devise three stages in LePe to deal with these problems. Besides, to accurately capture the confidence of an LLM when constructing the training data, we design a complete pipeline including question preparation and answer sampling. We also conduct experiments using the Llama family of LLMs to verify the effectiveness of our proposed method on four datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes