LG CLMar 27

A Formal Framework for Uncertainty Analysis of Text Generation with Large Language Models

arXiv:2603.2636383.2h-index: 21

AI Analysis

This work provides a foundational framework for analyzing uncertainty in LLMs, which is crucial for improving reliability and interpretability in AI applications.

The authors tackled the problem of measuring uncertainty in text generation by Large Language Models by introducing a formal framework that models prompting, generation, and interpretation as interconnected autoregressive processes, showing how existing methods relate and identifying unexplored aspects of uncertainty.

The generation of texts using Large Language Models (LLMs) is inherently uncertain, with sources of uncertainty being not only the generation of texts, but also the prompt used and the downstream interpretation. Within this work, we provide a formal framework for the measurement of uncertainty that takes these different aspects into account. Our framework models prompting, generation, and interpretation as interconnected autoregressive processes that can be combined into a single sampling tree. We introduce filters and objective functions to describe how different aspects of uncertainty can be expressed over the sampling tree and demonstrate how to express existing approaches towards uncertainty through these functions. With our framework we show not only how different methods are formally related and can be reduced to a common core, but also point out additional aspects of uncertainty that have not yet been studied.

View on arXiv PDF

Similar