LGAIMar 3

Eliciting Numerical Predictive Distributions of LLMs Without Autoregression

arXiv:2603.02913v12 citationsh-index: 74
Originality Highly original
AI Analysis

This work addresses the problem of efficient uncertainty estimation in numerical regression tasks for users of Large Language Models, providing an incremental step towards more efficient and accurate predictions.

This work investigates whether Large Language Models (LLMs) can provide predictive distributions over numerical targets without autoregressive decoding, with results suggesting that LLM embeddings carry informative signals about summary statistics of their predictive distributions. The approach shows potential for lightweight alternatives to sampling-based approaches for uncertainty-aware numerical predictions.

Large Language Models (LLMs) have recently been successfully applied to regression tasks -- such as time series forecasting and tabular prediction -- by leveraging their in-context learning abilities. However, their autoregressive decoding process may be ill-suited to continuous-valued outputs, where obtaining predictive distributions over numerical targets requires repeated sampling, leading to high computational cost and inference time. In this work, we investigate whether distributional properties of LLM predictions can be recovered without explicit autoregressive generation. To this end, we study a set of regression probes trained to predict statistical functionals (e.g., mean, median, quantiles) of the LLM's numerical output distribution directly from its internal representations. Our results suggest that LLM embeddings carry informative signals about summary statistics of their predictive distributions, including the numerical uncertainty. This investigation opens up new questions about how LLMs internally encode uncertainty in numerical tasks, and about the feasibility of lightweight alternatives to sampling-based approaches for uncertainty-aware numerical predictions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes