LGFeb 9

Diffusion-Inspired Reconfiguration of Transformers for Uncertainty Calibration

Manh Cuong Dao, Quang Hung Pham, Phi Le Nguyen, Thao Nguyen Truong, Bryan Kian Hsiang Low, Trong Nghia Hoang

arXiv:2602.08920v11.4h-index: 5

Originality Highly original

AI Analysis

This addresses the need for reliable uncertainty propagation in transformers for risk-sensitive applications, representing a novel method for a known bottleneck.

The paper tackles the problem of uncertainty calibration in pre-trained transformers by proposing a diffusion-inspired reconfiguration that models each feature transformation block as a probabilistic mapping, achieving superior calibration and predictive accuracy across vision and language benchmarks.

Uncertainty calibration in pre-trained transformers is critical for their reliable deployment in risk-sensitive applications. Yet, most existing pre-trained transformers do not have a principled mechanism for uncertainty propagation through their feature transformation stack. In this work, we propose a diffusion-inspired reconfiguration of transformers in which each feature transformation block is modeled as a probabilistic mapping. Composing these probabilistic mappings reveals a probability path that mimics the structure of a diffusion process, transporting data mass from the input distribution to the pre-trained feature distribution. This probability path can then be recompiled on a diffusion process with a unified transition model to enable principled propagation of representation uncertainty throughout the pre-trained model's architecture while maintaining its original predictive performance. Empirical results across a variety of vision and language benchmarks demonstrate that our method achieves superior calibration and predictive accuracy compared to existing uncertainty-aware transformers.

View on arXiv PDF

Similar