MLAICLLGDec 7, 2024

Training-Free Bayesianization for Low-Rank Adapters of Large Language Models

arXiv:2412.05723v30.124 citationsh-index: 8Has Code
AI Analysis55

This addresses the problem of uncertainty quantification for LLM users by providing a more efficient method, though it is incremental as it builds on existing low-rank adapter techniques.

The paper tackles the challenge of estimating uncertainty in Large Language Models by proposing Training-Free Bayesianization (TFB), a framework that converts trained low-rank adapters into Bayesian ones without extra training, achieving superior uncertainty estimation and generalization compared to existing methods.

Estimating the uncertainty of responses from Large Language Models (LLMs) remains a critical challenge. While recent Bayesian methods have demonstrated effectiveness in quantifying uncertainty through low-rank weight updates, they typically require complex fine-tuning or post-training procedures. In this paper, we propose Training-Free Bayesianization (TFB), a simple yet theoretically grounded framework that efficiently transforms trained low-rank adapters into Bayesian ones without additional training. TFB systematically searches for the maximally acceptable level of variance in the weight posterior, constrained within a family of low-rank isotropic Gaussian distributions. Our theoretical analysis shows that under mild conditions, this search process is equivalent to KL-regularized variational optimization, a generalized form of variational inference. Through comprehensive experiments, we show that TFB achieves superior uncertainty estimation and generalization compared to existing methods while eliminating the need for complex Bayesianization training procedures. Code will be available at https://github.com/Wang-ML-Lab/bayesian-peft.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes