LGMLFeb 13, 2023

Density-Softmax: Efficient Test-time Model for Uncertainty Estimation and Robustness under Distribution Shifts

arXiv:2302.06495v314 citationsh-index: 3
Originality Incremental advance
AI Analysis

This addresses the problem of scalable uncertainty estimation for low-resource devices and real-time applications, representing an incremental improvement over existing methods.

The paper tackled the computational inefficiency of sampling-based uncertainty estimation methods by proposing Density-Softmax, a deterministic framework that reduces model size and latency while achieving competitive uncertainty and robustness results under distribution shifts.

Sampling-based methods, e.g., Deep Ensembles and Bayesian Neural Nets have become promising approaches to improve the quality of uncertainty estimation and robust generalization. However, they suffer from a large model size and high latency at test-time, which limits the scalability needed for low-resource devices and real-time applications. To resolve these computational issues, we propose Density-Softmax, a sampling-free deterministic framework via combining a density function built on a Lipschitz-constrained feature extractor with the softmax layer. Theoretically, we show that our model is the solution of minimax uncertainty risk and is distance-aware on feature space, thus reducing the over-confidence of the standard softmax under distribution shifts. Empirically, our method enjoys competitive results with state-of-the-art techniques in terms of uncertainty and robustness, while having a lower number of model parameters and a lower latency at test-time.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes