LGCVMLJun 1, 2024

On the Use of Anchoring for Training Vision Models

arXiv:2406.00529v11 citations
AI Analysis

This addresses a critical limitation in anchored training for vision models, though it appears incremental as it builds on an existing principle.

The paper tackles the problem of undesirable shortcut learning in anchored training for vision models, introducing a regularized protocol that achieves substantial performance gains in generalization and safety metrics.

Anchoring is a recent, architecture-agnostic principle for training deep neural networks that has been shown to significantly improve uncertainty estimation, calibration, and extrapolation capabilities. In this paper, we systematically explore anchoring as a general protocol for training vision models, providing fundamental insights into its training and inference processes and their implications for generalization and safety. Despite its promise, we identify a critical problem in anchored training that can lead to an increased risk of learning undesirable shortcuts, thereby limiting its generalization capabilities. To address this, we introduce a new anchored training protocol that employs a simple regularizer to mitigate this issue and significantly enhances generalization. We empirically evaluate our proposed approach across datasets and architectures of varying scales and complexities, demonstrating substantial performance gains in generalization and safety metrics compared to the standard training protocol.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes