LG MLFeb 12, 2025

Monge SAM: Robust Reparameterization-Invariant Sharpness-Aware Minimization Based on Loss Geometry

Albert Kjøller Jacobsen, Georgios Arvanitidis

arXiv:2502.08448v17.11 citationsh-index: 12

Originality Highly original

AI Analysis

This work addresses the problem of sharpness-aware minimization for machine learning practitioners, providing a more robust and reliable method for training deep neural networks.

The authors tackled the problem of sharpness-aware minimization in deep neural networks, proposing Monge SAM, which achieves reparametrization invariance and improves robustness to hyperparameter selection. Compared to previous approaches, Monge SAM demonstrates increased robustness and reduced attraction to suboptimal equilibria.

Recent studies on deep neural networks show that flat minima of the loss landscape correlate with improved generalization. Sharpness-aware minimization (SAM) efficiently finds flat regions by updating the parameters according to the gradient at an adversarial perturbation. The perturbation depends on the Euclidean metric, making SAM non-invariant under reparametrizations, which blurs sharpness and generalization. We propose Monge SAM (M-SAM), a reparametrization invariant version of SAM by considering a Riemannian metric in the parameter space induced naturally by the loss surface. Compared to previous approaches, M-SAM works under any modeling choice, relies only on mild assumptions while being as computationally efficient as SAM. We theoretically argue that M-SAM varies between SAM and gradient descent (GD), which increases robustness to hyperparameter selection and reduces attraction to suboptimal equilibria like saddle points. We demonstrate this behavior both theoretically and empirically on a multi-modal representation alignment task.

View on arXiv PDF

Similar