LG AIJul 31, 2023

Lookbehind-SAM: k steps back, 1 step forward

Gonçalo Mordido, Pranshu Malviya, Aristide Baratin, Sarath Chandar

arXiv:2307.16704v38.84 citationsh-index: 21Has Code

Originality Incremental advance

AI Analysis

This work addresses the need for more efficient optimization methods in machine learning, particularly for tasks requiring robustness and lifelong learning, though it appears incremental as it builds on existing SAM and Lookahead techniques.

The authors tackled the problem of improving the efficiency of sharpness-aware minimization (SAM) by enhancing both its maximization and minimization steps, resulting in better generalization, robustness, and reduced catastrophic forgetting across various tasks.

Sharpness-aware minimization (SAM) methods have gained increasing popularity by formulating the problem of minimizing both loss value and loss sharpness as a minimax objective. In this work, we increase the efficiency of the maximization and minimization parts of SAM's objective to achieve a better loss-sharpness trade-off. By taking inspiration from the Lookahead optimizer, which uses multiple descent steps ahead, we propose Lookbehind, which performs multiple ascent steps behind to enhance the maximization step of SAM and find a worst-case perturbation with higher loss. Then, to mitigate the variance in the descent step arising from the gathered gradients across the multiple ascent steps, we employ linear interpolation to refine the minimization step. Lookbehind leads to a myriad of benefits across a variety of tasks. Particularly, we show increased generalization performance, greater robustness against noisy weights, as well as improved learning and less catastrophic forgetting in lifelong learning settings. Our code is available at https://github.com/chandar-lab/Lookbehind-SAM.

View on arXiv PDF Code

Similar