LG MLSep 3, 2025

LSAM: Asynchronous Distributed Training with Landscape-Smoothed Sharpness-Aware Minimization

arXiv:2509.03110v14.1

Originality Incremental advance

AI Analysis

This addresses a bottleneck in distributed deep learning training, offering improved efficiency for large-scale models, though it appears incremental as it builds directly on SAM.

The paper tackled the inefficiency of Sharpness-Aware Minimization (SAM) in distributed large-batch training by introducing LSAM, which integrates SAM with asynchronous distributed sampling to eliminate synchronization bottlenecks, resulting in accelerated convergence and higher final accuracy compared to data-parallel SAM.

While Sharpness-Aware Minimization (SAM) improves generalization in deep neural networks by minimizing both loss and sharpness, it suffers from inefficiency in distributed large-batch training. We present Landscape-Smoothed SAM (LSAM), a novel optimizer that preserves SAM's generalization advantages while offering superior efficiency. LSAM integrates SAM's adversarial steps with an asynchronous distributed sampling strategy, generating an asynchronous distributed sampling scheme, producing a smoothed sharpness-aware loss landscape for optimization. This design eliminates synchronization bottlenecks, accelerates large-batch convergence, and delivers higher final accuracy compared to data-parallel SAM.

View on arXiv PDF

Similar