LGAIDec 8, 2025

FOAM: Blocked State Folding for Memory-Efficient LLM Training

arXiv:2512.07112v1h-index: 7
Originality Highly original
AI Analysis

This addresses memory constraints for researchers and practitioners training large language models, offering a practical improvement over existing memory-efficient approaches.

The paper tackles the memory bottleneck in large language model training by proposing FOAM, a method that compresses optimizer states using block-wise gradient means with residual correction, reducing total training memory by approximately 50% and optimizer state memory by up to 90% while maintaining convergence rates equivalent to vanilla Adam.

Large language models (LLMs) have demonstrated remarkable performance due to their large parameter counts and extensive training data. However, their scale leads to significant memory bottlenecks during training, especially when using memory-intensive optimizers like Adam. Existing memory-efficient approaches often rely on techniques such as singular value decomposition (SVD), projections, or weight freezing, which can introduce substantial computational overhead, require additional memory for projections, or degrade model performance. In this paper, we propose Folded Optimizer with Approximate Moment (FOAM), a method that compresses optimizer states by computing block-wise gradient means and incorporates a residual correction to recover lost information. Theoretically, FOAM achieves convergence rates equivalent to vanilla Adam under standard non-convex optimization settings. Empirically, FOAM reduces total training memory by approximately 50\%, eliminates up to 90\% of optimizer state memory overhead, and accelerates convergence. Furthermore, FOAM is compatible with other memory-efficient optimizers, delivering performance and throughput that match or surpass both full-rank and existing memory-efficient baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes