LGJun 13, 2025

An Efficient Compression of Deep Neural Network Checkpoints Based on Prediction and Context Modeling

arXiv:2506.12000v12 citationsh-index: 1

Originality Incremental advance

AI Analysis

This is an incremental improvement for storage-limited environments in deep learning.

The paper tackles the problem of compressing neural network checkpoints for storage efficiency, achieving substantial bit size reduction while enabling near-lossless training recovery.

This paper is dedicated to an efficient compression of weights and optimizer states (called checkpoints) obtained at different stages during a neural network training process. First, we propose a prediction-based compression approach, where values from the previously saved checkpoint are used for context modeling in arithmetic coding. Second, in order to enhance the compression performance, we also propose to apply pruning and quantization of the checkpoint values. Experimental results show that our approach achieves substantial bit size reduction, while enabling near-lossless training recovery from restored checkpoints, preserving the model's performance and making it suitable for storage-limited environments.

View on arXiv PDF

Similar