LG AIAug 18, 2025

Toward Storage-Aware Learning with Compressed Data An Empirical Exploratory Study on JPEG

Kichang Lee, Songkuk Kim, JaeYeon Park, JeongGil Ko

arXiv:2508.12833v14.11 citationsh-index: 5ICT express

Originality Synthesis-oriented

AI Analysis

It addresses storage constraints for on-device ML in continuous data collection, but the work is incremental as it provides empirical insights without proposing a new method.

This paper tackles the problem of limited storage in on-device machine learning by empirically studying the trade-off between data quantity and quality through compression, finding that naive strategies like uniform dropping are suboptimal and that data samples have varying sensitivities to compression, supporting adaptive strategies.

On-device machine learning is often constrained by limited storage, particularly in continuous data collection scenarios. This paper presents an empirical study on storage-aware learning, focusing on the trade-off between data quantity and quality via compression. We demonstrate that naive strategies, such as uniform data dropping or one-size-fits-all compression, are suboptimal. Our findings further reveal that data samples exhibit varying sensitivities to compression, supporting the feasibility of a sample-wise adaptive compression strategy. These insights provide a foundation for developing a new class of storage-aware learning systems. The primary contribution of this work is the systematic characterization of this under-explored challenge, offering valuable insights that advance the understanding of storage-aware learning.

View on arXiv PDF

Similar