DC · LG · Jun 17, 2024

What Operations can be Performed Directly on Compressed Arrays, and with What Error?

arXiv:2406.11209v1
Originality: Highly original
AI Analysis

This work addresses the data movement bottleneck in applications that compute with large matrices and tensors, proposing a novel method for a known problem.

The paper tackles the high data movement costs of large matrix and tensor computations by developing a lossy compressor, PyBlaz, that supports a dozen fundamental operations directly on compressed data while achieving good compression ratios and errors within acceptable limits.

In response to the rapidly escalating costs of computing with large matrices and tensors caused by data movement, several lossy compression methods have been developed to significantly reduce data volumes. Unfortunately, all these methods require the data to be decompressed before further computations are done. In this work, we develop a lossy compressor that allows a dozen fairly fundamental operations directly on compressed data while offering good compression ratios and modest errors. We implement a new compressor, PyBlaz, based on the familiar GPU-powered PyTorch framework, and evaluate it on three non-trivial applications, choosing different number systems for internal representation. Our results demonstrate that the compressed-domain operations scale well with problem size while incurring errors well within acceptable limits. To the best of our knowledge, this is the first such lossy compressor that supports compressed-domain operations while achieving acceptable performance as well as error.
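
To make the idea of a compressed-domain operation concrete, here is a minimal sketch in plain Python. It is not PyBlaz's API or its actual pipeline; it assumes a generic scheme in which a vector is split into fixed-size blocks, each block is transformed with an orthonormal DCT, and only the first few coefficients are kept. The names `compress`, `decompress`, `add_compressed`, and `dot_compressed` are hypothetical. Under these assumptions, addition works directly on the kept coefficients because the transform is linear, and the dot product of the coefficients equals the dot product of the decompressed vectors because the kept basis vectors are orthonormal.

```python
import math

def dct_matrix(n):
    """Orthonormal DCT-II basis; row k is the k-th basis vector."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * (i + 0.5) * k / n)
                     for i in range(n)])
    return rows

def compress(x, block=8, keep=4):
    """Hypothetical compressor: blockwise DCT, keep the first `keep` coefficients."""
    assert len(x) % block == 0, "length must be a multiple of the block size"
    basis = dct_matrix(block)
    coeffs = []
    for start in range(0, len(x), block):
        chunk = x[start:start + block]
        coeffs.append([sum(b * v for b, v in zip(basis[k], chunk))
                       for k in range(keep)])
    return coeffs

def decompress(coeffs, block=8):
    """Inverse transform, treating the dropped coefficients as zero."""
    basis = dct_matrix(block)
    x = []
    for c in coeffs:
        x.extend(sum(c[k] * basis[k][i] for k in range(len(c)))
                 for i in range(block))
    return x

def add_compressed(a, b):
    """Addition in the compressed domain (valid because the transform is linear)."""
    return [[u + v for u, v in zip(ca, cb)] for ca, cb in zip(a, b)]

def dot_compressed(a, b):
    """Dot product in the compressed domain (valid because the basis is orthonormal)."""
    return sum(u * v for ca, cb in zip(a, b) for u, v in zip(ca, cb))
```

In this sketch the only error relative to the uncompressed result comes from the dropped coefficients: with `keep` equal to `block`, `dot_compressed` reproduces the exact dot product up to floating-point rounding. A real compressor would typically also quantize the kept coefficients, which adds a further source of error that this sketch does not model.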
