DC LGJun 17, 2024

What Operations can be Performed Directly on Compressed Arrays, and with What Error?

Tripti Agarwal, Harvey Dam, Dorra Ben Khalifa, Matthieu Martel, P. Sadayappan, Ganesh Gopalakrishnan

arXiv:2406.11209v11.2

Originality Highly original

AI Analysis

This work addresses data movement bottlenecks in computing for applications involving large matrices and tensors, representing a novel method for a known bottleneck.

The paper tackles the problem of high data movement costs in large matrix and tensor computations by developing a lossy compressor, PyBlaz, that allows a dozen fundamental operations directly on compressed data, achieving good compression ratios and modest errors within acceptable limits.

In response to the rapidly escalating costs of computing with large matrices and tensors caused by data movement, several lossy compression methods have been developed to significantly reduce data volumes. Unfortunately, all these methods require the data to be decompressed before further computations are done. In this work, we develop a lossy compressor that allows a dozen fairly fundamental operations directly on compressed data while offering good compression ratios and modest errors. We implement a new compressor PyBlaz based on the familiar GPU-powered PyTorch framework, and evaluate it on three non-trivial applications, choosing different number systems for internal representation. Our results demonstrate that the compressed-domain operations achieve good scalability with problem sizes while incurring errors well within acceptable limits. To our best knowledge, this is the first such lossy compressor that supports compressed-domain operations while achieving acceptable performance as well as error.

View on arXiv PDF

Similar