IVCVLGSep 12, 2024

Learned Compression for Images and Point Clouds

arXiv:2409.08376v1h-index: 4
Originality Incremental advance
AI Analysis

This work addresses compression challenges in multimedia codecs, offering incremental improvements for specialized applications like point cloud classification.

The thesis tackles data compression for images and point clouds by introducing an efficient entropy model that adapts to inputs, a lightweight point cloud codec for classification with reduced bitrate, and analysis of motion in latent spaces for video.

Over the last decade, deep learning has shown great success at performing computer vision tasks, including classification, super-resolution, and style transfer. Now, we apply it to data compression to help build the next generation of multimedia codecs. This thesis provides three primary contributions to this new field of learned compression. First, we present an efficient low-complexity entropy model that dynamically adapts the encoding distribution to a specific input by compressing and transmitting the encoding distribution itself as side information. Secondly, we propose a novel lightweight low-complexity point cloud codec that is highly specialized for classification, attaining significant reductions in bitrate compared to non-specialized codecs. Lastly, we explore how motion within the input domain between consecutive video frames is manifested in the corresponding convolutionally-derived latent space.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes