CVAug 4, 2025

Content-Aware Mamba for Learned Image Compression

arXiv:2508.02192v43 citationsh-index: 6
Originality Highly original
AI Analysis

This work addresses a bottleneck in learned image compression for applications requiring efficient global redundancy elimination, representing a novel method rather than an incremental improvement.

The paper tackled the problem of rigid, content-agnostic scans in Mamba-style state-space models for learned image compression, introducing Content-Aware Mamba (CAM) to dynamically adapt processing to image content, resulting in state-of-the-art rate-distortion performance with BD-rate improvements of 15.91% to 21.34% over VTM-21.0 on standard datasets.

Recent learned image compression (LIC) leverages Mamba-style state-space models (SSMs) for global receptive fields with linear complexity. However, the standard Mamba adopts content-agnostic, predefined raster (or multi-directional) scans under strict causality. This rigidity hinders its ability to effectively eliminate redundancy between tokens that are content-correlated but spatially distant. We introduce Content-Aware Mamba (CAM), an SSM that dynamically adapts its processing to the image content. Specifically, CAM overcomes prior limitations with two novel mechanisms. First, it replaces the rigid scan with a content-adaptive token permutation strategy to prioritize interactions between content-similar tokens regardless of their location. Second, it overcomes the sequential dependency by injecting sample-specific global priors into the state-space model, which effectively mitigates the strict causality without multi-directional scans. These innovations enable CAM to better capture global redundancy while preserving computational efficiency. Our Content-Aware Mamba-based LIC model (CMIC) achieves state-of-the-art rate-distortion performance, surpassing VTM-21.0 by 15.91%, 21.34%, and 17.58% in BD-rate on the Kodak, Tecnick, and CLIC datasets, respectively. Code and checkpoints will be released later.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes