CVMay 6

SAMIC: A Lightweight Semantic-Aware Mamba for Efficient Perceptual Image Compression

arXiv:2605.0456012.5h-index: 8Has Code
Predicted impact top 65% in CV · last 90 daysOriginality Incremental advance
AI Analysis

For image compression researchers, SAMIC offers an efficient alternative to heavy generative models by leveraging state space models for perceptual quality.

SAMIC introduces a lightweight perceptual image compression method using a semantic-aware Mamba block and SVD-inspired redundancy reduction, achieving competitive rate-distortion-perception tradeoff with lower model complexity compared to GAN/diffusion-based approaches.

Perceptual image compression focuses on preserving high visual quality under low-bitrate constraints. Most existing approaches to perceptual compression leverage the strong generative capabilities of generative adversarial networks or diffusion models, at the cost of substantial model complexity. To this end, we present an efficient perceptual image compression method that exploits the long-range modeling capability and linear computational complexity of state space models, with a particular focus on Mamba. Unlike existing methods that rely on an inherently fixed scanning order and consequently impair semantic continuity and spatial correlation, we develop a semantic-aware Mamba block (SAMB) to enable scanning guided by dynamically clustered semantic features, thereby alleviating the strict causality constraints and long-range information decay inherent to Mamba. Inspired by singular value decomposition, we design an SVD-inspired redundancy reduction module (SVD-RRM) that performs a low-rank approximation on the latent features by introducing a learnable soft threshold, leading to channel-wise redundancy information reduction. The proposed SAMB is integrated into both the encoder and decoder of the compression framework, whereas the SVD-RRM is incorporated only in the encoder. Extensive experiments demonstrate that our method performs favorably against state-of-the-art approaches in terms of rate-distortion-perception tradeoff and model complexity. The source code and pretrained models will be available at https://github.com/Jasmine-aiq/SAMIC.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes