CVDec 28, 2024

MaIR: A Locality- and Continuity-Preserving Mamba for Image Restoration

arXiv:2412.20066v265 citationsh-index: 11Has CodeCVPR
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in image restoration for computer vision applications, offering incremental improvements over existing Mamba-based methods.

The paper tackles the problem of preserving local relationships and spatial continuity in Mamba-based image restoration by proposing a novel scanning strategy and sequence aggregation method, achieving state-of-the-art performance across 14 datasets and surpassing 40 baselines in tasks like super-resolution, denoising, deblurring, and dehazing.

Recent advancements in Mamba have shown promising results in image restoration. These methods typically flatten 2D images into multiple distinct 1D sequences along rows and columns, process each sequence independently using selective scan operation, and recombine them to form the outputs. However, such a paradigm overlooks two vital aspects: i) the local relationships and spatial continuity inherent in natural images, and ii) the discrepancies among sequences unfolded through totally different ways. To overcome the drawbacks, we explore two problems in Mamba-based restoration methods: i) how to design a scanning strategy preserving both locality and continuity while facilitating restoration, and ii) how to aggregate the distinct sequences unfolded in totally different ways. To address these problems, we propose a novel Mamba-based Image Restoration model (MaIR), which consists of Nested S-shaped Scanning strategy (NSS) and Sequence Shuffle Attention block (SSA). Specifically, NSS preserves locality and continuity of the input images through the stripe-based scanning region and the S-shaped scanning path, respectively. SSA aggregates sequences through calculating attention weights within the corresponding channels of different sequences. Thanks to NSS and SSA, MaIR surpasses 40 baselines across 14 challenging datasets, achieving state-of-the-art performance on the tasks of image super-resolution, denoising, deblurring and dehazing. The code is available at https://github.com/XLearning-SCU/2025-CVPR-MaIR.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes