CVAug 19, 2024

Multi-Scale Representation Learning for Image Restoration with State-Space Model

arXiv:2408.10145v116 citationsh-index: 9
AI Analysis

This work improves image restoration for photography and computer vision systems by offering a more efficient and effective method, though it is incremental as it builds on existing paradigms with novel modules.

The paper tackles image restoration by proposing a Multi-Scale State-Space Model (MS-Mamba) to address computational inefficiency and limited receptive fields in existing methods, achieving state-of-the-art performance on nine benchmarks across four tasks while maintaining low complexity.

Image restoration endeavors to reconstruct a high-quality, detail-rich image from a degraded counterpart, which is a pivotal process in photography and various computer vision systems. In real-world scenarios, different types of degradation can cause the loss of image details at various scales and degrade image contrast. Existing methods predominantly rely on CNN and Transformer to capture multi-scale representations. However, these methods are often limited by the high computational complexity of Transformers and the constrained receptive field of CNN, which hinder them from achieving superior performance and efficiency in image restoration. To address these challenges, we propose a novel Multi-Scale State-Space Model-based (MS-Mamba) for efficient image restoration that enhances the capacity for multi-scale representation learning through our proposed global and regional SSM modules. Additionally, an Adaptive Gradient Block (AGB) and a Residual Fourier Block (RFB) are proposed to improve the network's detail extraction capabilities by capturing gradients in various directions and facilitating learning details in the frequency domain. Extensive experiments on nine public benchmarks across four classic image restoration tasks, image deraining, dehazing, denoising, and low-light enhancement, demonstrate that our proposed method achieves new state-of-the-art performance while maintaining low computational complexity. The source code will be publicly available.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes