CVSep 28, 2024

Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration

arXiv:2409.19403v126 citationsh-index: 8Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of handling multiple image degradations with a single model for computer vision applications, representing an incremental improvement.

The paper tackles all-in-one blind image restoration by proposing a pipeline that uses Mask Image Modeling to focus on image content rather than degradation types, achieving state-of-the-art performance across multiple tasks.

All-in-one image restoration aims to handle multiple degradation types using one model. This paper proposes a simple pipeline for all-in-one blind image restoration to Restore Anything with Masks (RAM). We focus on the image content by utilizing Mask Image Modeling to extract intrinsic image information rather than distinguishing degradation types like other methods. Our pipeline consists of two stages: masked image pre-training and fine-tuning with mask attribute conductance. We design a straightforward masking pre-training approach specifically tailored for all-in-one image restoration. This approach enhances networks to prioritize the extraction of image content priors from various degradations, resulting in a more balanced performance across different restoration tasks and achieving stronger overall results. To bridge the gap of input integrity while preserving learned image priors as much as possible, we selectively fine-tuned a small portion of the layers. Specifically, the importance of each layer is ranked by the proposed Mask Attribute Conductance (MAC), and the layers with higher contributions are selected for finetuning. Extensive experiments demonstrate that our method achieves state-of-the-art performance. Our code and model will be released at \href{https://github.com/Dragonisss/RAM}{https://github.com/Dragonisss/RAM}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes