CVDec 4, 2021

U2-Former: A Nested U-shaped Transformer for Image Restoration

arXiv:2112.02279v234 citations
Originality Highly original
AI Analysis

This work addresses inefficiencies in Transformer-based image restoration, offering a novel architecture that enhances performance for applications in computer vision, though it is incremental in improving existing methods.

The paper tackles the challenge of applying Transformers effectively in image restoration by proposing U2-Former, a deep Transformer-based network that uses a nested U-shaped structure and feature-filtering to improve efficiency and interactions across layers, achieving state-of-the-art results on tasks like reflection removal, rain streak removal, and dehazing.

While Transformer has achieved remarkable performance in various high-level vision tasks, it is still challenging to exploit the full potential of Transformer in image restoration. The crux lies in the limited depth of applying Transformer in the typical encoder-decoder framework for image restoration, resulting from heavy self-attention computation load and inefficient communications across different depth (scales) of layers. In this paper, we present a deep and effective Transformer-based network for image restoration, termed as U2-Former, which is able to employ Transformer as the core operation to perform image restoration in a deep encoding and decoding space. Specifically, it leverages the nested U-shaped structure to facilitate the interactions across different layers with different scales of feature maps. Furthermore, we optimize the computational efficiency for the basic Transformer block by introducing a feature-filtering mechanism to compress the token representation. Apart from the typical supervision ways for image restoration, our U2-Former also performs contrastive learning in multiple aspects to further decouple the noise component from the background image. Extensive experiments on various image restoration tasks, including reflection removal, rain streak removal and dehazing respectively, demonstrate the effectiveness of the proposed U2-Former.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes