CVDec 29, 2023

Efficient Multi-scale Network with Learnable Discrete Wavelet Transform for Blind Motion Deblurring

arXiv:2401.00027v262 citationsh-index: 7CVPR
Originality Incremental advance
AI Analysis

This addresses motion blur in images, a common issue in photography and vision tasks, with an incremental improvement over existing deep learning methods.

The paper tackles blind motion deblurring by proposing a multi-scale network with a learnable discrete wavelet transform (MLWNet), which simplifies coarse-to-fine schemes and improves detail restoration, achieving state-of-the-art performance on real-world datasets with enhanced computational efficiency.

Coarse-to-fine schemes are widely used in traditional single-image motion deblur; however, in the context of deep learning, existing multi-scale algorithms not only require the use of complex modules for feature fusion of low-scale RGB images and deep semantics, but also manually generate low-resolution pairs of images that do not have sufficient confidence. In this work, we propose a multi-scale network based on single-input and multiple-outputs(SIMO) for motion deblurring. This simplifies the complexity of algorithms based on a coarse-to-fine scheme. To alleviate restoration defects impacting detail information brought about by using a multi-scale architecture, we combine the characteristics of real-world blurring trajectories with a learnable wavelet transform module to focus on the directional continuity and frequency features of the step-by-step transitions between blurred images to sharp images. In conclusion, we propose a multi-scale network with a learnable discrete wavelet transform (MLWNet), which exhibits state-of-the-art performance on multiple real-world deblurred datasets, in terms of both subjective and objective quality as well as computational efficiency.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes