CVIVSep 15, 2021

Resolution-robust Large Mask Inpainting with Fourier Convolutions

arXiv:2109.07161v21320 citationsHas Code
AI Analysis

This addresses a key limitation in image inpainting systems for applications like photo editing and restoration, offering a novel approach with practical efficiency gains.

The paper tackles the problem of image inpainting for large missing areas and high-resolution images by proposing LaMa, a method using Fourier convolutions and high receptive field loss, which improves state-of-the-art performance across datasets and generalizes well to higher resolutions with lower computational costs.

Modern image inpainting systems, despite the significant progress, often struggle with large missing areas, complex geometric structures, and high-resolution images. We find that one of the main reasons for that is the lack of an effective receptive field in both the inpainting network and the loss function. To alleviate this issue, we propose a new method called large mask inpainting (LaMa). LaMa is based on i) a new inpainting network architecture that uses fast Fourier convolutions (FFCs), which have the image-wide receptive field; ii) a high receptive field perceptual loss; iii) large training masks, which unlocks the potential of the first two components. Our inpainting network improves the state-of-the-art across a range of datasets and achieves excellent performance even in challenging scenarios, e.g. completion of periodic structures. Our model generalizes surprisingly well to resolutions that are higher than those seen at train time, and achieves this at lower parameter&time costs than the competitive baselines. The code is available at \url{https://github.com/saic-mdal/lama}.

Code Implementations8 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes