CVFeb 7, 2022

Patch-Based Stochastic Attention for Image Editing

arXiv:2202.03163v49 citationsHas Code
Originality Highly original
AI Analysis

This work addresses a bottleneck in attention-based models for image editing, offering a more scalable solution for high-resolution image processing.

The authors tackled the high computational and memory costs of full attention mechanisms in deep learning by proposing a Patch-based Stochastic Attention Layer (PSAL) based on the PatchMatch algorithm, achieving efficient scaling to high-resolution images for tasks like inpainting, colorization, and super-resolution.

Attention mechanisms have become of crucial importance in deep learning in recent years. These non-local operations, which are similar to traditional patch-based methods in image processing, complement local convolutions. However, computing the full attention matrix is an expensive step with heavy memory and computational loads. These limitations curb network architectures and performances, in particular for the case of high resolution images. We propose an efficient attention layer based on the stochastic algorithm PatchMatch, which is used for determining approximate nearest neighbors. We refer to our proposed layer as a "Patch-based Stochastic Attention Layer" (PSAL). Furthermore, we propose different approaches, based on patch aggregation, to ensure the differentiability of PSAL, thus allowing end-to-end training of any network containing our layer. PSAL has a small memory footprint and can therefore scale to high resolution images. It maintains this footprint without sacrificing spatial precision and globality of the nearest neighbors, which means that it can be easily inserted in any level of a deep architecture, even in shallower levels. We demonstrate the usefulness of PSAL on several image editing tasks, such as image inpainting, guided image colorization, and single-image super-resolution. Our code is available at: https://github.com/ncherel/psal

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes