CVGRLGApr 10, 2019

Pixel-Adaptive Convolutional Neural Networks

arXiv:1904.05373v1332 citations
Originality Incremental advance
AI Analysis

This addresses the problem of content-agnostic convolutions in CNNs for computer vision researchers, offering a versatile method with incremental improvements in specific applications.

The paper tackles the limitation of standard convolutions being content-agnostic by proposing pixel-adaptive convolution (PAC), which multiplies filter weights with learnable local pixel features, achieving state-of-the-art performance in deep joint image upsampling and competitive results with faster inference in CRF alternatives.

Convolutions are the fundamental building block of CNNs. The fact that their weights are spatially shared is one of the main reasons for their widespread use, but it also is a major limitation, as it makes convolutions content agnostic. We propose a pixel-adaptive convolution (PAC) operation, a simple yet effective modification of standard convolutions, in which the filter weights are multiplied with a spatially-varying kernel that depends on learnable, local pixel features. PAC is a generalization of several popular filtering techniques and thus can be used for a wide range of use cases. Specifically, we demonstrate state-of-the-art performance when PAC is used for deep joint image upsampling. PAC also offers an effective alternative to fully-connected CRF (Full-CRF), called PAC-CRF, which performs competitively, while being considerably faster. In addition, we also demonstrate that PAC can be used as a drop-in replacement for convolution layers in pre-trained networks, resulting in consistent performance improvements.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes