LGCVMLJun 19, 2020

From Discrete to Continuous Convolution Layers

arXiv:2006.11120v118 citations
Originality Incremental advance
AI Analysis

This addresses the problem of rigid spatial resizing in CNNs for researchers and practitioners, enabling more adaptable and efficient network designs, though it is an incremental advancement over existing convolution methods.

The paper tackles the limitation of fixed integer strides in convolutional layers by proposing Continuous Convolution (CC) Layers, which allow dynamic and learnable resizing of feature maps to any size, including non-integer scales, resulting in improved alignment and flexibility in CNN architectures.

A basic operation in Convolutional Neural Networks (CNNs) is spatial resizing of feature maps. This is done either by strided convolution (donwscaling) or transposed convolution (upscaling). Such operations are limited to a fixed filter moving at predetermined integer steps (strides). Spatial sizes of consecutive layers are related by integer scale factors, predetermined at architectural design, and remain fixed throughout training and inference time. We propose a generalization of the common Conv-layer, from a discrete layer to a Continuous Convolution (CC) Layer. CC Layers naturally extend Conv-layers by representing the filter as a learned continuous function over sub-pixel coordinates. This allows learnable and principled resizing of feature maps, to any size, dynamically and consistently across scales. Once trained, the CC layer can be used to output any scale/size chosen at inference time. The scale can be non-integer and differ between the axes. CC gives rise to new freedoms for architectural design, such as dynamic layer shapes at inference time, or gradual architectures where the size changes by a small factor at each layer. This gives rise to many desired CNN properties, new architectural design capabilities, and useful applications. We further show that current Conv-layers suffer from inherent misalignments, which are ameliorated by CC layers.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes