CVAILGNov 28, 2020

Truly shift-invariant convolutional neural networks

arXiv:2011.14214v488 citations
AI Analysis

This work solves the problem of shift variance for convolutional neural networks, which is important for applications requiring robust performance under input shifts.

This paper addresses the problem of shift variance in convolutional neural networks, which arises from downsampling layers. The authors propose adaptive polyphase sampling (APS), a subsampling scheme that enables CNNs to achieve 100% consistency in classification performance under shifts without accuracy loss.

Thanks to the use of convolution and pooling layers, convolutional neural networks were for a long time thought to be shift-invariant. However, recent works have shown that the output of a CNN can change significantly with small shifts in input: a problem caused by the presence of downsampling (stride) layers. The existing solutions rely either on data augmentation or on anti-aliasing, both of which have limitations and neither of which enables perfect shift invariance. Additionally, the gains obtained from these methods do not extend to image patterns not seen during training. To address these challenges, we propose adaptive polyphase sampling (APS), a simple sub-sampling scheme that allows convolutional neural networks to achieve 100% consistency in classification performance under shifts, without any loss in accuracy. With APS, the networks exhibit perfect consistency to shifts even before training, making it the first approach that makes convolutional neural networks truly shift-invariant.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes