CVLGMLDec 14, 2016

Harmonic Networks: Deep Translation and Rotation Equivariance

arXiv:1612.04642v2773 citations
Originality Highly original
AI Analysis

This addresses the challenge of rotation invariance in computer vision tasks, offering a parameter-efficient solution that can be integrated with modern architectures, though it is incremental in improving existing CNN capabilities.

The paper tackles the problem of achieving rotation equivariance in convolutional neural networks (CNNs) by introducing Harmonic Networks (H-Nets), which replace regular filters with circular harmonics to enable patch-wise translation and 360-degree rotation equivariance, achieving state-of-the-art classification on rotated-MNIST and competitive results on other benchmarks.

Translating or rotating an input image should not affect the results of many computer vision tasks. Convolutional neural networks (CNNs) are already translation equivariant: input image translations produce proportionate feature map translations. This is not the case for rotations. Global rotation equivariance is typically sought through data augmentation, but patch-wise equivariance is more difficult. We present Harmonic Networks or H-Nets, a CNN exhibiting equivariance to patch-wise translation and 360-rotation. We achieve this by replacing regular CNN filters with circular harmonics, returning a maximal response and orientation for every receptive field patch. H-Nets use a rich, parameter-efficient and low computational complexity representation, and we show that deep feature maps within the network encode complicated rotational invariants. We demonstrate that our layers are general enough to be used in conjunction with the latest architectures and techniques, such as deep supervision and batch normalization. We also achieve state-of-the-art classification on rotated-MNIST, and competitive results on other benchmark challenges.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes