CVLGNov 2, 2023

H-NeXt: The next step towards roto-translation invariant networks

arXiv:2311.01111v13 citationsh-index: 41
Originality Highly original
AI Analysis

This addresses the need for robust models that generalize to unseen deformations, offering a parameter-efficient solution for computer vision tasks.

The paper tackled the problem of achieving roto-translation invariance in neural networks without using augmented training data, resulting in H-NeXt, which outperformed state-of-the-art methods on MNIST and CIFAR-10 datasets.

The widespread popularity of equivariant networks underscores the significance of parameter efficient models and effective use of training data. At a time when robustness to unseen deformations is becoming increasingly important, we present H-NeXt, which bridges the gap between equivariance and invariance. H-NeXt is a parameter-efficient roto-translation invariant network that is trained without a single augmented image in the training set. Our network comprises three components: an equivariant backbone for learning roto-translation independent features, an invariant pooling layer for discarding roto-translation information, and a classification layer. H-NeXt outperforms the state of the art in classification on unaugmented training sets and augmented test sets of MNIST and CIFAR-10.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes