CV LGNov 2, 2023

H-NeXt: The next step towards roto-translation invariant networks

Tomas Karella, Filip Sroubek, Jan Flusser, Jan Blazek, Vasek Kosik

arXiv:2311.01111v12.83 citationsh-index: 41Has Code

Originality Highly original

AI Analysis

This addresses the need for robust models that generalize to unseen deformations, offering a parameter-efficient solution for computer vision tasks.

The paper tackled the problem of achieving roto-translation invariance in neural networks without using augmented training data, resulting in H-NeXt, which outperformed state-of-the-art methods on MNIST and CIFAR-10 datasets.

The widespread popularity of equivariant networks underscores the significance of parameter efficient models and effective use of training data. At a time when robustness to unseen deformations is becoming increasingly important, we present H-NeXt, which bridges the gap between equivariance and invariance. H-NeXt is a parameter-efficient roto-translation invariant network that is trained without a single augmented image in the training set. Our network comprises three components: an equivariant backbone for learning roto-translation independent features, an invariant pooling layer for discarding roto-translation information, and a classification layer. H-NeXt outperforms the state of the art in classification on unaugmented training sets and augmented test sets of MNIST and CIFAR-10.

View on arXiv PDF Code

Similar