LGCVJun 29, 2023

Restore Translation Using Equivariant Neural Networks

arXiv:2306.16938v1h-index: 8
Originality Incremental advance
AI Analysis

This addresses the issue of transformation sensitivity in neural networks for computer vision applications, offering a practical solution that maintains standard dataset performance.

The paper tackles the problem of CNNs being sensitive to translations and rotations by proposing a pre-classifier restorer that recovers transformed inputs to their original state before classification, achieving improved performance without modifying the classifier.

Invariance to spatial transformations such as translations and rotations is a desirable property and a basic design principle for classification neural networks. However, the commonly used convolutional neural networks (CNNs) are actually very sensitive to even small translations. There exist vast works to achieve exact or approximate transformation invariance by designing transformation-invariant models or assessing the transformations. These works usually make changes to the standard CNNs and harm the performance on standard datasets. In this paper, rather than modifying the classifier, we propose a pre-classifier restorer to recover translated (or even rotated) inputs to the original ones which will be fed into any classifier for the same dataset. The restorer is based on a theoretical result which gives a sufficient and necessary condition for an affine operator to be translational equivariant on a tensor space.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes