LGApr 14, 2022

Relaxing Equivariance Constraints with Non-stationary Continuous Filters

Tycho F. A. van der Ouderaa, David W. Romero, Mark van der Wilk

arXiv:2204.07178v222.946 citationsh-index: 27

Originality Incremental advance

AI Analysis

This work addresses the need for adjustable symmetry in neural networks for tasks where fixed equivariance is too restrictive, offering a method to learn equivariance from data, though it is incremental as it builds on existing equivariance concepts.

The authors tackled the problem of equivariance constraints being overly restrictive for tasks not strictly following symmetries by proposing a parameter-efficient relaxation that can interpolate between non-equivariant, strict-equivariant, and invariant mappings, achieving similar or improved performance compared to cross-validation and outperforming baselines on CIFAR-10 and CIFAR-100 image classification tasks.

Equivariances provide useful inductive biases in neural network modeling, with the translation equivariance of convolutional neural networks being a canonical example. Equivariances can be embedded in architectures through weight-sharing and place symmetry constraints on the functions a neural network can represent. The type of symmetry is typically fixed and has to be chosen in advance. Although some tasks are inherently equivariant, many tasks do not strictly follow such symmetries. In such cases, equivariance constraints can be overly restrictive. In this work, we propose a parameter-efficient relaxation of equivariance that can effectively interpolate between a (i) non-equivariant linear product, (ii) a strict-equivariant convolution, and (iii) a strictly-invariant mapping. The proposed parameterisation can be thought of as a building block to allow adjustable symmetry structure in neural networks. In addition, we demonstrate that the amount of equivariance can be learned from the training data using backpropagation. Gradient-based learning of equivariance achieves similar or improved performance compared to the best value found by cross-validation and outperforms baselines with partial or strict equivariance on CIFAR-10 and CIFAR-100 image classification tasks.

View on arXiv PDF

Similar