LG ST MLNov 13, 2023

Affine Invariance in Continuous-Domain Convolutional Neural Networks

arXiv:2311.09245v22.01 citationsh-index: 1

Originality Incremental advance

AI Analysis

This work addresses the problem of improving pattern recognition under affine transformations for machine learning researchers, but it appears incremental as it builds on existing group invariance concepts.

The paper tackles the problem of achieving affine invariance in convolutional neural networks by introducing a new criterion for assessing invariance under affine transformations and embedding images into the affine Lie group for group convolution operations, with the result being a theoretical framework that could extend the handling of geometric transformations in deep learning.

The notion of group invariance helps neural networks in recognizing patterns and features under geometric transformations. Group convolutional neural networks enhance traditional convolutional neural networks by incorporating group-based geometric structures into their design. This research studies affine invariance on continuous-domain convolutional neural networks. Despite other research considering isometric invariance or similarity invariance, we focus on the full structure of affine transforms generated by the group of all invertible $2 \times 2$ real matrices (generalized linear group $\mathrm{GL}_2(\mathbb{R})$). We introduce a new criterion to assess the invariance of two signals under affine transformations. The input image is embedded into the affine Lie group $G_2 = \mathbb{R}^2 \ltimes \mathrm{GL}_2(\mathbb{R})$ to facilitate group convolution operations that respect affine invariance. Then, we analyze the convolution of embedded signals over $G_2$. In sum, our research could eventually extend the scope of geometrical transformations that usual deep-learning pipelines can handle.

View on arXiv PDF

Similar