LG AIOct 29, 2024

Learning Infinitesimal Generators of Continuous Symmetries from Data

arXiv:2410.21853v216.413 citationsh-index: 2Has CodeNIPS

Originality Highly original

AI Analysis

This work addresses the challenge of leveraging symmetry in data for machine learning, which is crucial for improving model performance in domains like image processing and scientific computing, though it is incremental by extending existing symmetry learning methods to nonlinear transformations.

The paper tackles the problem of identifying ambiguous symmetries in data by proposing a novel algorithm that learns infinitesimal generators of continuous symmetries from data, achieving improved sample efficiency and generalization in image data and partial differential equations.

Exploiting symmetry inherent in data can significantly improve the sample efficiency of a learning procedure and the generalization of learned models. When data clearly reveals underlying symmetry, leveraging this symmetry can naturally inform the design of model architectures or learning strategies. Yet, in numerous real-world scenarios, identifying the specific symmetry within a given data distribution often proves ambiguous. To tackle this, some existing works learn symmetry in a data-driven manner, parameterizing and learning expected symmetry through data. However, these methods often rely on explicit knowledge, such as pre-defined Lie groups, which are typically restricted to linear or affine transformations. In this paper, we propose a novel symmetry learning algorithm based on transformations defined with one-parameter groups, continuously parameterized transformations flowing along the directions of vector fields called infinitesimal generators. Our method is built upon minimal inductive biases, encompassing not only commonly utilized symmetries rooted in Lie groups but also extending to symmetries derived from nonlinear generators. To learn these symmetries, we introduce a notion of a validity score that examine whether the transformed data is still valid for the given task. The validity score is designed to be fully differentiable and easily computable, enabling effective searches for transformations that achieve symmetries innate to the data. We apply our method mainly in two domains: image data and partial differential equations, and demonstrate its advantages. Our codes are available at \url{https://github.com/kogyeonghoon/learning-symmetry-from-scratch.git}.

View on arXiv PDF Code

Similar