LGAIOct 29, 2024

Learning Infinitesimal Generators of Continuous Symmetries from Data

arXiv:2410.21853v213 citationsh-index: 2Has CodeNIPS
Originality Highly original
AI Analysis

This work addresses the challenge of leveraging symmetry in data for machine learning, which is crucial for improving model performance in domains like image processing and scientific computing, though it is incremental by extending existing symmetry learning methods to nonlinear transformations.

The paper tackles the problem of identifying ambiguous symmetries in data by proposing a novel algorithm that learns infinitesimal generators of continuous symmetries from data, achieving improved sample efficiency and generalization in image data and partial differential equations.

Exploiting symmetry inherent in data can significantly improve the sample efficiency of a learning procedure and the generalization of learned models. When data clearly reveals underlying symmetry, leveraging this symmetry can naturally inform the design of model architectures or learning strategies. Yet, in numerous real-world scenarios, identifying the specific symmetry within a given data distribution often proves ambiguous. To tackle this, some existing works learn symmetry in a data-driven manner, parameterizing and learning expected symmetry through data. However, these methods often rely on explicit knowledge, such as pre-defined Lie groups, which are typically restricted to linear or affine transformations. In this paper, we propose a novel symmetry learning algorithm based on transformations defined with one-parameter groups, continuously parameterized transformations flowing along the directions of vector fields called infinitesimal generators. Our method is built upon minimal inductive biases, encompassing not only commonly utilized symmetries rooted in Lie groups but also extending to symmetries derived from nonlinear generators. To learn these symmetries, we introduce a notion of a validity score that examine whether the transformed data is still valid for the given task. The validity score is designed to be fully differentiable and easily computable, enabling effective searches for transformations that achieve symmetries innate to the data. We apply our method mainly in two domains: image data and partial differential equations, and demonstrate its advantages. Our codes are available at \url{https://github.com/kogyeonghoon/learning-symmetry-from-scratch.git}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes