LGCVDec 5, 2024

Learning Symmetries via Weight-Sharing with Doubly Stochastic Tensors

arXiv:2412.04594v26 citationsh-index: 8NIPS
AI Analysis

This addresses the need for flexible symmetry learning in deep learning to improve generalization and efficiency, though it is incremental as it builds on existing group equivariance methods.

The paper tackled the problem of dynamically discovering and applying symmetries in neural networks without requiring prior knowledge of the groups, by proposing a method using learnable doubly stochastic matrices for soft weight-sharing, which converges to regular group convolutions when symmetries are strong and handles partial symmetries effectively.

Group equivariance has emerged as a valuable inductive bias in deep learning, enhancing generalization, data efficiency, and robustness. Classically, group equivariant methods require the groups of interest to be known beforehand, which may not be realistic for real-world data. Additionally, baking in fixed group equivariance may impose overly restrictive constraints on model architecture. This highlights the need for methods that can dynamically discover and apply symmetries as soft constraints. For neural network architectures, equivariance is commonly achieved through group transformations of a canonical weight tensor, resulting in weight sharing over a given group $G$. In this work, we propose to learn such a weight-sharing scheme by defining a collection of learnable doubly stochastic matrices that act as soft permutation matrices on canonical weight tensors, which can take regular group representations as a special case. This yields learnable kernel transformations that are jointly optimized with downstream tasks. We show that when the dataset exhibits strong symmetries, the permutation matrices will converge to regular group representations and our weight-sharing networks effectively become regular group convolutions. Additionally, the flexibility of the method enables it to effectively pick up on partial symmetries.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes