LGBMDec 2, 2025

Training Dynamics of Learning 3D-Rotational Equivariance

arXiv:2512.02303v1h-index: 3Trans. Mach. Learn. Res.
Originality Incremental advance
AI Analysis

This addresses the problem of understanding training dynamics for symmetry learning in machine learning, with incremental insights into efficiency trade-offs for non-equivariant models.

The paper investigates how quickly and effectively models learn 3D-rotational equivariance, finding that they reduce equivariance error to ≤2% held-out loss within 1k-10k training steps on molecular tasks, due to a smoother loss landscape compared to the main prediction task.

While data augmentation is widely used to train symmetry-agnostic models, it remains unclear how quickly and effectively they learn to respect symmetries. We investigate this by deriving a principled measure of equivariance error that, for convex losses, calculates the percent of total loss attributable to imperfections in learned symmetry. We focus our empirical investigation to 3D-rotation equivariance on high-dimensional molecular tasks (flow matching, force field prediction, denoising voxels) and find that models reduce equivariance error quickly to $\leq$2\% held-out loss within 1k-10k training steps, a result robust to model and dataset size. This happens because learning 3D-rotational equivariance is an easier learning task, with a smoother and better-conditioned loss landscape, than the main prediction task. For 3D rotations, the loss penalty for non-equivariant models is small throughout training, so they may achieve lower test loss than equivariant models per GPU-hour unless the equivariant ``efficiency gap'' is narrowed. We also experimentally and theoretically investigate the relationships between relative equivariance error, learning gradients, and model parameters.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes