LGAISep 8, 2025

Riemannian Batch Normalization: A Gyro Approach

arXiv:2509.07115v25 citationsh-index: 30Has Code
Originality Incremental advance
AI Analysis

This work addresses the need for principled normalization layers in deep learning for non-Euclidean data, which is incremental as it builds on existing Riemannian methods.

The authors tackled the problem of extending batch normalization to data on Riemannian manifolds by introducing GyroBN, a framework for gyrogroups, and demonstrated its effectiveness across seven geometries including the Grassmannian and constant curvature spaces.

Normalization layers are crucial for deep learning, but their Euclidean formulations are inadequate for data on manifolds. On the other hand, many Riemannian manifolds in machine learning admit gyro-structures, enabling principled extensions of Euclidean neural networks to non-Euclidean domains. Inspired by this, we introduce GyroBN, a principled Riemannian batch normalization framework for gyrogroups. We establish two necessary conditions, namely \emph{pseudo-reduction} and \emph{gyroisometric gyrations}, that guarantee GyroBN with theoretical control over sample statistics, and show that these conditions hold for all known gyrogroups in machine learning. Our framework also incorporates several existing Riemannian normalization methods as special cases. We further instantiate GyroBN on seven representative geometries, including the Grassmannian, five constant curvature spaces, and the correlation manifold, and derive novel gyro and Riemannian structures to enable these instantiations. Experiments across these geometries demonstrate the effectiveness of GyroBN. The code is available at https://github.com/GitZH-Chen/GyroBN.git.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes