LGMLOct 20, 2025

Symmetries in PAC-Bayesian Learning

arXiv:2510.17303v11 citationsh-index: 1
Originality Incremental advance
AI Analysis

This work provides theoretical evidence for the benefits of symmetric models in broader real-world settings, though it is incremental as it builds on the PAC-Bayes framework.

The paper tackles the limited theoretical guarantees for symmetries in machine learning by extending generalization bounds to non-compact symmetries and non-invariant data distributions, demonstrating improved guarantees on a rotated MNIST dataset.

Symmetries are known to improve the empirical performance of machine learning models, yet theoretical guarantees explaining these gains remain limited. Prior work has focused mainly on compact group symmetries and often assumes that the data distribution itself is invariant, an assumption rarely satisfied in real-world applications. In this work, we extend generalization guarantees to the broader setting of non-compact symmetries, such as translations and to non-invariant data distributions. Building on the PAC-Bayes framework, we adapt and tighten existing bounds, demonstrating the approach on McAllester's PAC-Bayes bound while showing that it applies to a wide range of PAC-Bayes bounds. We validate our theory with experiments on a rotated MNIST dataset with a non-uniform rotation group, where the derived guarantees not only hold but also improve upon prior results. These findings provide theoretical evidence that, for symmetric data, symmetric models are preferable beyond the narrow setting of compact groups and invariant distributions, opening the way to a more general understanding of symmetries in machine learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes