ITLGSep 1, 2025

Hierarchical Maximum Entropy via the Renormalization Group

arXiv:2509.01424v1
Originality Highly original
AI Analysis

This work addresses the challenge of multilevel modeling for researchers in probability, information theory, and statistical mechanics, presenting a novel theoretical framework with potential computational benefits.

The paper tackles the problem of modeling hierarchical structures in statistical and machine-learning models by introducing a hierarchical maximum entropy framework, showing that Pareto optimal distributions can be derived using renormalization-group procedures from physics, with computational efficiency improvements in specific invariance settings.

Hierarchical structures, which include multiple levels, are prevalent in statistical and machine-learning models as well as physical systems. Extending the foundational result that the maximum entropy distribution under mean constraints is given by the exponential Gibbs-Boltzmann form, we introduce the framework of "hierarchical maximum entropy" to address these multilevel models. We demonstrate that Pareto optimal distributions, which maximize entropies across all levels of hierarchical transformations, can be obtained via renormalization-group procedures from theoretical physics. This is achieved by formulating multilevel extensions of the Gibbs variational principle and the Donsker-Varadhan variational representation of entropy. Moreover, we explore settings with hierarchical invariances that significantly simplify the renormalization-group procedures, enhancing computational efficiency: quadratic modular loss functions, logarithmic loss functions, and nearest-neighbor loss functions. This is accomplished through the introduction of the concept of parameter flows, which serves as an analog to renormalization flows in renormalization group theory. This work connects ideas from probability theory, information theory, and statistical mechanics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes