LGApr 26

Quasi-Equivariant Metanetworks

Viet-Hoang Tran, An Nguyen, Benoît Guérand, Thieu N. Vo, Tan M. Nguyen

arXiv:2604.2372054.7

AI Analysis

For researchers designing metanetworks for weight-space learning, this work addresses the rigidity of strict equivariance, enabling more expressive models without sacrificing functional symmetry.

Metanetworks that process pretrained weights often ignore functional identity due to non-injective parameter-function mapping. The authors propose quasi-equivariant metanetworks that relax strict equivariance to improve expressivity while preserving functional identity, achieving better trade-offs across feedforward, convolutional, and transformer architectures.

Metanetworks are neural architectures designed to operate directly on pretrained weights to perform downstream tasks. However, the parameter space serves only as a proxy for the underlying function class, and the parameter-function mapping is inherently non-injective: distinct parameter configurations may yield identical input-output behaviors. As a result, metanetworks that rely solely on raw parameters risk overlooking the intrinsic symmetries of the architecture. Reasoning about functional identity is therefore essential for effective metanetwork design, motivating the development of equivariant metanetworks, which incorporate equivariance principles to respect architectural symmetries. Existing approaches, however, typically enforce strict equivariance, which imposes rigid constraints and often leads to sparse and less expressive models. To address this limitation, we introduce the novel concept of quasi-equivariance, which allows metanetworks to move beyond the rigidity of strict equivariance while still preserving functional identity. We lay down a principled basis for this framework and demonstrate its broad applicability across diverse neural architectures, including feedforward, convolutional, and transformer networks. Through empirical evaluation, we show that quasi-equivariant metanetworks achieve good trade-offs between symmetry preservation and representational expressivity. These findings advance the theoretical understanding of weight-space learning and provide a principled foundation for the design of more expressive and functionally robust metanetworks.

View on arXiv PDF

Similar