LGSep 18, 2024

Monomial Matrix Group Equivariant Neural Functional Networks

Viet-Hoang Tran, Thieu N. Vo, Tho H. Tran, An T. Nguyen, Tan M. Nguyen

arXiv:2409.11697v318.215 citationsh-index: 13Has Code

Originality Incremental advance

AI Analysis

This work addresses efficiency and symmetry modeling in neural functional networks, which are used for tasks like predicting network generalization and classifying implicit neural representations, but it appears incremental as it builds directly on prior NFN designs.

The paper tackles the problem that existing neural functional networks (NFNs) ignore weight scaling and sign-flipping symmetries in neural networks, and extends group actions from permutation matrices to monomial matrices to incorporate these symmetries. The result is Monomial-NFN, which has fewer trainable parameters than baseline NFNs, enhancing efficiency while achieving competitive performance.

Neural functional networks (NFNs) have recently gained significant attention due to their diverse applications, ranging from predicting network generalization and network editing to classifying implicit neural representation. Previous NFN designs often depend on permutation symmetries in neural networks' weights, which traditionally arise from the unordered arrangement of neurons in hidden layers. However, these designs do not take into account the weight scaling symmetries of $\ReLU$ networks, and the weight sign flipping symmetries of $\sin$ or $\Tanh$ networks. In this paper, we extend the study of the group action on the network weights from the group of permutation matrices to the group of monomial matrices by incorporating scaling/sign-flipping symmetries. Particularly, we encode these scaling/sign-flipping symmetries by designing our corresponding equivariant and invariant layers. We name our new family of NFNs the Monomial Matrix Group Equivariant Neural Functional Networks (Monomial-NFN). Because of the expansion of the symmetries, Monomial-NFN has much fewer independent trainable parameters compared to the baseline NFNs in the literature, thus enhancing the model's efficiency. Moreover, for fully connected and convolutional neural networks, we theoretically prove that all groups that leave these networks invariant while acting on their weight spaces are some subgroups of the monomial matrix group. We provide empirical evidence to demonstrate the advantages of our model over existing baselines, achieving competitive performance and efficiency.

View on arXiv PDF Code

Similar