LGAIMLMar 18, 2024

Graph Neural Networks for Learning Equivariant Representations of Neural Networks

arXiv:2403.12143v359 citationsh-index: 67Has CodeICLR
Originality Highly original
AI Analysis

This addresses the challenge of processing neural network parameters with permutation symmetry for researchers and practitioners in machine learning, offering a novel approach that is not incremental.

The paper tackles the problem of learning from neural network parameters by representing them as computational graphs, enabling a single model to handle diverse architectures and outperforming state-of-the-art methods in tasks like classification, editing, and generalization prediction.

Neural networks that process the parameters of other neural networks find applications in domains as diverse as classifying implicit neural representations, generating neural network weights, and predicting generalization errors. However, existing approaches either overlook the inherent permutation symmetry in the neural network or rely on intricate weight-sharing patterns to achieve equivariance, while ignoring the impact of the network architecture itself. In this work, we propose to represent neural networks as computational graphs of parameters, which allows us to harness powerful graph neural networks and transformers that preserve permutation symmetry. Consequently, our approach enables a single model to encode neural computational graphs with diverse architectures. We showcase the effectiveness of our method on a wide range of tasks, including classification and editing of implicit neural representations, predicting generalization performance, and learning to optimize, while consistently outperforming state-of-the-art methods. The source code is open-sourced at https://github.com/mkofinas/neural-graphs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes