LGAIDSNEFeb 13

Which Algorithms Can Graph Neural Networks Learn?

arXiv:2602.13106v11 citationsh-index: 12
Originality Incremental advance
AI Analysis

This work addresses the challenge of integrating algorithmic reasoning into neural pipelines for researchers in machine learning and AI, offering formal guarantees that are incremental over prior empirical or expressivity-focused studies.

The paper tackles the problem of understanding when graph neural networks (MPNNs) can learn discrete algorithms from small training instances and generalize to arbitrary-sized inputs, establishing a theoretical framework that provides sufficient conditions for learning algorithms like shortest paths and minimum spanning trees, with impossibility results for some tasks and refined analysis for Bellman-Ford, supported by empirical validation.

In recent years, there has been growing interest in understanding neural architectures' ability to learn to execute discrete algorithms, a line of work often referred to as neural algorithmic reasoning. The goal is to integrate algorithmic reasoning capabilities into larger neural pipelines. Many such architectures are based on (message-passing) graph neural networks (MPNNs), owing to their permutation equivariance and ability to deal with sparsity and variable-sized inputs. However, existing work is either largely empirical and lacks formal guarantees or it focuses solely on expressivity, leaving open the question of when and how such architectures generalize beyond a finite training set. In this work, we propose a general theoretical framework that characterizes the sufficient conditions under which MPNNs can learn an algorithm from a training set of small instances and provably approximate its behavior on inputs of arbitrary size. Our framework applies to a broad class of algorithms, including single-source shortest paths, minimum spanning trees, and general dynamic programming problems, such as the $0$-$1$ knapsack problem. In addition, we establish impossibility results for a wide range of algorithmic tasks, showing that standard MPNNs cannot learn them, and we derive more expressive MPNN-like architectures that overcome these limitations. Finally, we refine our analysis for the Bellman-Ford algorithm, yielding a substantially smaller required training set and significantly extending the recent work of Nerem et al. [2025] by allowing for a differentiable regularization loss. Empirical results largely support our theoretical findings.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes