LGAIMLOct 9, 2025

gLSTM: Mitigating Over-Squashing by Increasing Storage Capacity

arXiv:2510.08450v24 citationsh-index: 6
Originality Incremental advance
AI Analysis

This addresses a known bottleneck in GNNs for applications requiring information propagation over large graphs, though it is an incremental adaptation of existing sequence modeling ideas.

The paper tackles the problem of over-squashing in Graph Neural Networks (GNNs) by introducing a new architecture, gLSTM, which improves storage capacity, and demonstrates strong performance on synthetic and real-world benchmarks.

Graph Neural Networks (GNNs) leverage the graph structure to transmit information between nodes, typically through the message-passing mechanism. While these models have found a wide variety of applications, they are known to suffer from over-squashing, where information from a large receptive field of node representations is collapsed into a single fixed sized vector, resulting in an information bottleneck. In this paper, we re-examine the over-squashing phenomenon through the lens of model storage and retrieval capacity, which we define as the amount of information that can be stored in a node's representation for later use. We study some of the limitations of existing tasks used to measure over-squashing and introduce a new synthetic task to demonstrate that an information bottleneck can saturate this capacity. Furthermore, we adapt ideas from the sequence modeling literature on associative memories, fast weight programmers, and the xLSTM model to develop a novel GNN architecture with improved capacity. We demonstrate strong performance of this architecture both on our capacity synthetic task, as well as a range of real-world graph benchmarks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes