LG AINov 25, 2025

Short-Range Oversquashing

Yaaqov Mishayev, Yonatan Sverdlov, Tal Amir, Nadav Dym

arXiv:2511.20406v12 citations

Originality Highly original

AI Analysis

This work addresses the problem of oversquashing in graph learning for researchers and practitioners, showing that existing MPNN solutions like virtual nodes are insufficient, positioning transformers as a more effective alternative.

The paper demonstrates that oversquashing in Message Passing Neural Networks (MPNNs) occurs not only in long-range tasks but also in short-range problems, revealing two distinct mechanisms: a bottleneck phenomenon in low-range settings and a vanishing gradient phenomenon in long-range tasks.

Message Passing Neural Networks (MPNNs) are widely used for learning on graphs, but their ability to process long-range information is limited by the phenomenon of oversquashing. This limitation has led some researchers to advocate Graph Transformers as a better alternative, whereas others suggest that it can be mitigated within the MPNN framework, using virtual nodes or other rewiring techniques. In this work, we demonstrate that oversquashing is not limited to long-range tasks, but can also arise in short-range problems. This observation allows us to disentangle two distinct mechanisms underlying oversquashing: (1) the bottleneck phenomenon, which can arise even in low-range settings, and (2) the vanishing gradient phenomenon, which is closely associated with long-range tasks. We further show that the short-range bottleneck effect is not captured by existing explanations for oversquashing, and that adding virtual nodes does not resolve it. In contrast, transformers do succeed in such tasks, positioning them as the more compelling solution to oversquashing, compared to specialized MPNNs.

View on arXiv PDF

Similar