LGFeb 11, 2025

Effects of Dropout on Performance in Long-range Graph Learning Tasks

Jasraj Singh, Keyue Jiang, Brooks Paige, Laura Toni

arXiv:2502.07364v211.42 citationsh-index: 6

Originality Incremental advance

AI Analysis

This addresses a critical gap for researchers and practitioners using deep graph neural networks on tasks requiring long-range interactions, representing an incremental improvement over existing methods.

The paper tackles the problem of over-squashing in Message Passing Neural Networks (MPNNs) for long-range graph learning tasks, showing that existing Dropout-style methods like DropEdge reduce sensitivity to distant nodes and introducing DropSens, a sensitivity-aware variant that outperforms graph rewiring techniques on synthetic and real-world datasets.

Message Passing Neural Networks (MPNNs) are a class of Graph Neural Networks (GNNs) that propagate information across the graph via local neighborhoods. The scheme gives rise to two key challenges: over-smoothing and over-squashing. While several Dropout-style algorithms, such as DropEdge and DropMessage, have successfully addressed over-smoothing, their impact on over-squashing remains largely unexplored. This represents a critical gap in the literature, as failure to mitigate over-squashing would make these methods unsuitable for long-range tasks -- the intended use case of deep MPNNs. In this work, we study the aforementioned algorithms, and closely related edge-dropping algorithms -- DropNode, DropAgg and DropGNN -- in the context of over-squashing. We present theoretical results showing that DropEdge-variants reduce sensitivity between distant nodes, limiting their suitability for long-range tasks. To address this, we introduce DropSens, a sensitivity-aware variant of DropEdge that explicitly controls the proportion of information lost due to edge-dropping, thereby increasing sensitivity to distant nodes despite dropping the same number of edges. Our experiments on long-range synthetic and real-world datasets confirm the predicted limitations of existing edge-dropping and feature-dropping methods. Moreover, DropSens consistently outperforms graph rewiring techniques designed to mitigate over-squashing, suggesting that simple, targeted modifications can substantially improve a model's ability to capture long-range interactions. Our conclusions highlight the need to re-evaluate and re-design existing methods for training deep GNNs, with a renewed focus on modelling long-range interactions.

View on arXiv PDF

Similar