LGJun 17, 2025

Evaluating Loss Functions for Graph Neural Networks: Towards Pretraining and Generalization

Khushnood Abbas, Ruizhe Hou, Zhou Wengang, Dong Shi, Niu Ling, Satyaki Nan, Alireza Abbasi

arXiv:2506.14114v14.1h-index: 9

Originality Synthesis-oriented

AI Analysis

This work addresses the problem of selecting optimal model-loss combinations for GNNs, providing empirical insights for researchers and practitioners, though it is incremental as it focuses on systematic evaluation rather than introducing new methods.

The paper conducted a large-scale evaluation of seven GNN architectures and 30 loss functions across three real-world datasets, finding that hybrid loss functions generally yield superior and robust performance in inductive settings, with GIN architecture showing the highest average performance, especially with Cross-Entropy loss.

Graph Neural Networks (GNNs) became useful for learning on non-Euclidean data. However, their best performance depends on choosing the right model architecture and the training objective, also called the loss function. Researchers have studied these parts separately, but a large-scale evaluation has not looked at how GNN models and many loss functions work together across different tasks. To fix this, we ran a thorough study - it included seven well-known GNN architectures. We also used a large group of 30 single plus mixed loss functions. The study looked at both inductive and transductive settings. Our evaluation spanned three distinct real-world datasets, assessing performance in both inductive and transductive settings using 21 comprehensive evaluation metrics. From these extensive results (detailed in supplementary information 1 \& 2), we meticulously analyzed the top ten model-loss combinations for each metric based on their average rank. Our findings reveal that, especially for the inductive case: 1) Hybrid loss functions generally yield superior and more robust performance compared to single loss functions, indicating the benefit of multi-objective optimization. 2) The GIN architecture always showed the highest-level average performance, especially with Cross-Entropy loss. 3) Although some combinations had overall lower average ranks, models such as GAT, particularly with certain hybrid losses, demonstrated incredible specialized strengths, maximizing the most top-1 results among the individual metrics, emphasizing subtle strengths for particular task demands. 4) On the other hand, the MPNN architecture typically lagged behind the scenarios it was tested against.

View on arXiv PDF

Similar