LG MLDec 14, 2024

Multi-Class and Multi-Task Strategies for Neural Directed Link Prediction

Claudio Moroni, Claudio Borile, Carolina Mattsson, Michele Starnini, André Panisson

arXiv:2412.10895v12.6h-index: 6Has CodeECML/PKDD

Originality Incremental advance

AI Analysis

This addresses a specific bottleneck in graph representation learning for applications like knowledge graph completion, though it is incremental in improving existing neural methods.

The paper tackles the problem of directed link prediction in graphs, where existing methods often fail to handle directionality and bidirectionality across all sub-tasks, by proposing multi-class and multi-task strategies that outperform traditional approaches on multiple datasets.

Link Prediction is a foundational task in Graph Representation Learning, supporting applications like link recommendation, knowledge graph completion and graph generation. Graph Neural Networks have shown the most promising results in this domain and are currently the de facto standard approach to learning from graph data. However, a key distinction exists between Undirected and Directed Link Prediction: the former just predicts the existence of an edge, while the latter must also account for edge directionality and bidirectionality. This translates to Directed Link Prediction (DLP) having three sub-tasks, each defined by how training, validation and test sets are structured. Most research on DLP overlooks this trichotomy, focusing solely on the "existence" sub-task, where training and test sets are random, uncorrelated samples of positive and negative directed edges. Even in the works that recognize the aforementioned trichotomy, models fail to perform well across all three sub-tasks. In this study, we experimentally demonstrate that training Neural DLP (NDLP) models only on the existence sub-task, using methods adapted from Neural Undirected Link Prediction, results in parameter configurations that fail to capture directionality and bidirectionality, even after rebalancing edge classes. To address this, we propose three strategies that handle the three tasks simultaneously. Our first strategy, the Multi-Class Framework for Neural Directed Link Prediction (MC-NDLP) maps NDLP to a Multi-Class training objective. The second and third approaches adopt a Multi-Task perspective, either with a Multi-Objective (MO-DLP) or a Scalarized (S-DLP) strategy. Our results show that these methods outperform traditional approaches across multiple datasets and models, achieving equivalent or superior performance in addressing the three DLP sub-tasks.

View on arXiv PDF Code

Similar