LGMar 29

TMTE: Effective Multimodal Graph Learning with Task-aware Modality and Topology Co-evolution

arXiv:2603.2772375.21 citationsh-index: 17Has Code
Predicted impact top 20% in LG · last 90 daysOriginality Incremental advance
AI Analysis

For researchers in multimodal graph learning, TMTE provides a novel framework that consistently outperforms existing methods on diverse tasks, addressing a known bottleneck of task-agnostic graph structures.

TMTE addresses the problem of suboptimal graph topology in multimodal-attributed graphs by jointly evolving topology and representations toward the target task, achieving state-of-the-art performance across 9 MAG datasets and 6 tasks.

Multimodal-attributed graphs (MAGs) are a fundamental data structure for multimodal graph learning (MGL), enabling both graph-centric and modality-centric tasks. However, our empirical analysis reveals inherent topology quality limitations in real-world MAGs, including noisy interactions, missing connections, and task-agnostic relational structures. A single graph derived from generic relationships is therefore unlikely to be universally optimal for diverse downstream tasks. To address this challenge, we propose Task-aware Modality and Topology co-Evolution (TMTE), a novel MGL framework that jointly and iteratively optimizes graph topology and multimodal representations toward the target task. TMTE is motivated by the bidirectional coupling between modality and topology: multimodal attributes induce relational structures, while graph topology shapes modality representations. Concretely, TMTE casts topology evolution as multi-perspective metric learning over modality embeddings with an anchor-based approximation, and formulates modality evolution as smoothness-regularized fusion with cross-modal alignment, yielding a closed-loop task-aware co-evolution process. Extensive experiments on 9 MAG datasets and 1 non-graph multimodal dataset across 6 graph-centric and modality-centric tasks show that TMTE consistently achieves state-of-the-art performance. Our code is available at https://anonymous.4open.science/r/TMTE-1873.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes