IRAIFeb 6

Sequences as Nodes for Contrastive Multimodal Graph Recommendation

arXiv:2602.07208v11 citationsh-index: 1Has Code
Originality Incremental advance
AI Analysis

This work addresses cold-start and sparsity issues in recommender systems, offering a novel method that is incremental over existing multimodal and sequential techniques.

The paper tackles cold-start and data sparsity in recommender systems by proposing MuSICRec, a multi-view graph-based model that combines collaborative, sequential, and multimodal signals, achieving state-of-the-art performance on Amazon datasets with significant gains for short-history users.

To tackle cold-start and data sparsity issues in recommender systems, numerous multimodal, sequential, and contrastive techniques have been proposed. While these augmentations can boost recommendation performance, they tend to add noise and disrupt useful semantics. To address this, we propose MuSICRec (Multimodal Sequence-Item Contrastive Recommender), a multi-view graph-based recommender that combines collaborative, sequential, and multimodal signals. We build a sequence-item (SI) view by attention pooling over the user's interacted items to form sequence nodes. We propagate over the SI graph, obtaining a second view organically as an alternative to artificial data augmentation, while simultaneously injecting sequential context signals. Additionally, to mitigate modality noise and align the multimodal information, the contribution of text and visual features is modulated according to an ID-guided gate. We evaluate under a strict leave-two-out split against a broad range of sequential, multimodal, and contrastive baselines. On the Amazon Baby, Sports, and Electronics datasets, MuSICRec outperforms state-of-the-art baselines across all model types. We observe the largest gains for short-history users, mitigating sparsity and cold-start challenges. Our code is available at https://anonymous.4open.science/r/MuSICRec-3CEE/ and will be made publicly available.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes