IR AIMay 18

SynGR: Unleashing the Potential of Cross-Modal Synergy for Generative Recommendation

Wei Chen, Xingyu Guo, Shuang Li, Fuwei Zhang, Meng Yuan, Jing Fan, Zhao Zhang, Deqing Wang, Fuzhen Zhuang

arXiv:2605.1892031.6

Predicted impact top 9% in IR · last 90 daysOriginality Incremental advance

AI Analysis

For recommendation systems, this work addresses the underexplored role of cross-modal synergy in generative models, moving beyond alignment-centric fusion to improve item representation and user preference modeling.

SynGR introduces a generative recommendation framework that explicitly leverages cross-modal synergistic information to capture emergent item properties beyond single-modality signals, achieving superior performance on three benchmark datasets.

Generative Recommendation (GR) has emerged as a promising paradigm by formulating item recommendation as a sequence-to-sequence generation task over item identifiers. Recent studies have incorporated multimodal signals to provide richer token-level evidence for generation. However, existing approaches largely rely on alignment-centric fusion and underexplore synergistic information across modalities. In practice, synergistic information plays a critical role in capturing emergent item properties that cannot be inferred from any single modality alone. Such properties encode intrinsic item semantics and guide user preferences, enabling models to move beyond surface-level feature matching. To address this limitation, we propose \textbf{SynGR}, a synergistic generative recommendation framework that explicitly encourages the exploitation of cross-modal dependencies during generation. By constraining overreliance on dominant modalities, SynGR enables the model to capture emergent item semantics beyond shared or modality-specific signals. Extensive experiments across three benchmark datasets demonstrate that SynGR achieves superior performance.

View on arXiv PDF

Similar