IRAIITLGSIApr 22, 2024

Collaborative Filtering Based on Diffusion Models: Unveiling the Potential of High-Order Connectivity

arXiv:2404.14240v143 citationsh-index: 9SIGIR
Originality Incremental advance
AI Analysis

This addresses the need for more accurate collaborative filtering in recommender systems by incorporating high-order connectivity, though it appears incremental as an enhancement to existing diffusion model approaches.

The paper tackles the problem of diffusion model-based recommender systems not leveraging high-order connectivity for collaborative filtering, proposing CF-Diff which uses a cross-attention-guided multi-hop autoencoder to incorporate multi-hop neighbors. The method achieves up to 7.29% improvement over benchmarks on real-world datasets while maintaining computational efficiency.

A recent study has shown that diffusion models are well-suited for modeling the generative process of user-item interactions in recommender systems due to their denoising nature. However, existing diffusion model-based recommender systems do not explicitly leverage high-order connectivities that contain crucial collaborative signals for accurate recommendations. Addressing this gap, we propose CF-Diff, a new diffusion model-based collaborative filtering (CF) method, which is capable of making full use of collaborative signals along with multi-hop neighbors. Specifically, the forward-diffusion process adds random noise to user-item interactions, while the reverse-denoising process accommodates our own learning model, named cross-attention-guided multi-hop autoencoder (CAM-AE), to gradually recover the original user-item interactions. CAM-AE consists of two core modules: 1) the attention-aided AE module, responsible for precisely learning latent representations of user-item interactions while preserving the model's complexity at manageable levels, and 2) the multi-hop cross-attention module, which judiciously harnesses high-order connectivity information to capture enhanced collaborative signals. Through comprehensive experiments on three real-world datasets, we demonstrate that CF-Diff is (a) Superior: outperforming benchmark recommendation methods, achieving remarkable gains up to 7.29% compared to the best competitor, (b) Theoretically-validated: reducing computations while ensuring that the embeddings generated by our model closely approximate those from the original cross-attention, and (c) Scalable: proving the computational efficiency that scales linearly with the number of users or items.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes