IR | AI | Oct 17, 2024

Preference Diffusion for Recommendation

Shuo Liu, An Zhang, Guoqing Hu, Hong Qian, Tat-seng Chua

arXiv:2410.13117v28.111 citations | h-index: 28 | Has Code | ICLR

Originality: Incremental advance

AI Analysis

This work addresses the challenge of better aligning diffusion models with personalized ranking tasks in recommender systems, representing an incremental improvement over existing methods.

The paper targets personalized ranking in diffusion model-based recommender systems. It proposes PreferDiff, a tailored optimization objective that transforms Bayesian Personalized Ranking (BPR) into a log-likelihood ranking objective and integrates multiple negative samples. The authors report superior recommendation performance across three benchmarks.

Recommender systems predict personalized item rankings based on user preference distributions derived from historical behavior data. Recently, diffusion models (DMs) have gained attention in recommendation for their ability to model complex distributions. Yet current DM-based recommenders often rely on traditional objectives such as mean squared error (MSE) or on standard recommendation objectives, which are either not optimized for personalized ranking tasks or fail to fully leverage DMs' generative potential. To address this, we propose PreferDiff, a tailored optimization objective for DM-based recommenders. PreferDiff transforms Bayesian Personalized Ranking (BPR) into a log-likelihood ranking objective and integrates multiple negative samples to better capture user preferences. Specifically, we employ variational inference to handle the intractability by minimizing a variational upper bound, and we replace MSE with cosine error to improve alignment with recommendation tasks. Finally, we balance learning generation and preference to enhance the training stability of DMs. PreferDiff offers three key benefits: (i) it is the first personalized ranking loss designed specifically for DM-based recommenders; (ii) it improves ranking and achieves faster convergence by addressing hard negatives; and (iii) we prove that it is theoretically connected to Direct Preference Optimization, which indicates that it has the potential to align user preferences in DM-based recommenders via generative modeling. Extensive experiments across three benchmarks validate its superior recommendation performance and commendable general sequential recommendation capabilities. Our code is available at https://github.com/lswhim/PreferDiff.
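To make the ingredients named in the abstract concrete, the following is a minimal sketch of how a cosine reconstruction error, a BPR-style log-sigmoid ranking term over several negatives, and a balancing weight could be combined. It is an illustration under stated assumptions, not the paper's actual loss: the function names, the averaging over negatives, the weight `lam`, and the use of plain vectors in place of denoiser outputs are all hypothetical, and the variational bound is not modeled. The official implementation is in the repository linked above.

```python
import math

def cosine_error(pred, target):
    """Return 1 - cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(p * t for p, t in zip(pred, target))
    norm = math.sqrt(sum(p * p for p in pred)) * math.sqrt(sum(t * t for t in target))
    return 1.0 - dot / norm

def log_sigmoid(x):
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))

def preference_loss_sketch(pred_pos, pos, pred_negs, negs, lam=0.5):
    """Hypothetical combination of a generation term and a ranking term.

    pred_pos / pos: predicted and true embedding of the positive item.
    pred_negs / negs: predicted and true embeddings of the negative items.
    lam: assumed weight balancing generation (lam) against preference (1 - lam).
    """
    # Generation term: cosine error on the positive item (stands in for MSE).
    pos_err = cosine_error(pred_pos, pos)
    # Average cosine error over all negatives (an assumed aggregation choice).
    neg_err = sum(cosine_error(p, n) for p, n in zip(pred_negs, negs)) / len(negs)
    # BPR-style ranking term: small when the positive is reconstructed
    # better (lower error) than the negatives.
    ranking = -log_sigmoid(neg_err - pos_err)
    return lam * pos_err + (1.0 - lam) * ranking

# Toy 2-d embeddings: the positive is reconstructed exactly, the negative poorly.
loss = preference_loss_sketch([1.0, 0.0], [1.0, 0.0], [[0.0, 1.0]], [[1.0, 0.0]])
print(loss)
```

In this sketch, a lower reconstruction error on the positive item than on the negatives drives the ranking term toward zero, while `lam` controls how much of the plain generation objective is retained.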
