LG IRJun 23, 2022

Rethinking Collaborative Metric Learning: Toward an Efficient Alternative without Negative Sampling

Shilong Bao, Qianqian Xu, Zhiyong Yang, Xiaochun Cao, Qingming Huang

arXiv:2206.11549v124 citationsh-index: 82

Originality Incremental advance

AI Analysis

This work addresses a theoretical flaw in recommendation systems by eliminating sampling bias, offering a more reliable method for practitioners, though it is incremental as it builds on existing CML paradigms.

The authors identified that negative sampling in Collaborative Metric Learning introduces a bias in generalization error, quantified by Total Variance, and proposed Sampling-Free Collaborative Metric Learning (SFCML) as an efficient alternative, which showed superior performance in experiments across seven benchmark datasets.

The recently proposed Collaborative Metric Learning (CML) paradigm has aroused wide interest in the area of recommendation systems (RS) owing to its simplicity and effectiveness. Typically, the existing literature of CML depends largely on the \textit{negative sampling} strategy to alleviate the time-consuming burden of pairwise computation. However, in this work, by taking a theoretical analysis, we find that negative sampling would lead to a biased estimation of the generalization error. Specifically, we show that the sampling-based CML would introduce a bias term in the generalization bound, which is quantified by the per-user \textit{Total Variance} (TV) between the distribution induced by negative sampling and the ground truth distribution. This suggests that optimizing the sampling-based CML loss function does not ensure a small generalization error even with sufficiently large training data. Moreover, we show that the bias term will vanish without the negative sampling strategy. Motivated by this, we propose an efficient alternative without negative sampling for CML named \textit{Sampling-Free Collaborative Metric Learning} (SFCML), to get rid of the sampling bias in a practical sense. Finally, comprehensive experiments over seven benchmark datasets speak to the superiority of the proposed algorithm.

View on arXiv PDF

Similar