IRLGJun 26, 2023

A Collaborative Transfer Learning Framework for Cross-domain Recommendation

arXiv:2306.16425v131 citationsh-index: 52
Originality Incremental advance
AI Analysis

This work solves the challenge of improving click-through rate prediction across multiple business domains in recommendation systems, with significant business impact, though it appears incremental as it builds on existing transfer learning techniques.

The paper tackles the problem of cross-domain recommendation by addressing domain shift and negative transfer, proposing the Collaborative Cross-Domain Transfer Learning Framework (CCTL) that achieved state-of-the-art offline metrics and, when deployed at Meituan, resulted in a 4.37% CTR and 5.43% GMV lift.

In the recommendation systems, there are multiple business domains to meet the diverse interests and needs of users, and the click-through rate(CTR) of each domain can be quite different, which leads to the demand for CTR prediction modeling for different business domains. The industry solution is to use domain-specific models or transfer learning techniques for each domain. The disadvantage of the former is that the data from other domains is not utilized by a single domain model, while the latter leverage all the data from different domains, but the fine-tuned model of transfer learning may trap the model in a local optimum of the source domain, making it difficult to fit the target domain. Meanwhile, significant differences in data quantity and feature schemas between different domains, known as domain shift, may lead to negative transfer in the process of transferring. To overcome these challenges, we propose the Collaborative Cross-Domain Transfer Learning Framework (CCTL). CCTL evaluates the information gain of the source domain on the target domain using a symmetric companion network and adjusts the information transfer weight of each source domain sample using the information flow network. This approach enables full utilization of other domain data while avoiding negative migration. Additionally, a representation enhancement network is used as an auxiliary task to preserve domain-specific features. Comprehensive experiments on both public and real-world industrial datasets, CCTL achieved SOTA score on offline metrics. At the same time, the CCTL algorithm has been deployed in Meituan, bringing 4.37% CTR and 5.43% GMV lift, which is significant to the business.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes