MLIRLGApr 21, 2018

Entire Space Multi-Task Model: An Effective Approach for Estimating Post-Click Conversion Rate

arXiv:1804.07931v2519 citations
Originality Incremental advance
AI Analysis

This work solves a critical problem for industrial ranking systems by enabling more reliable CVR predictions, though it is incremental in its approach.

The paper tackles the problem of accurately estimating post-click conversion rate (CVR) in recommendation and advertising systems by addressing sample selection bias and data sparsity, resulting in significant performance improvements over competitive methods on a dataset from Taobao's recommender system.

Estimating post-click conversion rate (CVR) accurately is crucial for ranking systems in industrial applications such as recommendation and advertising. Conventional CVR modeling applies popular deep learning methods and achieves state-of-the-art performance. However it encounters several task-specific problems in practice, making CVR modeling challenging. For example, conventional CVR models are trained with samples of clicked impressions while utilized to make inference on the entire space with samples of all impressions. This causes a sample selection bias problem. Besides, there exists an extreme data sparsity problem, making the model fitting rather difficult. In this paper, we model CVR in a brand-new perspective by making good use of sequential pattern of user actions, i.e., impression -> click -> conversion. The proposed Entire Space Multi-task Model (ESMM) can eliminate the two problems simultaneously by i) modeling CVR directly over the entire space, ii) employing a feature representation transfer learning strategy. Experiments on dataset gathered from Taobao's recommender system demonstrate that ESMM significantly outperforms competitive methods. We also release a sampling version of this dataset to enable future research. To the best of our knowledge, this is the first public dataset which contains samples with sequential dependence of click and conversion labels for CVR modeling.

Code Implementations6 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes