IR LGMay 20, 2020

M2GRL: A Multi-task Multi-view Graph Representation Learning Framework for Web-scale Recommender Systems

Menghan Wang, Yujie Lin, Guli Lin, Keping Yang, Xiao-ming Wu

arXiv:2005.10110v317.393 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses scalability and bias issues in industrial recommender systems by enabling more effective use of abundant multi-view data, though it is incremental as it builds on existing multi-view and graph learning trends.

The paper tackles the problem of integrating multi-view data in graph representation learning for web-scale recommender systems by proposing M2GRL, a multi-task multi-view framework that uses representation alignment instead of fusion, resulting in significant performance improvements as shown in offline metrics and online A/B tests at Taobao with training on 57 billion examples.

Combining graph representation learning with multi-view data (side information) for recommendation is a trend in industry. Most existing methods can be categorized as \emph{multi-view representation fusion}; they first build one graph and then integrate multi-view data into a single compact representation for each node in the graph. However, these methods are raising concerns in both engineering and algorithm aspects: 1) multi-view data are abundant and informative in industry and may exceed the capacity of one single vector, and 2) inductive bias may be introduced as multi-view data are often from different distributions. In this paper, we use a \emph{multi-view representation alignment} approach to address this issue. Particularly, we propose a multi-task multi-view graph representation learning framework (M2GRL) to learn node representations from multi-view graphs for web-scale recommender systems. M2GRL constructs one graph for each single-view data, learns multiple separate representations from multiple graphs, and performs alignment to model cross-view relations. M2GRL chooses a multi-task learning paradigm to learn intra-view representations and cross-view relations jointly. Besides, M2GRL applies homoscedastic uncertainty to adaptively tune the loss weights of tasks during training. We deploy M2GRL at Taobao and train it on 57 billion examples. According to offline metrics and online A/B tests, M2GRL significantly outperforms other state-of-the-art algorithms. Further exploration on diversity recommendation in Taobao shows the effectiveness of utilizing multiple representations produced by \method{}, which we argue is a promising direction for various industrial recommendation tasks of different focus.

View on arXiv PDF Code

Similar