LGMLMay 27, 2020

General-Purpose User Embeddings based on Mobile App Usage

arXiv:2005.13303v142 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of reducing manual feature engineering for user modeling in mobile apps, benefiting applications like advertising and recommendations, but it is incremental as it builds on existing neural network approaches.

The paper tackles user modeling from mobile app usage by proposing an AutoEncoder-coupled Transformer Network (AETN) to overcome challenges like heterogeneous behaviors and data sparsity, achieving effective user embeddings deployed at Tencent with demonstrated performance in downstream applications.

In this paper, we report our recent practice at Tencent for user modeling based on mobile app usage. User behaviors on mobile app usage, including retention, installation, and uninstallation, can be a good indicator for both long-term and short-term interests of users. For example, if a user installs Snapseed recently, she might have a growing interest in photographing. Such information is valuable for numerous downstream applications, including advertising, recommendations, etc. Traditionally, user modeling from mobile app usage heavily relies on handcrafted feature engineering, which requires onerous human work for different downstream applications, and could be sub-optimal without domain experts. However, automatic user modeling based on mobile app usage faces unique challenges, including (1) retention, installation, and uninstallation are heterogeneous but need to be modeled collectively, (2) user behaviors are distributed unevenly over time, and (3) many long-tailed apps suffer from serious sparsity. In this paper, we present a tailored AutoEncoder-coupled Transformer Network (AETN), by which we overcome these challenges and achieve the goals of reducing manual efforts and boosting performance. We have deployed the model at Tencent, and both online/offline experiments from multiple domains of downstream applications have demonstrated the effectiveness of the output user embeddings.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes