IRAILGNov 15, 2024

KuaiFormer: Transformer-Based Retrieval at Kuaishou

arXiv:2411.10057v115 citationsh-index: 17
Originality Incremental advance
AI Analysis

This work addresses retrieval bottlenecks in industrial recommendation systems for platforms like Kuaishou, though it appears incremental as it adapts existing transformer approaches to a specific domain.

The paper tackles the challenge of deploying transformer models for retrieval in large-scale content recommendation systems by introducing KuaiFormer, which shifts from traditional score estimation to a Next Action Prediction paradigm, resulting in increased average daily usage time for over 400 million daily active users.

In large-scale content recommendation systems, retrieval serves as the initial stage in the pipeline, responsible for selecting thousands of candidate items from billions of options to pass on to ranking modules. Traditionally, the dominant retrieval method has been Embedding-Based Retrieval (EBR) using a Deep Neural Network (DNN) dual-tower structure. However, applying transformer in retrieval tasks has been the focus of recent research, though real-world industrial deployment still presents significant challenges. In this paper, we introduce KuaiFormer, a novel transformer-based retrieval framework deployed in a large-scale content recommendation system. KuaiFormer fundamentally redefines the retrieval process by shifting from conventional score estimation tasks (such as click-through rate estimate) to a transformer-driven Next Action Prediction paradigm. This shift enables more effective real-time interest acquisition and multi-interest extraction, significantly enhancing retrieval performance. KuaiFormer has been successfully integrated into Kuaishou App's short-video recommendation system since May 2024, serving over 400 million daily active users and resulting in a marked increase in average daily usage time of Kuaishou users. We provide insights into both the technical and business aspects of deploying transformer in large-scale recommendation systems, addressing practical challenges encountered during industrial implementation. Our findings offer valuable guidance for engineers and researchers aiming to leverage transformer models to optimize large-scale content recommendation systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes