IRAIMar 15, 2024

PPM : A Pre-trained Plug-in Model for Click-through Rate Prediction

arXiv:2403.10049v13 citationsh-index: 3WWW
Originality Incremental advance
AI Analysis

This addresses cold-start and efficiency problems in industrial recommender systems, offering an incremental improvement by integrating pre-trained knowledge without heavy latency costs.

The paper tackles the cold-start and efficiency limitations of identity-based CTR prediction models by introducing PPM, a pre-trained plug-in model that uses multi-modal features and large-scale pre-training, achieving improved performance and iteration efficiency without significant latency increases in offline and online tests at JD E-commerce.

Click-through rate (CTR) prediction is a core task in recommender systems. Existing methods (IDRec for short) rely on unique identities to represent distinct users and items that have prevailed for decades. On one hand, IDRec often faces significant performance degradation on cold-start problem; on the other hand, IDRec cannot use longer training data due to constraints imposed by iteration efficiency. Most prior studies alleviate the above problems by introducing pre-trained knowledge(e.g. pre-trained user model or multi-modal embeddings). However, the explosive growth of online latency can be attributed to the huge parameters in the pre-trained model. Therefore, most of them cannot employ the unified model of end-to-end training with IDRec in industrial recommender systems, thus limiting the potential of the pre-trained model. To this end, we propose a $\textbf{P}$re-trained $\textbf{P}$lug-in CTR $\textbf{M}$odel, namely PPM. PPM employs multi-modal features as input and utilizes large-scale data for pre-training. Then, PPM is plugged in IDRec model to enhance unified model's performance and iteration efficiency. Upon incorporating IDRec model, certain intermediate results within the network are cached, with only a subset of the parameters participating in training and serving. Hence, our approach can successfully deploy an end-to-end model without causing huge latency increases. Comprehensive offline experiments and online A/B testing at JD E-commerce demonstrate the efficiency and effectiveness of PPM.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes