IRLGMLApr 12, 2018

DeepFM: An End-to-End Wide & Deep Learning Framework for CTR Prediction

arXiv:1804.04950v266 citations
Originality Incremental advance
AI Analysis

This addresses CTR prediction for recommender systems, offering a practical improvement over existing methods, though it is incremental as it builds on the Wide & Deep model.

The paper tackles the problem of learning feature interactions for click-through rate (CTR) prediction in recommender systems by proposing DeepFM, an end-to-end framework that combines factorization machines and deep learning to handle both low- and high-order interactions without feature engineering. Results show DeepFM-D achieves over 10% CTR improvement in online A/B tests compared to a logistic regression model.

Learning sophisticated feature interactions behind user behaviors is critical in maximizing CTR for recommender systems. Despite great progress, existing methods have a strong bias towards low- or high-order interactions, or rely on expertise feature engineering. In this paper, we show that it is possible to derive an end-to-end learning model that emphasizes both low- and high-order feature interactions. The proposed framework, DeepFM, combines the power of factorization machines for recommendation and deep learning for feature learning in a new neural network architecture. Compared to the latest Wide & Deep model from Google, DeepFM has a shared raw feature input to both its "wide" and "deep" components, with no need of feature engineering besides raw features. DeepFM, as a general learning framework, can incorporate various network architectures in its deep component. In this paper, we study two instances of DeepFM where its "deep" component is DNN and PNN respectively, for which we denote as DeepFM-D and DeepFM-P. Comprehensive experiments are conducted to demonstrate the effectiveness of DeepFM-D and DeepFM-P over the existing models for CTR prediction, on both benchmark data and commercial data. We conduct online A/B test in Huawei App Market, which reveals that DeepFM-D leads to more than 10% improvement of click-through rate in the production environment, compared to a well-engineered LR model. We also covered related practice in deploying our framework in Huawei App Market.

Code Implementations8 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes