IR CLMay 19, 2024

EmbSum: Leveraging the Summarization Capabilities of Large Language Models for Content-Based Recommendations

Chiyu Zhang, Yifei Sun, Minghao Wu, Jun Chen, Jie Lei, Muhammad Abdul-Mageed, Rong Jin, Angli Liu, Ji Zhu, Sem Park, Ning Yao, Bo Long

arXiv:2405.11441v29.213 citationsh-index: 19RecSys

Originality Incremental advance

AI Analysis

This work addresses personalized content recommendation for users in digital platforms, representing an incremental improvement through a novel hybrid method.

The paper tackles the problem of content-based recommendation by introducing EmbSum, a framework that uses large language models to summarize user engagement histories and compute embeddings, achieving higher accuracy with fewer parameters than state-of-the-art methods on two datasets.

Content-based recommendation systems play a crucial role in delivering personalized content to users in the digital world. In this work, we introduce EmbSum, a novel framework that enables offline pre-computations of users and candidate items while capturing the interactions within the user engagement history. By utilizing the pretrained encoder-decoder model and poly-attention layers, EmbSum derives User Poly-Embedding (UPE) and Content Poly-Embedding (CPE) to calculate relevance scores between users and candidate items. EmbSum actively learns the long user engagement histories by generating user-interest summary with supervision from large language model (LLM). The effectiveness of EmbSum is validated on two datasets from different domains, surpassing state-of-the-art (SoTA) methods with higher accuracy and fewer parameters. Additionally, the model's ability to generate summaries of user interests serves as a valuable by-product, enhancing its usefulness for personalized content recommendations.

View on arXiv PDF

Similar