IRCLMay 19, 2024

EmbSum: Leveraging the Summarization Capabilities of Large Language Models for Content-Based Recommendations

arXiv:2405.11441v213 citationsh-index: 19RecSys
Originality Incremental advance
AI Analysis

This work addresses personalized content recommendation for users in digital platforms, representing an incremental improvement through a novel hybrid method.

The paper tackles the problem of content-based recommendation by introducing EmbSum, a framework that uses large language models to summarize user engagement histories and compute embeddings, achieving higher accuracy with fewer parameters than state-of-the-art methods on two datasets.

Content-based recommendation systems play a crucial role in delivering personalized content to users in the digital world. In this work, we introduce EmbSum, a novel framework that enables offline pre-computations of users and candidate items while capturing the interactions within the user engagement history. By utilizing the pretrained encoder-decoder model and poly-attention layers, EmbSum derives User Poly-Embedding (UPE) and Content Poly-Embedding (CPE) to calculate relevance scores between users and candidate items. EmbSum actively learns the long user engagement histories by generating user-interest summary with supervision from large language model (LLM). The effectiveness of EmbSum is validated on two datasets from different domains, surpassing state-of-the-art (SoTA) methods with higher accuracy and fewer parameters. Additionally, the model's ability to generate summaries of user interests serves as a valuable by-product, enhancing its usefulness for personalized content recommendations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes