IRCLApr 16, 2025

Rethinking LLM-Based Recommendations: A Personalized Query-Driven Parallel Integration

arXiv:2504.11889v23 citationsh-index: 4EMNLP
Originality Incremental advance
AI Analysis

This work addresses efficiency and bias issues in LLM-based recommendation systems for users, though it appears incremental as it builds on existing integration efforts.

The paper tackles challenges in integrating large language models (LLMs) into recommendation systems, such as bias and bottlenecks, by proposing a parallel framework that decouples LLMs from candidate pre-selection, resulting in up to 57% performance improvement and enhanced novelty and diversity.

Recent studies have explored integrating large language models (LLMs) into recommendation systems but face several challenges, including training-induced bias and bottlenecks from serialized architecture. To effectively address these issues, we propose a Query-toRecommendation, a parallel recommendation framework that decouples LLMs from candidate pre-selection and instead enables direct retrieval over the entire item pool. Our framework connects LLMs and recommendation models in a parallel manner, allowing each component to independently utilize its strengths without interfering with the other. In this framework, LLMs are utilized to generate feature-enriched item descriptions and personalized user queries, allowing for capturing diverse preferences and enabling rich semantic matching in a zero-shot manner. To effectively combine the complementary strengths of LLM and collaborative signals, we introduce an adaptive reranking strategy. Extensive experiments demonstrate an improvement in performance up to 57%, while also improving the novelty and diversity of recommendations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes