IRAILGMLDec 11, 2024

Preference Discerning with LLM-Enhanced Generative Retrieval

arXiv:2412.08604v20.149 citationsh-index: 13Has CodeTrans. Mach. Learn. Res.
AI Analysis85

This addresses the issue of echo chambers and inflexibility in recommendation systems for users, though it is incremental as it builds on existing generative models.

The paper tackles the problem of sequential recommendation models' inability to dynamically adapt to changing user preferences by proposing a new paradigm called preference discerning, which conditions generative models on natural language preferences, and introduces a method named Mender that achieves state-of-the-art performance on a novel benchmark.

In sequential recommendation, models recommend items based on user's interaction history. To this end, current models usually incorporate information such as item descriptions and user intent or preferences. User preferences are usually not explicitly given in open-source datasets, and thus need to be approximated, for example via large language models (LLMs). Current approaches leverage approximated user preferences only during training and rely solely on the past interaction history for recommendations, limiting their ability to dynamically adapt to changing preferences, potentially reinforcing echo chambers. To address this issue, we propose a new paradigm, namely preference discerning, which explicitly conditions a generative recommendation model on user preferences in natural language within its context. To evaluate preference discerning, we introduce a novel benchmark that provides a holistic evaluation across various scenarios, including preference steering and sentiment following. Upon evaluating current state-of-the-art methods on our benchmark, we discover that their ability to dynamically adapt to evolving user preferences is limited. To address this, we propose a new method named Mender ($\textbf{M}$ultimodal Prefer$\textbf{en}$ce $\textbf{D}$iscern$\textbf{er}$), which achieves state-of-the-art performance in our benchmark. Our results show that Mender effectively adapts its recommendation guided by human preferences, even if not observed during training, paving the way toward more flexible recommendation models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes