IRAICLOct 25, 2023

LlamaRec: Two-Stage Recommendation using Large Language Models for Ranking

arXiv:2311.02089v185 citationsh-index: 15Has Code
Originality Incremental advance
AI Analysis

This work addresses efficiency issues in real-time recommendation systems using LLMs, though it appears incremental as it builds on existing LLM and retrieval methods.

The authors tackled the problem of slow inference in LLM-based recommendation systems by proposing LlamaRec, a two-stage framework that uses sequential recommenders for candidate retrieval and LLMs with verbalizers for ranking, achieving superior performance and efficiency on benchmark datasets.

Recently, large language models (LLMs) have exhibited significant progress in language understanding and generation. By leveraging textual features, customized LLMs are also applied for recommendation and demonstrate improvements across diverse recommendation scenarios. Yet the majority of existing methods perform training-free recommendation that heavily relies on pretrained knowledge (e.g., movie recommendation). In addition, inference on LLMs is slow due to autoregressive generation, rendering existing methods less effective for real-time recommendation. As such, we propose a two-stage framework using large language models for ranking-based recommendation (LlamaRec). In particular, we use small-scale sequential recommenders to retrieve candidates based on the user interaction history. Then, both history and retrieved items are fed to the LLM in text via a carefully designed prompt template. Instead of generating next-item titles, we adopt a verbalizer-based approach that transforms output logits into probability distributions over the candidate items. Therefore, the proposed LlamaRec can efficiently rank items without generating long text. To validate the effectiveness of the proposed framework, we compare against state-of-the-art baseline methods on benchmark datasets. Our experimental results demonstrate the performance of LlamaRec, which consistently achieves superior performance in both recommendation performance and efficiency.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes