IR · CL · Sep 18, 2024

FLARE: Fusing Language Models and Collaborative Architectures for Recommender Enhancement

arXiv:2409.11699v2 · 5 citations · h-index: 8
AI Analysis

This work addresses the need for more effective and scalable recommender systems for web applications. Its contribution is incremental: it builds on existing methods through further tuning and a hybrid architecture.

The paper makes two contributions toward improving recommender systems. First, it shows that a tuned Bert4Rec baseline outperforms previously reported Bert4Rec numbers and, on some datasets, is competitive with state-of-the-art models. Second, it introduces Flare, a hybrid model that combines a language model with collaborative filtering, and reports new competitive results, including on a dataset with a significantly larger item vocabulary.

Recent proposals in recommender systems represent items with their textual description, using a large language model. They show better results on standard benchmarks compared to an item ID-only model, such as Bert4Rec. In this work, we revisit the often-used Bert4Rec baseline and show that with further tuning, Bert4Rec significantly outperforms previously reported numbers, and in some datasets, is competitive with state-of-the-art models. With revised baselines for item ID-only models, this paper also establishes new competitive results for architectures that combine IDs and textual descriptions. We demonstrate this with Flare (Fusing Language models and collaborative Architectures for Recommender Enhancement). Flare is a novel hybrid sequence recommender that integrates a language model with a collaborative filtering model using a Perceiver network. Prior studies focus evaluation on datasets with limited-corpus size, but many commercially-applicable recommender systems common on the web must handle larger corpora. We evaluate Flare on a more realistic dataset with a significantly larger item vocabulary, introducing new baselines for this setting. This paper also showcases Flare's inherent ability to support critiquing, enabling users to provide feedback and refine recommendations. We leverage critiquing as an evaluation method to assess the model's language understanding and its transferability to the recommendation task.

