IRJan 16, 2022

Learning from Atypical Behavior: Temporary Interest Aware Recommendation Based on Reinforcement Learning

arXiv:2201.05970v11 citations
AI Analysis

This addresses the challenge of personalizing recommendations for users by leveraging atypical interactions that signal temporary interests, which is an incremental improvement over traditional noise-filtering methods.

The paper tackles the problem of distinguishing atypical user-item interactions from noise in recommendation systems by proposing TIARec, a reinforcement learning model that captures both temporary interests and general preferences without supervision, achieving improved recommendation performance on real-world datasets.

Traditional robust recommendation methods view atypical user-item interactions as noise and aim to reduce their impact with some kind of noise filtering technique, which often suffers from two challenges. First, in real world, atypical interactions may signal users' temporary interest different from their general preference. Therefore, simply filtering out the atypical interactions as noise may be inappropriate and degrade the personalization of recommendations. Second, it is hard to acquire the temporary interest since there are no explicit supervision signals to indicate whether an interaction is atypical or not. To address this challenges, we propose a novel model called Temporary Interest Aware Recommendation (TIARec), which can distinguish atypical interactions from normal ones without supervision and capture the temporary interest as well as the general preference of users. Particularly, we propose a reinforcement learning framework containing a recommender agent and an auxiliary classifier agent, which are jointly trained with the objective of maximizing the cumulative return of the recommendations made by the recommender agent. During the joint training process, the classifier agent can judge whether the interaction with an item recommended by the recommender agent is atypical, and the knowledge about learning temporary interest from atypical interactions can be transferred to the recommender agent, which makes the recommender agent able to alone make recommendations that balance the general preference and temporary interest of users. At last, the experiments conducted on real world datasets verify the effectiveness of TIARec.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes