IROct 21, 2021

Locality-Sensitive Experience Replay for Online Recommendation

Xiaocong Chen, Lina Yao, Xianzhi Wang, Julian McAuley

arXiv:2110.10850v12.0

Originality Incremental advance

AI Analysis

This work addresses the problem of dynamic user preference adaptation in online recommender systems, representing an incremental improvement over prior experience replay techniques.

The paper tackles the challenge of training deep reinforcement learning agents for online recommendation by addressing inefficiencies in existing experience replay methods, resulting in a model that demonstrates feasibility and superiority over several existing methods on three online simulation platforms.

Online recommendation requires handling rapidly changing user preferences. Deep reinforcement learning (DRL) is gaining interest as an effective means of capturing users' dynamic interest during interactions with recommender systems. However, it is challenging to train a DRL agent, due to large state space (e.g., user-item rating matrix and user profiles), action space (e.g., candidate items), and sparse rewards. Existing studies encourage the agent to learn from past experience via experience replay (ER). They adapt poorly to the complex environment of online recommender systems and are inefficient in determining an optimal strategy from past experience. To address these issues, we design a novel state-aware experience replay model, which uses locality-sensitive hashing to map high dimensional data into low-dimensional representations and a prioritized reward-driven strategy to replay more valuable experience at a higher chance. Our model can selectively pick the most relevant and salient experiences and recommend the agent with the optimal policy. Experiments on three online simulation platforms demonstrate our model' feasibility and superiority toseveral existing experience replay methods.

View on arXiv PDF

Similar