LG AIDec 9, 2020

Interactive Search Based on Deep Reinforcement Learning

Yang Yu, Zhenhao Gu, Rong Tao, Jingtian Ge, Kenglun Chang

arXiv:2012.06052v12.31 citations

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge of offline training for reinforcement learning-based recommendation systems, which is a problem for e-commerce platforms seeking more effective recommendations.

This paper establishes a virtual user environment for offline training of recommendation systems. It also proposes an improved reinforcement learning algorithm based on bi-clustering to expand the action and recommended path space for the recommendation agent.

With the continuous development of machine learning technology, major e-commerce platforms have launched recommendation systems based on it to serve a large number of customers with different needs more efficiently. Compared with traditional supervised learning, reinforcement learning can better capture the user's state transition in the decision-making process, and consider a series of user actions, not just the static characteristics of the user at a certain moment. In theory, it will have a long-term perspective, producing a more effective recommendation. The special requirements of reinforcement learning for data make it need to rely on an offline virtual system for training. Our project mainly establishes a virtual user environment for offline training. At the same time, we tried to improve a reinforcement learning algorithm based on bi-clustering to expand the action space and recommended path space of the recommendation agent.

View on arXiv PDF

Similar