MLLGFeb 23, 2017

Automatic Representation for Lifetime Value Recommender Systems

arXiv:1702.07125v11 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of moving beyond immediate gains to maximize cumulative user value in commercial recommender systems, though it appears incremental by automating representation within an existing RL framework.

The paper tackles the problem of optimizing lifetime value (LTV) in recommender systems by proposing a new architecture that combines reinforcement learning (RL) with automated state-space representation, eliminating the need for hand-tuned features, and tests it on real-world batch data.

Many modern commercial sites employ recommender systems to propose relevant content to users. While most systems are focused on maximizing the immediate gain (clicks, purchases or ratings), a better notion of success would be the lifetime value (LTV) of the user-system interaction. The LTV approach considers the future implications of the item recommendation, and seeks to maximize the cumulative gain over time. The Reinforcement Learning (RL) framework is the standard formulation for optimizing cumulative successes over time. However, RL is rarely used in practice due to its associated representation, optimization and validation techniques which can be complex. In this paper we propose a new architecture for combining RL with recommendation systems which obviates the need for hand-tuned features, thus automating the state-space representation construction process. We analyze the practical difficulties in this formulation and test our solutions on batch off-line real-world recommendation data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes