AISep 3, 2020

Learning to Infer User Hidden States for Online Sequential Advertising

arXiv:2009.01453v1
Originality Incremental advance
AI Analysis

This work addresses interpretability in online advertising for advertisers, though it appears incremental as it builds on existing POMDP and deep learning methods.

The authors tackled the problem of optimizing sequential advertising strategies by improving interpretability through modeling consumer purchase intent as a latent variable in a POMDP framework, achieving superior performance in large-scale industrial experiments.

To drive purchase in online advertising, it is of the advertiser's great interest to optimize the sequential advertising strategy whose performance and interpretability are both important. The lack of interpretability in existing deep reinforcement learning methods makes it not easy to understand, diagnose and further optimize the strategy. In this paper, we propose our Deep Intents Sequential Advertising (DISA) method to address these issues. The key part of interpretability is to understand a consumer's purchase intent which is, however, unobservable (called hidden states). In this paper, we model this intention as a latent variable and formulate the problem as a Partially Observable Markov Decision Process (POMDP) where the underlying intents are inferred based on the observable behaviors. Large-scale industrial offline and online experiments demonstrate our method's superior performance over several baselines. The inferred hidden states are analyzed, and the results prove the rationality of our inference.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes