LGSIMLMay 23, 2018

Deep Reinforcement Learning of Marked Temporal Point Processes

arXiv:1805.09360v2116 citations
AI Analysis

It addresses the challenge of asynchronous event-based interactions for applications such as education and marketing, representing an incremental improvement with a novel method for a known bottleneck.

The paper tackles the problem of designing online interventions for asynchronous stochastic discrete events in continuous time, using deep reinforcement learning of marked temporal point processes, and shows it can help learners and marketers achieve goals more effectively than alternatives in applications like personalized teaching and viral marketing.

In a wide variety of applications, humans interact with a complex environment by means of asynchronous stochastic discrete events in continuous time. Can we design online interventions that will help humans achieve certain goals in such asynchronous setting? In this paper, we address the above problem from the perspective of deep reinforcement learning of marked temporal point processes, where both the actions taken by an agent and the feedback it receives from the environment are asynchronous stochastic discrete events characterized using marked temporal point processes. In doing so, we define the agent's policy using the intensity and mark distribution of the corresponding process and then derive a flexible policy gradient method, which embeds the agent's actions and the feedback it receives into real-valued vectors using deep recurrent neural networks. Our method does not make any assumptions on the functional form of the intensity and mark distribution of the feedback and it allows for arbitrarily complex reward functions. We apply our methodology to two different applications in personalized teaching and viral marketing and, using data gathered from Duolingo and Twitter, we show that it may be able to find interventions to help learners and marketers achieve their goals more effectively than alternatives.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes