LGMLMay 26, 2020

Time-Variant Variational Transfer for Value Functions

arXiv:2005.12864v2
AI Analysis

This work addresses a specific challenge in transfer learning for RL by handling non-stationary task distributions, which is incremental as it builds on existing variational transfer methods.

The paper tackles the problem of transferring value functions in reinforcement learning when the task distribution is time-variant, proposing a variational method that leverages temporal structure, and shows experimental results in three RL environments with distinct temporal dynamics.

In most of the transfer learning approaches to reinforcement learning (RL) the distribution over the tasks is assumed to be stationary. Therefore, the target and source tasks are i.i.d. samples of the same distribution. In the context of this work, we consider the problem of transferring value functions through a variational method when the distribution that generates the tasks is time-variant, proposing a solution that leverages this temporal structure inherent in the task generating process. Furthermore, by means of a finite-sample analysis, the previously mentioned solution is theoretically compared to its time-invariant version. Finally, we will provide an experimental evaluation of the proposed technique with three distinct temporal dynamics in three different RL environments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes