LGAIJan 7, 2018

Sample-Efficient Reinforcement Learning through Transfer and Architectural Priors

arXiv:1801.02268v115 citations
Originality Incremental advance
AI Analysis

This addresses the slow learning rates in reinforcement learning for AI systems, though it is incremental as it builds on existing transfer learning methods.

The paper tackles the sample inefficiency of deep reinforcement learning by using transfer learning and architectural priors, achieving a 50x improvement in positive transfer on a toy problem-set.

Recent work in deep reinforcement learning has allowed algorithms to learn complex tasks such as Atari 2600 games just from the reward provided by the game, but these algorithms presently require millions of training steps in order to learn, making them approximately five orders of magnitude slower than humans. One reason for this is that humans build robust shared representations that are applicable to collections of problems, making it much easier to assimilate new variants. This paper first introduces the idea of automatically-generated game sets to aid in transfer learning research, and then demonstrates the utility of shared representations by showing that models can substantially benefit from the incorporation of relevant architectural priors. This technique affords a remarkable 50x positive transfer on a toy problem-set.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes