LG AI MLFeb 11, 2019

Stochastic Reinforcement Learning

Nikki Lijing Kuang, Clement H. C. Leung, Vienne W. K. Sung

arXiv:1902.04178v14 citations

Originality Incremental advance

AI Analysis

This work addresses the challenge of efficient learning in noisy, real-world reinforcement learning settings where observations are expensive, though it appears incremental in modeling stochastic elements and costs.

The paper tackles the problem of reinforcement learning in stochastic environments with costly observations by presenting a stochastic approach that models environmental variability and observation costs, resulting in quantitative criteria for learning success and probabilities of exceeding cost bounds.

In reinforcement learning episodes, the rewards and punishments are often non-deterministic, and there are invariably stochastic elements governing the underlying situation. Such stochastic elements are often numerous and cannot be known in advance, and they have a tendency to obscure the underlying rewards and punishments patterns. Indeed, if stochastic elements were absent, the same outcome would occur every time and the learning problems involved could be greatly simplified. In addition, in most practical situations, the cost of an observation to receive either a reward or punishment can be significant, and one would wish to arrive at the correct learning conclusion by incurring minimum cost. In this paper, we present a stochastic approach to reinforcement learning which explicitly models the variability present in the learning environment and the cost of observation. Criteria and rules for learning success are quantitatively analyzed, and probabilities of exceeding the observation cost bounds are also obtained.

View on arXiv PDF

Similar