LGMLAug 22, 2019

Practical Risk Measures in Reinforcement Learning

arXiv:1908.08379v19 citations
AI Analysis

This work addresses risk management in RL for practical applications, but it appears incremental as it builds on existing approximation schemes and neural methods.

The paper tackles the problem of incorporating risk considerations into reinforcement learning by proposing a neural architecture for estimating and optimizing policies under general risk measures, demonstrating its efficacy through experiments.

Practical application of Reinforcement Learning (RL) often involves risk considerations. We study a generalized approximation scheme for risk measures, based on Monte-Carlo simulations, where the risk measures need not necessarily be \emph{coherent}. We demonstrate that, even in simple problems, measures such as the variance of the reward-to-go do not capture the risk in a satisfactory manner. In addition, we show how a risk measure can be derived from model's realizations. We propose a neural architecture for estimating the risk and suggest the risk critic architecture that can be use to optimize a policy under general risk measures. We conclude our work with experiments that demonstrate the efficacy of our approach.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes