LGAIOct 5, 2022

Query The Agent: Improving sample efficiency through epistemic uncertainty estimation

arXiv:2210.02585v11 citationsh-index: 21
Originality Incremental advance
AI Analysis

This addresses sample efficiency for reinforcement learning agents, though it appears incremental as it builds on existing curricula methods.

The paper tackles the problem of poor sample efficiency in goal-conditioned reinforcement learning by proposing Query The Agent (QTA), which uses Predictive Uncertainty Networks to estimate epistemic uncertainty and set goals in uncertain states, resulting in decisive sample efficiency improvements over existing methods.

Curricula for goal-conditioned reinforcement learning agents typically rely on poor estimates of the agent's epistemic uncertainty or fail to consider the agents' epistemic uncertainty altogether, resulting in poor sample efficiency. We propose a novel algorithm, Query The Agent (QTA), which significantly improves sample efficiency by estimating the agent's epistemic uncertainty throughout the state space and setting goals in highly uncertain areas. Encouraging the agent to collect data in highly uncertain states allows the agent to improve its estimation of the value function rapidly. QTA utilizes a novel technique for estimating epistemic uncertainty, Predictive Uncertainty Networks (PUN), to allow QTA to assess the agent's uncertainty in all previously observed states. We demonstrate that QTA offers decisive sample efficiency improvements over preexisting methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes