LGAICRMay 5, 2024

RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation

arXiv:2405.03064v312 citationsh-index: 12ICML
Originality Incremental advance
AI Analysis

This addresses a key challenge in training DRL agents for complex tasks, offering a novel solution to improve performance, though it appears incremental as it builds on existing refining schemes.

The paper tackles the training bottlenecks in deep reinforcement learning, especially with sparse rewards, by proposing RICE, a refining scheme that uses explanation methods to create a mixed initial state distribution, resulting in significantly outperforming existing schemes in various environments.

Deep reinforcement learning (DRL) is playing an increasingly important role in real-world applications. However, obtaining an optimally performing DRL agent for complex tasks, especially with sparse rewards, remains a significant challenge. The training of a DRL agent can be often trapped in a bottleneck without further progress. In this paper, we propose RICE, an innovative refining scheme for reinforcement learning that incorporates explanation methods to break through the training bottlenecks. The high-level idea of RICE is to construct a new initial state distribution that combines both the default initial states and critical states identified through explanation methods, thereby encouraging the agent to explore from the mixed initial states. Through careful design, we can theoretically guarantee that our refining scheme has a tighter sub-optimality bound. We evaluate RICE in various popular RL environments and real-world applications. The results demonstrate that RICE significantly outperforms existing refining schemes in enhancing agent performance.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes