LGAIFeb 13, 2023

Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning

arXiv:2302.06548v116 citationsh-index: 49Has Code
Originality Incremental advance
AI Analysis

This addresses a critical challenge for robots and AI systems in distinguishing useful information from noise, though it is incremental as it builds on existing dynamic sparse training and reinforcement learning methods.

The paper tackles the problem of filtering irrelevant noise in reinforcement learning by introducing the extremely noisy environment (ENE) setting, where up to 99% of input features are noise, and proposes Automatic Noise Filtering (ANF), which improves performance over baselines like SAC and TD3 by using up to 95% fewer weights and excels in transfer learning scenarios.

Tomorrow's robots will need to distinguish useful information from noise when performing different tasks. A household robot for instance may continuously receive a plethora of information about the home, but needs to focus on just a small subset to successfully execute its current chore. Filtering distracting inputs that contain irrelevant data has received little attention in the reinforcement learning literature. To start resolving this, we formulate a problem setting in reinforcement learning called the $\textit{extremely noisy environment}$ (ENE), where up to $99\%$ of the input features are pure noise. Agents need to detect which features provide task-relevant information about the state of the environment. Consequently, we propose a new method termed $\textit{Automatic Noise Filtering}$ (ANF), which uses the principles of dynamic sparse training in synergy with various deep reinforcement learning algorithms. The sparse input layer learns to focus its connectivity on task-relevant features, such that ANF-SAC and ANF-TD3 outperform standard SAC and TD3 by a large margin, while using up to $95\%$ fewer weights. Furthermore, we devise a transfer learning setting for ENEs, by permuting all features of the environment after 1M timesteps to simulate the fact that other information sources can become relevant as the world evolves. Again, ANF surpasses the baselines in final performance and sample complexity. Our code is available at https://github.com/bramgrooten/automatic-noise-filtering

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes