LGAIJun 19, 2022

Robust Imitation Learning against Variations in Environment Dynamics

arXiv:2206.09314v136 citationsh-index: 9
Originality Incremental advance
AI Analysis

This addresses robustness issues in imitation learning for applications where environment conditions vary, though it appears incremental.

The paper tackles the problem of imitation learning failing when environment dynamics change by proposing a framework that imitates multiple experts across sampled dynamics to improve robustness. Numerical results show the algorithm significantly improves robustness against dynamics perturbations compared to conventional baselines.

In this paper, we propose a robust imitation learning (IL) framework that improves the robustness of IL when environment dynamics are perturbed. The existing IL framework trained in a single environment can catastrophically fail with perturbations in environment dynamics because it does not capture the situation that underlying environment dynamics can be changed. Our framework effectively deals with environments with varying dynamics by imitating multiple experts in sampled environment dynamics to enhance the robustness in general variations in environment dynamics. In order to robustly imitate the multiple sample experts, we minimize the risk with respect to the Jensen-Shannon divergence between the agent's policy and each of the sample experts. Numerical results show that our algorithm significantly improves robustness against dynamics perturbations compared to conventional IL baselines.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes