CVAIROJul 2, 2021

Collaborative Visual Navigation

arXiv:2107.01151v222 citations
AI Analysis

This work addresses the gap in multi-agent systems for visually rich environments, which is incremental by extending existing MARL methods to new data and tasks.

The authors tackled the problem of multi-agent visual navigation in visually rich environments by introducing a large-scale 3D dataset, CollaVN, and a memory-augmented communication framework, resulting in improved collaboration and planning as verified empirically across task settings.

As a fundamental problem for Artificial Intelligence, multi-agent system (MAS) is making rapid progress, mainly driven by multi-agent reinforcement learning (MARL) techniques. However, previous MARL methods largely focused on grid-world like or game environments; MAS in visually rich environments has remained less explored. To narrow this gap and emphasize the crucial role of perception in MAS, we propose a large-scale 3D dataset, CollaVN, for multi-agent visual navigation (MAVN). In CollaVN, multiple agents are entailed to cooperatively navigate across photo-realistic environments to reach target locations. Diverse MAVN variants are explored to make our problem more general. Moreover, a memory-augmented communication framework is proposed. Each agent is equipped with a private, external memory to persistently store communication information. This allows agents to make better use of their past communication information, enabling more efficient collaboration and robust long-term planning. In our experiments, several baselines and evaluation metrics are designed. We also empirically verify the efficacy of our proposed MARL approach across different MAVN task settings.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes