LGMAROJan 19, 2022

Interpretable Learned Emergent Communication for Human-Agent Teams

arXiv:2201.07452v213 citations
AI Analysis

This work addresses the challenge of improving human-agent collaboration through interpretable communication, though it is incremental as it builds on existing sparse-discrete methods.

The paper tackled the problem of enabling interpretable communication in human-agent teams by analyzing sparse-discrete methods, showing that a prototype-based method allows humans to learn agent communication faster and better than a baseline, with minimal performance loss in benchmark environments.

Learning interpretable communication is essential for multi-agent and human-agent teams (HATs). In multi-agent reinforcement learning for partially-observable environments, agents may convey information to others via learned communication, allowing the team to complete its task. Inspired by human languages, recent works study discrete (using only a finite set of tokens) and sparse (communicating only at some time-steps) communication. However, the utility of such communication in human-agent team experiments has not yet been investigated. In this work, we analyze the efficacy of sparse-discrete methods for producing emergent communication that enables high agent-only and human-agent team performance. We develop agent-only teams that communicate sparsely via our scheme of Enforcers that sufficiently constrain communication to any budget. Our results show no loss or minimal loss of performance in benchmark environments and tasks. In human-agent teams tested in benchmark environments, where agents have been modeled using the Enforcers, we find that a prototype-based method produces meaningful discrete tokens that enable human partners to learn agent communication faster and better than a one-hot baseline. Additional HAT experiments show that an appropriate sparsity level lowers the cognitive load of humans when communicating with teams of agents and leads to superior team performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes