Frequency-Based Patrolling with Heterogeneous Agents and Limited Communication
This addresses patrolling in large, unstructured environments with sparse communication, but it is incremental as it builds on existing multi-agent reinforcement learning methods.
The paper tackles multi-agent patrolling on intersecting circle graphs with non-uniform visitation requirements and limited communication, developing a reinforcement learning solution that shows learned coordination strategies and generalization to dynamic task variations.
This paper investigates multi-agent frequencybased patrolling of intersecting, circle graphs under conditions where graph nodes have non-uniform visitation requirements and agents have limited ability to communicate. The task is modeled as a partially observable Markov decision process, and a reinforcement learning solution is developed. Each agent generates its own policy from Markov chains, and policies are exchanged only when agents occupy the same or adjacent nodes. This constraint on policy exchange models sparse communication conditions over large, unstructured environments. Empirical results provide perspectives on convergence properties, agent cooperation, and generalization of learned patrolling policies to new instances of the task. The emergent behavior indicates learned coordination strategies between heterogeneous agents for patrolling large, unstructured regions as well as the ability to generalize to dynamic variation in node visitation requirements.