NI AI LG MAJan 9, 2021

Robust and Scalable Routing with Multi-Agent Deep Reinforcement Learning for MANETs

Saeed Kaviani, Bo Ryu, Ejaz Ahmed, Kevin A. Larson, Anh Le, Alex Yahja, Jae H. Kim

arXiv:2101.03273v25.930 citations

Originality Incremental advance

AI Analysis

This work addresses the challenge of developing robust, efficient, and scalable routing protocols for highly dynamic MANETs, which is a critical problem for network engineers and operators.

The paper introduces DeepCQ+, a routing protocol for mobile ad-hoc networks (MANETs) that integrates multi-agent deep reinforcement learning (MADRL) with existing Q-learning protocols. DeepCQ+ achieves 10-15% higher end-to-end throughput with lower overhead compared to Q-learning counterparts, and maintains similar performance gains even in untrained network configurations.

Highly dynamic mobile ad-hoc networks (MANETs) are continuing to serve as one of the most challenging environments to develop and deploy robust, efficient, and scalable routing protocols. In this paper, we present DeepCQ+ routing which, in a novel manner, integrates emerging multi-agent deep reinforcement learning (MADRL) techniques into existing Q-learning-based routing protocols and their variants, and achieves persistently higher performance across a wide range of MANET configurations while training only on a limited range of network parameters and conditions. Quantitatively, DeepCQ+ shows consistently higher end-to-end throughput with lower overhead compared to its Q-learning-based counterparts with the overall gain of 10-15% in its efficiency. Qualitatively and more significantly, DeepCQ+ maintains remarkably similar performance gains under many scenarios that it was not trained for in terms of network sizes, mobility conditions, and traffic dynamics. To the best of our knowledge, this is the first successful demonstration of MADRL for the MANET routing problem that achieves and maintains a high degree of scalability and robustness even in the environments that are outside the trained range of scenarios. This implies that the proposed hybrid design approach of DeepCQ+ that combines MADRL and Q-learning significantly increases its practicality and explainability because the real-world MANET environment will likely vary outside the trained range of MANET scenarios.

View on arXiv PDF

Similar