MA AI LGDec 13, 2021

PantheonRL: A MARL Library for Dynamic Training Interactions

Bidipta Sarkar, Aditi Talati, Andy Shih, Dorsa Sadigh

arXiv:2112.07013v17.319 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This provides a software tool for researchers and practitioners in multiagent reinforcement learning, but it is incremental as it builds on existing frameworks without introducing new algorithmic breakthroughs.

The authors introduced PantheonRL, a multiagent reinforcement learning library designed to support dynamic training interactions like round-robin and adaptive training, built on StableBaselines3 to leverage existing deep RL algorithms and featuring a web interface for experiment configuration.

We present PantheonRL, a multiagent reinforcement learning software package for dynamic training interactions such as round-robin, adaptive, and ad-hoc training. Our package is designed around flexible agent objects that can be easily configured to support different training interactions, and handles fully general multiagent environments with mixed rewards and n agents. Built on top of StableBaselines3, our package works directly with existing powerful deep RL algorithms. Finally, PantheonRL comes with an intuitive yet functional web user interface for configuring experiments and launching multiple asynchronous jobs. Our package can be found at https://github.com/Stanford-ILIAD/PantheonRL.

View on arXiv PDF Code

Similar