AISep 5, 2022

A New Approach to Training Multiple Cooperative Agents for Autonomous Driving

arXiv:2209.02157v1h-index: 22

Originality Incremental advance

AI Analysis

This addresses the problem of safe and cooperative control for autonomous driving fleets, but it appears incremental as it builds on existing multi-agent training approaches.

The paper tackles training multiple cooperative agents for autonomous driving by proposing Lepus, a method using shared policy networks and reward functions with adversarial pre-training and reward approximation from expert trajectories, which in experiments on MADRaS outperforms four baseline methods in stability.

Training multiple agents to perform safe and cooperative control in the complex scenarios of autonomous driving has been a challenge. For a small fleet of cars moving together, this paper proposes Lepus, a new approach to training multiple agents. Lepus adopts a pure cooperative manner for training multiple agents, featured with the shared parameters of policy networks and the shared reward function of multiple agents. In particular, Lepus pre-trains the policy networks via an adversarial process, improving its collaborative decision-making capability and further the stability of car driving. Moreover, for alleviating the problem of sparse rewards, Lepus learns an approximate reward function from expert trajectories by combining a random network and a distillation network. We conduct extensive experiments on the MADRaS simulation platform. The experimental results show that multiple agents trained by Lepus can avoid collisions as many as possible while driving simultaneously and outperform the other four methods, that is, DDPG-FDE, PSDDPG, MADDPG, and MAGAIL(DDPG) in terms of stability.

View on arXiv PDF

Similar