AINEDec 8, 2023

DARLEI: Deep Accelerated Reinforcement Learning with Evolutionary Intelligence

arXiv:2312.05171v1h-index: 23
Originality Incremental advance
AI Analysis

This work addresses the computational bottleneck in evolutionary robotics for researchers, though it is incremental as it builds on existing methods like PPO and Isaac Gym.

The authors tackled the problem of efficiently training and evolving populations of UNIMAL agents by combining evolutionary algorithms with parallelized reinforcement learning, achieving over 20x speedup on a single workstation compared to previous distributed CPU clusters.

We present DARLEI, a framework that combines evolutionary algorithms with parallelized reinforcement learning for efficiently training and evolving populations of UNIMAL agents. Our approach utilizes Proximal Policy Optimization (PPO) for individual agent learning and pairs it with a tournament selection-based generational learning mechanism to foster morphological evolution. By building on Nvidia's Isaac Gym, DARLEI leverages GPU accelerated simulation to achieve over 20x speedup using just a single workstation, compared to previous work which required large distributed CPU clusters. We systematically characterize DARLEI's performance under various conditions, revealing factors impacting diversity of evolved morphologies. For example, by enabling inter-agent collisions within the simulator, we find that we can simulate some multi-agent interactions between the same morphology, and see how it influences individual agent capabilities and long-term evolutionary adaptation. While current results demonstrate limited diversity across generations, we hope to extend DARLEI in future work to include interactions between diverse morphologies in richer environments, and create a platform that allows for coevolving populations and investigating emergent behaviours in them. Our source code is also made publicly at https://saeejithnair.github.io/darlei.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes