LGNEROMar 5, 2023

Swim: A General-Purpose, High-Performing, and Efficient Activation Function for Locomotion Control Tasks

arXiv:2303.02640v12 citationsh-index: 9
Originality Incremental advance
AI Analysis

This work addresses the need for more efficient activation functions in robotics and deep reinforcement learning, offering an incremental improvement over existing methods like Swish for locomotion control.

The paper tackles the problem of inefficient activation functions like Swish in deep reinforcement learning by proposing Swim, a general-purpose alternative that achieves state-of-the-art performance and efficiency on MuJoCo locomotion tasks, with concrete improvements in reward and computational speed compared to Swish.

Activation functions play a significant role in the performance of deep learning algorithms. In particular, the Swish activation function tends to outperform ReLU on deeper models, including deep reinforcement learning models, across challenging tasks. Despite this progress, ReLU is the preferred function partly because it is more efficient than Swish. Furthermore, in contrast to the fields of computer vision and natural language processing, the deep reinforcement learning and robotics domains have seen less inclination to adopt new activation functions, such as Swish, and instead continue to use more traditional functions, like ReLU. To tackle those issues, we propose Swim, a general-purpose, efficient, and high-performing alternative to Swish, and then provide an analysis of its properties as well as an explanation for its high-performance relative to Swish, in terms of both reward-achievement and efficiency. We focus on testing Swim on MuJoCo's locomotion continuous control tasks since they exhibit more complex dynamics and would therefore benefit most from a high-performing and efficient activation function. We also use the TD3 algorithm in conjunction with Swim and explain this choice in the context of the robot locomotion domain. We then conclude that Swim is a state-of-the-art activation function for continuous control locomotion tasks and recommend using it with TD3 as a working framework.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes