LG AIMay 13, 2024

Decision Mamba Architectures

arXiv:2405.07943v22.61 citationsh-index: 5Has Code

Originality Incremental advance

AI Analysis

This work addresses the need for more efficient sequence models in imitation learning for robotics and AI applications, though it is incremental as it builds on existing Mamba and Transformer architectures.

The paper tackled the problem of improving imitation learning performance by introducing Decision Mamba (DM) and Hierarchical Decision Mamba (HDM) architectures, which outperformed Transformer-based models like Decision Transformer and Hierarchical Decision Transformer in most tasks across environments such as OpenAI Gym and D4RL.

Recent advancements in imitation learning have been largely fueled by the integration of sequence models, which provide a structured flow of information to effectively mimic task behaviours. Currently, Decision Transformer (DT) and subsequently, the Hierarchical Decision Transformer (HDT), presented Transformer-based approaches to learn task policies. Recently, the Mamba architecture has shown to outperform Transformers across various task domains. In this work, we introduce two novel methods, Decision Mamba (DM) and Hierarchical Decision Mamba (HDM), aimed at enhancing the performance of the Transformer models. Through extensive experimentation across diverse environments such as OpenAI Gym and D4RL, leveraging varying demonstration data sets, we demonstrate the superiority of Mamba models over their Transformer counterparts in a majority of tasks. Results show that DM outperforms other methods in most settings. The code can be found at https://github.com/meowatthemoon/DecisionMamba.

View on arXiv PDF Code

Similar