LGAIMay 20, 2025

Energy-Efficient Deep Reinforcement Learning with Spiking Transformers

arXiv:2505.14533v11 citationsh-index: 2
Originality Incremental advance
AI Analysis

This work addresses energy efficiency for deploying reinforcement learning in real-world autonomous systems, representing an incremental advance by combining existing SNN and Transformer concepts.

The paper tackled the high energy consumption of Transformers in reinforcement learning by developing a spiking neural network-based Transformer algorithm, achieving significantly improved policy performance and enhanced energy efficiency on benchmarks.

Agent-based Transformers have been widely adopted in recent reinforcement learning advances due to their demonstrated ability to solve complex tasks. However, the high computational complexity of Transformers often results in significant energy consumption, limiting their deployment in real-world autonomous systems. Spiking neural networks (SNNs), with their biologically inspired structure, offer an energy-efficient alternative for machine learning. In this paper, a novel Spike-Transformer Reinforcement Learning (STRL) algorithm that combines the energy efficiency of SNNs with the powerful decision-making capabilities of reinforcement learning is developed. Specifically, an SNN using multi-step Leaky Integrate-and-Fire (LIF) neurons and attention mechanisms capable of processing spatio-temporal patterns over multiple time steps is designed. The architecture is further enhanced with state, action, and reward encodings to create a Transformer-like structure optimized for reinforcement learning tasks. Comprehensive numerical experiments conducted on state-of-the-art benchmarks demonstrate that the proposed SNN Transformer achieves significantly improved policy performance compared to conventional agent-based Transformers. With both enhanced energy efficiency and policy optimality, this work highlights a promising direction for deploying bio-inspired, low-cost machine learning models in complex real-world decision-making scenarios.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes