LGAICVNEAug 19, 2025

STAS: Spatio-Temporal Adaptive Computation Time for Spiking Transformers

arXiv:2508.14138v1h-index: 9
Originality Incremental advance
AI Analysis

This work addresses energy efficiency and performance issues in spiking Transformers for computer vision, offering a unified approach to dynamic computation, but it is incremental as it builds on adaptive computation time principles.

The paper tackles the high latency and computational overhead in spiking neural networks (SNNs) for vision Transformers by proposing STAS, a framework that co-designs static architecture and dynamic computation policy, resulting in up to 45.9% energy reduction on CIFAR-10 while improving accuracy over state-of-the-art models.

Spiking neural networks (SNNs) offer energy efficiency over artificial neural networks (ANNs) but suffer from high latency and computational overhead due to their multi-timestep operational nature. While various dynamic computation methods have been developed to mitigate this by targeting spatial, temporal, or architecture-specific redundancies, they remain fragmented. While the principles of adaptive computation time (ACT) offer a robust foundation for a unified approach, its application to SNN-based vision Transformers (ViTs) is hindered by two core issues: the violation of its temporal similarity prerequisite and a static architecture fundamentally unsuited for its principles. To address these challenges, we propose STAS (Spatio-Temporal Adaptive computation time for Spiking transformers), a framework that co-designs the static architecture and dynamic computation policy. STAS introduces an integrated spike patch splitting (I-SPS) module to establish temporal stability by creating a unified input representation, thereby solving the architectural problem of temporal dissimilarity. This stability, in turn, allows our adaptive spiking self-attention (A-SSA) module to perform two-dimensional token pruning across both spatial and temporal axes. Implemented on spiking Transformer architectures and validated on CIFAR-10, CIFAR-100, and ImageNet, STAS reduces energy consumption by up to 45.9%, 43.8%, and 30.1%, respectively, while simultaneously improving accuracy over SOTA models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes