NECVMar 9, 2025

SDTrack: A Baseline for Event-based Tracking via Spiking Neural Networks

arXiv:2503.08703v310 citationsh-index: 8
Originality Incremental advance
AI Analysis

This work addresses event-based tracking for neuromorphic vision, offering a novel baseline with incremental improvements in efficiency and performance.

The paper tackles the problem of event-based tracking by proposing SDTrack, a Transformer-based spike-driven tracker that achieves state-of-the-art performance with the lowest parameter count and energy consumption on multiple benchmarks.

Event cameras provide superior temporal resolution, dynamic range, power efficiency, and pixel bandwidth. Spiking Neural Networks (SNNs) naturally complement event data through discrete spike signals, making them ideal for event-based tracking. However, current approaches that combine Artificial Neural Networks (ANNs) and SNNs, along with suboptimal architectures, compromise energy efficiency and limit tracking performance. To address these limitations, we propose the first Transformer-based spike-driven tracking pipeline. Our Global Trajectory Prompt (GTP) method effectively captures global trajectory information and aggregates it with event streams into event images to enhance spatiotemporal representation. We then introduce SDTrack, a Transformer-based spike-driven tracker comprising a Spiking MetaFormer backbone and a tracking head that directly predicts normalized coordinates using spike signals. The framework is end-to-end, does not require data augmentation or post-processing. Extensive experiments demonstrate that SDTrack achieves state-of-the-art performance while maintaining the lowest parameter count and energy consumption across multiple event-based tracking benchmarks, establishing a solid baseline for future research in the field of neuromorphic vision.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes