CVAug 15, 2025

Multi-State Tracker: Enhancing Efficient Object Tracking via Multi-State Specialization and Interaction

arXiv:2508.11531v11 citationsh-index: 30Has CodeMM
Originality Incremental advance
AI Analysis

This work addresses the trade-off between efficiency and accuracy in object tracking for computer vision applications, representing an incremental improvement with specific gains.

The paper tackles the problem of efficient object tracking, where reduced computational complexity often weakens feature representation, by proposing Multi-State Tracker (MST) that uses lightweight modules to enhance and interact multi-state features, resulting in improved tracking accuracy and robustness, such as a 4.5% AO score gain over the previous SOTA on GOT-10K.

Efficient trackers achieve faster runtime by reducing computational complexity and model parameters. However, this efficiency often compromises the expense of weakened feature representation capacity, thus limiting their ability to accurately capture target states using single-layer features. To overcome this limitation, we propose Multi-State Tracker (MST), which utilizes highly lightweight state-specific enhancement (SSE) to perform specialized enhancement on multi-state features produced by multi-state generation (MSG) and aggregates them in an interactive and adaptive manner using cross-state interaction (CSI). This design greatly enhances feature representation while incurring minimal computational overhead, leading to improved tracking robustness in complex environments. Specifically, the MSG generates multiple state representations at multiple stages during feature extraction, while SSE refines them to highlight target-specific features. The CSI module facilitates information exchange between these states and ensures the integration of complementary features. Notably, the introduced SSE and CSI modules adopt a highly lightweight hidden state adaptation-based state space duality (HSA-SSD) design, incurring only 0.1 GFLOPs in computation and 0.66 M in parameters. Experimental results demonstrate that MST outperforms all previous efficient trackers across multiple datasets, significantly improving tracking accuracy and robustness. In particular, it shows excellent runtime performance, with an AO score improvement of 4.5\% over the previous SOTA efficient tracker HCAT on the GOT-10K dataset. The code is available at https://github.com/wsumel/MST.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes