CVAILGNov 25, 2025

StableTrack: Stabilizing Multi-Object Tracking on Low-Frequency Detections

arXiv:2511.20418v1
Originality Incremental advance
AI Analysis

This addresses the issue of running MOT models efficiently in resource-constrained environments, though it is incremental as it builds on existing tracking methods.

The paper tackles the problem of multi-object tracking under low-frequency detections, which is challenging with limited computing resources, and achieves an 11.6% HOTA improvement at 1 Hz on MOT17-val while maintaining performance on standard benchmarks.

Multi-object tracking (MOT) is one of the most challenging tasks in computer vision, where it is important to correctly detect objects and associate these detections across frames. Current approaches mainly focus on tracking objects in each frame of a video stream, making it almost impossible to run the model under conditions of limited computing resources. To address this issue, we propose StableTrack, a novel approach that stabilizes the quality of tracking on low-frequency detections. Our method introduces a new two-stage matching strategy to improve the cross-frame association between low-frequency detections. We propose a novel Bbox-Based Distance instead of the conventional Mahalanobis distance, which allows us to effectively match objects using the Re-ID model. Furthermore, we integrate visual tracking into the Kalman Filter and the overall tracking pipeline. Our method outperforms current state-of-the-art trackers in the case of low-frequency detections, achieving $\textit{11.6%}$ HOTA improvement at $\textit{1}$ Hz on MOT17-val, while keeping up with the best approaches on the standard MOT17, MOT20, and DanceTrack benchmarks with full-frequency detections.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes