CVFeb 15, 2024

Beyond Kalman Filters: Deep Learning-Based Filters for Improved Object Tracking

arXiv:2402.09865v19 citationsh-index: 3Mach Vis Appl
Originality Incremental advance
AI Analysis

This work addresses the problem of handling non-linear motion patterns in object tracking for applications like sports and dance, offering incremental improvements over existing motion-based trackers.

The paper tackles the limitations of Kalman filters in object tracking by proposing two data-driven filtering methods that improve bounding box prediction accuracy and reduce domain-specific design needs, demonstrating performance gains over traditional methods on datasets like DanceTrack and SportsMOT.

Traditional tracking-by-detection systems typically employ Kalman filters (KF) for state estimation. However, the KF requires domain-specific design choices and it is ill-suited to handling non-linear motion patterns. To address these limitations, we propose two innovative data-driven filtering methods. Our first method employs a Bayesian filter with a trainable motion model to predict an object's future location and combines its predictions with observations gained from an object detector to enhance bounding box prediction accuracy. Moreover, it dispenses with most domain-specific design choices characteristic of the KF. The second method, an end-to-end trainable filter, goes a step further by learning to correct detector errors, further minimizing the need for domain expertise. Additionally, we introduce a range of motion model architectures based on Recurrent Neural Networks, Neural Ordinary Differential Equations, and Conditional Neural Processes, that are combined with the proposed filtering methods. Our extensive evaluation across multiple datasets demonstrates that our proposed filters outperform the traditional KF in object tracking, especially in the case of non-linear motion patterns -- the use case our filters are best suited to. We also conduct noise robustness analysis of our filters with convincing positive results. We further propose a new cost function for associating observations with tracks. Our tracker, which incorporates this new association cost with our proposed filters, outperforms the conventional SORT method and other motion-based trackers in multi-object tracking according to multiple metrics on motion-rich DanceTrack and SportsMOT datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes