CVJul 5, 2024

FeatureSORT: Essential Features for Effective Tracking

arXiv:2407.04249v215 citationsh-index: 3
AI Analysis

This work addresses identity switches and occlusions in multi-object tracking for applications like surveillance and autonomous driving, representing an incremental improvement over existing methods.

The paper tackles the problem of online multiple object tracking by introducing FeatureSORT, which enhances DeepSORT with additional appearance features and post-processing, achieving state-of-the-art MOTA scores such as 79.7 on MOT16 and 92.2 on DanceTrack.

We introduce FeatureSORT, a simple yet effective online multiple object tracker that reinforces the DeepSORT baseline with a redesigned detector and additional feature cues. In contrast to conventional detectors that only provide bounding boxes, our modified YOLOX architecture is extended to output multiple appearance attributes, including clothing color, clothing style, and motion direction, alongside the bounding boxes. These feature cues, together with a ReID network, form complementary embeddings that substantially improve association accuracy. Furthermore, we incorporate stronger post-processing strategies, such as global linking and Gaussian Smoothing Process interpolation, to handle missing associations and detections. During online tracking, we define a measurement-to-track distance function that jointly considers IoU, direction, color, style, and ReID similarity. This design enables FeatureSORT to maintain consistent identities through longer occlusions while reducing identity switches. Extensive experiments on standard MOT benchmarks demonstrate that FeatureSORT achieves state-of-the-art online performance, with MOTA scores of 79.7 on MOT16, 80.6 on MOT17, 77.9 on MOT20, and 92.2 on DanceTrack, underscoring the effectiveness of feature-enriched detection and modular post processing in advancing multi-object tracking.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes