CVNov 17, 2023

Single-Shot and Multi-Shot Feature Learning for Multi-Object Tracking

arXiv:2311.10382v111 citationsh-index: 31
Originality Incremental advance
AI Analysis

This addresses the challenge of maintaining target identities in videos affected by occlusion and clutter, offering incremental improvements in tracking accuracy.

The paper tackles the problem of robust data association in Multi-Object Tracking by proposing a two-stage feature learning paradigm that jointly learns single-shot and multi-shot features, achieving state-of-the-art performance on MOT17, MOT20, and DanceTrack datasets.

Multi-Object Tracking (MOT) remains a vital component of intelligent video analysis, which aims to locate targets and maintain a consistent identity for each target throughout a video sequence. Existing works usually learn a discriminative feature representation, such as motion and appearance, to associate the detections across frames, which are easily affected by mutual occlusion and background clutter in practice. In this paper, we propose a simple yet effective two-stage feature learning paradigm to jointly learn single-shot and multi-shot features for different targets, so as to achieve robust data association in the tracking process. For the detections without being associated, we design a novel single-shot feature learning module to extract discriminative features of each detection, which can efficiently associate targets between adjacent frames. For the tracklets being lost several frames, we design a novel multi-shot feature learning module to extract discriminative features of each tracklet, which can accurately refind these lost targets after a long period. Once equipped with a simple data association logic, the resulting VisualTracker can perform robust MOT based on the single-shot and multi-shot feature representations. Extensive experimental results demonstrate that our method has achieved significant improvements on MOT17 and MOT20 datasets while reaching state-of-the-art performance on DanceTrack dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes