CVSep 26, 2024

CAMOT: Camera Angle-aware Multi-Object Tracking

arXiv:2409.17533v211 citationsh-index: 15
Originality Incremental advance
AI Analysis

This work addresses tracking challenges in computer vision, offering a computationally efficient solution for applications like surveillance, though it is incremental as it builds on existing 2D MOT methods.

The paper tackled occlusion and inaccurate depth estimation in multi-object tracking by proposing CAMOT, a camera angle estimator that assumes objects are on a flat plane, enabling pseudo-3D tracking and achieving state-of-the-art results such as 63.8% HOTA on MOT17.

This paper proposes CAMOT, a simple camera angle estimator for multi-object tracking to tackle two problems: 1) occlusion and 2) inaccurate distance estimation in the depth direction. Under the assumption that multiple objects are located on a flat plane in each video frame, CAMOT estimates the camera angle using object detection. In addition, it gives the depth of each object, enabling pseudo-3D MOT. We evaluated its performance by adding it to various 2D MOT methods on the MOT17 and MOT20 datasets and confirmed its effectiveness. Applying CAMOT to ByteTrack, we obtained 63.8% HOTA, 80.6% MOTA, and 78.5% IDF1 in MOT17, which are state-of-the-art results. Its computational cost is significantly lower than the existing deep-learning-based depth estimators for tracking.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes