CVSep 11, 2022

Multiple Object Tracking in Recent Times: A Literature Review

arXiv:2209.04796v134 citationsh-index: 19
Originality Synthesis-oriented
AI Analysis

It offers a comprehensive overview for researchers in computer vision, especially for applications like autonomous driving, but is incremental as it consolidates existing work without new findings.

This literature review synthesizes over a hundred recent papers on multiple object tracking (MOT) in computer vision, identifying key techniques like transformers and graph neural networks to address challenges such as occlusion and ID switching, and provides future directions and benchmark datasets.

Multiple object tracking gained a lot of interest from researchers in recent years, and it has become one of the trending problems in computer vision, especially with the recent advancement of autonomous driving. MOT is one of the critical vision tasks for different issues like occlusion in crowded scenes, similar appearance, small object detection difficulty, ID switching, etc. To tackle these challenges, as researchers tried to utilize the attention mechanism of transformer, interrelation of tracklets with graph convolutional neural network, appearance similarity of objects in different frames with the siamese network, they also tried simple IOU matching based CNN network, motion prediction with LSTM. To take these scattered techniques under an umbrella, we have studied more than a hundred papers published over the last three years and have tried to extract the techniques that are more focused on by researchers in recent times to solve the problems of MOT. We have enlisted numerous applications, possibilities, and how MOT can be related to real life. Our review has tried to show the different perspectives of techniques that researchers used overtimes and give some future direction for the potential researchers. Moreover, we have included popular benchmark datasets and metrics in this review.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes