CVLGIVOct 29, 2020

Recurrent Neural Networks for video object detection

arXiv:2010.15740v16 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of object detection in videos for applications like autonomous driving, but it is incremental as it reviews and compares existing methods without introducing new techniques.

The paper compares different Recurrent Neural Network methods for video object detection, highlighting the benefit of incorporating temporal context, but does not report specific numerical results.

There is lots of scientific work about object detection in images. For many applications like for example autonomous driving the actual data on which classification has to be done are videos. This work compares different methods, especially those which use Recurrent Neural Networks to detect objects in videos. We differ between feature-based methods, which feed feature maps of different frames into the recurrent units, box-level methods, which feed bounding boxes with class probabilities into the recurrent units and methods which use flow networks. This study indicates common outcomes of the compared methods like the benefit of including the temporal context into object detection and states conclusions and guidelines for video object detection networks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes