CVApr 26, 2020

Dynamic Scale Training for Object Detection

Yukang Chen, Peizhen Zhang, Zeming Li, Yanwei Li, Xiangyu Zhang, Lu Qi, Jian Sun, Jiaya Jia

arXiv:2004.12432v211.675 citationsHas Code

Originality Highly original

AI Analysis

This addresses scale variation in object detection for computer vision applications, offering a simple, inference-free improvement that generalizes across backbones and tasks.

The paper tackles the scale variation challenge in object detection by proposing Dynamic Scale Training (DST), which uses optimization feedback to dynamically guide data preparation, achieving over 2% Average Precision gain on MS COCO.

We propose a Dynamic Scale Training paradigm (abbreviated as DST) to mitigate scale variation challenge in object detection. Previous strategies like image pyramid, multi-scale training, and their variants are aiming at preparing scale-invariant data for model optimization. However, the preparation procedure is unaware of the following optimization process that restricts their capability in handling the scale variation. Instead, in our paradigm, we use feedback information from the optimization process to dynamically guide the data preparation. The proposed method is surprisingly simple yet obtains significant gains (2%+ Average Precision on MS COCO dataset), outperforming previous methods. Experimental results demonstrate the efficacy of our proposed DST method towards scale variation handling. It could also generalize to various backbones, benchmarks, and other challenging downstream tasks like instance segmentation. It does not introduce inference overhead and could serve as a free lunch for general detection configurations. Besides, it also facilitates efficient training due to fast convergence. Code and models are available at github.com/yukang2017/Stitcher.

View on arXiv PDF Code

Similar