CVLGApr 1, 2022

Proper Reuse of Image Classification Features Improves Object Detection

arXiv:2204.00484v235 citationsh-index: 24
Originality Incremental advance
AI Analysis

This work addresses the challenge of making object detection more efficient and effective, particularly for researchers with limited computational resources, though it is incremental in building on existing transfer learning practices.

The paper tackles the problem of improving object detection by freezing the classifier-initialized backbone instead of fine-tuning it, which consistently enhances many detection models and saves computational resources. The result includes performance gains on hard cases like long-tail object classes and significant resource savings.

A common practice in transfer learning is to initialize the downstream model weights by pre-training on a data-abundant upstream task. In object detection specifically, the feature backbone is typically initialized with Imagenet classifier weights and fine-tuned on the object detection task. Recent works show this is not strictly necessary under longer training regimes and provide recipes for training the backbone from scratch. We investigate the opposite direction of this end-to-end training trend: we show that an extreme form of knowledge preservation -- freezing the classifier-initialized backbone -- consistently improves many different detection models, and leads to considerable resource savings. We hypothesize and corroborate experimentally that the remaining detector components capacity and structure is a crucial factor in leveraging the frozen backbone. Immediate applications of our findings include performance improvements on hard cases like detection of long-tail object classes and computational and memory resource savings that contribute to making the field more accessible to researchers with access to fewer computational resources.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes