CVOct 26, 2019

Learning an Efficient Network for Large-Scale Hierarchical Object Detection with Data Imbalance: 3rd Place Solution to Open Images Challenge 2019

arXiv:1910.12044v11 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental solution for improving object detection in a specific competition dataset with known challenges like annotation incompleteness and imbalance.

The paper tackled large-scale hierarchical object detection with data imbalance in the Open Images Challenge 2019, achieving a mAP of 61.90 with a single model and 64.21 after ensemble, earning 3rd place.

This report details our solution to the Google AI Open Images Challenge 2019 Object Detection Track. Based on our detailed analysis on the Open Images dataset, it is found that there are four typical features: large-scale, hierarchical tag system, severe annotation incompleteness and data imbalance. Considering these characteristics, many strategies are employed, including larger backbone, distributed softmax loss, class-aware sampling, expert model, and heavier classifier. In virtue of these effective strategies, our best single model could achieve a mAP of 61.90. After ensemble, the final mAP is boosted to 67.17 in the public leaderboard and 64.21 in the private leaderboard, which earns 3rd place in the Open Images Challenge 2019.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes