CVDec 6, 2018

OMNIA Faster R-CNN: Detection in the wild through dataset merging and soft distillation

arXiv:1812.02611v218 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of domain adaptation in object detection for real-world applications, offering an incremental improvement through dataset merging and soft distillation.

The paper tackles the problem of object detectors performing poorly in new domains by merging datasets without additional labeling, resulting in a performance increase from 45.5% to 57.4% mAP for fashion detection and beating state-of-the-art by 5.3 points for domain shift detection.

Object detectors tend to perform poorly in new or open domains, and require exhaustive yet costly annotations from fully labeled datasets. We aim at benefiting from several datasets with different categories but without additional labelling, not only to increase the number of categories detected, but also to take advantage from transfer learning and to enhance domain independence. Our dataset merging procedure starts with training several initial Faster R-CNN on the different datasets while considering the complementary datasets' images for domain adaptation. Similarly to self-training methods, the predictions of these initial detectors mitigate the missing annotations on the complementary datasets. The final OMNIA Faster R-CNN is trained with all categories on the union of the datasets enriched by predictions. The joint training handles unsafe targets with a new classification loss called SoftSig in a softly supervised way. Experimental results show that in the case of fashion detection for images in the wild, merging Modanet with COCO increases the final performance from 45.5% to 57.4% in mAP. Applying our soft distillation to the task of detection with domain shift between GTA and Cityscapes enables to beat the state-of-the-art by 5.3 points. Our methodology could unlock object detection for real-world applications without immense datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes