CVMar 15, 2020

StarNet: towards Weakly Supervised Few-Shot Object Detection

arXiv:2003.06798v36 citations
AI Analysis

This addresses the challenge of reducing annotation costs for few-shot object detection, making it more applicable in data-scarce scenarios, though it builds incrementally on existing few-shot methods.

The paper tackles the problem of few-shot object detection without bounding box annotations by introducing StarNet, which uses image-level labels for meta-training and achieves significant improvements on weakly supervised few-shot object detection benchmarks.

Few-shot detection and classification have advanced significantly in recent years. Yet, detection approaches require strong annotation (bounding boxes) both for pre-training and for adaptation to novel classes, and classification approaches rarely provide localization of objects in the scene. In this paper, we introduce StarNet - a few-shot model featuring an end-to-end differentiable non-parametric star-model detection and classification head. Through this head, the backbone is meta-trained using only image-level labels to produce good features for jointly localizing and classifying previously unseen categories of few-shot test tasks using a star-model that geometrically matches between the query and support images (to find corresponding object instances). Being a few-shot detector, StarNet does not require any bounding box annotations, neither during pre-training nor for novel classes adaptation. It can thus be applied to the previously unexplored and challenging task of Weakly Supervised Few-Shot Object Detection (WS-FSOD), where it attains significant improvements over the baselines. In addition, StarNet shows significant gains on few-shot classification benchmarks that are less cropped around the objects (where object localization is key).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes