CV LG NEApr 8, 2021

Prototypical Region Proposal Networks for Few-Shot Localization and Classification

Elliott Skomski, Aaron Tuor, Andrew Avila, Lauren Phillips, Zachary New, Henry Kvinge, Courtney D. Corley, Nathan Hodas

arXiv:2104.03496v11.4

Originality Incremental advance

AI Analysis

This addresses a limitation in few-shot learning for real-world images with multiple objects, though it is incremental as it builds on existing segmentation and classification methods.

The paper tackles the problem of few-shot image classification in densely-annotated, busy images where objects are not central subjects, by developing PRoPnet, a framework that uses prototype-based segmentation for region proposals to condition classifiers, improving accuracy on natural scene datasets.

Recently proposed few-shot image classification methods have generally focused on use cases where the objects to be classified are the central subject of images. Despite success on benchmark vision datasets aligned with this use case, these methods typically fail on use cases involving densely-annotated, busy images: images common in the wild where objects of relevance are not the central subject, instead appearing potentially occluded, small, or among other incidental objects belonging to other classes of potential interest. To localize relevant objects, we employ a prototype-based few-shot segmentation model which compares the encoded features of unlabeled query images with support class centroids to produce region proposals indicating the presence and location of support set classes in a query image. These region proposals are then used as additional conditioning input to few-shot image classifiers. We develop a framework to unify the two stages (segmentation and classification) into an end-to-end classification model -- PRoPnet -- and empirically demonstrate that our methods improve accuracy on image datasets with natural scenes containing multiple object classes.

View on arXiv PDF

Similar