CVLGIVNov 28, 2019

One-Shot Object Detection with Co-Attention and Co-Excitation

arXiv:1911.12529v1193 citationsHas Code
Originality Highly original
AI Analysis

It addresses the problem of detecting objects from unseen classes with minimal examples for computer vision applications, representing an incremental advance in few-shot learning.

The paper tackles one-shot object detection, where a model detects objects of unseen classes using a single query image, by introducing a co-attention and co-excitation framework that achieves strong baseline performance on VOC and MS-COCO datasets.

This paper aims to tackle the challenging problem of one-shot object detection. Given a query image patch whose class label is not included in the training data, the goal of the task is to detect all instances of the same class in a target image. To this end, we develop a novel {\em co-attention and co-excitation} (CoAE) framework that makes contributions in three key technical aspects. First, we propose to use the non-local operation to explore the co-attention embodied in each query-target pair and yield region proposals accounting for the one-shot situation. Second, we formulate a squeeze-and-co-excitation scheme that can adaptively emphasize correlated feature channels to help uncover relevant proposals and eventually the target objects. Third, we design a margin-based ranking loss for implicitly learning a metric to predict the similarity of a region proposal to the underlying query, no matter its class label is seen or unseen in training. The resulting model is therefore a two-stage detector that yields a strong baseline on both VOC and MS-COCO under one-shot setting of detecting objects from both seen and never-seen classes. Codes are available at https://github.com/timy90022/One-Shot-Object-Detection.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes