CVApr 11, 2022

CFA: Constraint-based Finetuning Approach for Generalized Few-Shot Object Detection

arXiv:2204.05220v121 citationsh-index: 7
Originality Incremental advance
AI Analysis

This addresses a realistic scenario in object detection where models must handle both base and novel classes without forgetting, though it is incremental as it builds on existing continual learning techniques.

The paper tackles catastrophic forgetting in generalized few-shot object detection by proposing a constraint-based finetuning approach (CFA), which adapts a continual learning method to improve novel class performance with minor base class degradation, achieving competitive results on MS-COCO and PASCAL-VOC datasets.

Few-shot object detection (FSOD) seeks to detect novel categories with limited data by leveraging prior knowledge from abundant base data. Generalized few-shot object detection (G-FSOD) aims to tackle FSOD without forgetting previously seen base classes and, thus, accounts for a more realistic scenario, where both classes are encountered during test time. While current FSOD methods suffer from catastrophic forgetting, G-FSOD addresses this limitation yet exhibits a performance drop on novel tasks compared to the state-of-the-art FSOD. In this work, we propose a constraint-based finetuning approach (CFA) to alleviate catastrophic forgetting, while achieving competitive results on the novel task without increasing the model capacity. CFA adapts a continual learning method, namely Average Gradient Episodic Memory (A-GEM) to G-FSOD. Specifically, more constraints on the gradient search strategy are imposed from which a new gradient update rule is derived, allowing for better knowledge exchange between base and novel classes. To evaluate our method, we conduct extensive experiments on MS-COCO and PASCAL-VOC datasets. Our method outperforms current FSOD and G-FSOD approaches on the novel task with minor degeneration on the base task. Moreover, CFA is orthogonal to FSOD approaches and operates as a plug-and-play module without increasing the model capacity or inference time.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes