CVAIROApr 1, 2024

Object-conditioned Bag of Instances for Few-Shot Personalized Instance Recognition

arXiv:2404.01397v12 citationsh-index: 19ICASSP
Originality Incremental advance
AI Analysis

This addresses the demand for personalized vision systems in applications like smart devices, though it is an incremental improvement over existing approaches.

The paper tackles the problem of few-shot personalized instance recognition, where vision systems need to identify specific personal instances (e.g., a user's dog) from limited data, achieving 77.1% accuracy with a 12% relative gain over state-of-the-art methods.

Nowadays, users demand for increased personalization of vision systems to localize and identify personal instances of objects (e.g., my dog rather than dog) from a few-shot dataset only. Despite outstanding results of deep networks on classical label-abundant benchmarks (e.g., those of the latest YOLOv8 model for standard object detection), they struggle to maintain within-class variability to represent different instances rather than object categories only. We construct an Object-conditioned Bag of Instances (OBoI) based on multi-order statistics of extracted features, where generic object detection models are extended to search and identify personal instances from the OBoI's metric space, without need for backpropagation. By relying on multi-order statistics, OBoI achieves consistent superior accuracy in distinguishing different instances. In the results, we achieve 77.1% personal object recognition accuracy in case of 18 personal instances, showing about 12% relative gain over the state of the art.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes