CV · Jul 22, 2015

Part Localization using Multi-Proposal Consensus for Fine-Grained Categorization

arXiv:1507.06332v1 · 43 citations
AI Analysis

This paper addresses fine-grained categorization for computer vision applications and reports incremental improvements in keypoint localization.

The paper tackles fine-grained bird recognition by simultaneously predicting keypoint locations and visibilities using a deep learning framework, achieving state-of-the-art performance with over 2% improvement over existing methods.

We present a simple deep learning framework to simultaneously predict keypoint locations and their respective visibilities and use those to achieve state-of-the-art performance for fine-grained classification. We show that by conditioning the predictions on object proposals with sufficient image support, our method can do well without complicated spatial reasoning. Instead, inference methods with robustness to outliers yield state-of-the-art results for keypoint localization. We demonstrate the effectiveness of our accurate keypoint localization and visibility prediction on the fine-grained bird recognition task with and without ground truth bird bounding boxes, and outperform existing state-of-the-art methods by over 2%.
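The abstract's idea of combining per-proposal predictions with outlier-robust inference can be illustrated with a small sketch. It is not the paper's actual inference procedure, which the abstract does not specify. The coordinate-wise median stands in as one simple robust aggregator, and the function name `consensus_keypoint`, the `(x, y, visibility)` tuple format, and the `min_visibility` threshold are all hypothetical choices made for illustration.

```python
from statistics import median


def consensus_keypoint(predictions, min_visibility=0.5):
    """Aggregate per-proposal predictions for a single keypoint.

    predictions: list of (x, y, visibility) tuples, one per object proposal.
    This format is an assumption: x and y are image coordinates and
    visibility is a score in [0, 1].

    Returns (x, y, True) with the coordinate-wise median of the proposals
    that consider the keypoint visible, or (None, None, False) if fewer
    than half of the proposals do.
    """
    # Keep only the proposals that vote "visible" for this keypoint.
    votes = [(x, y) for x, y, v in predictions if v >= min_visibility]

    # Simple majority rule on visibility (an illustrative choice).
    if not votes or 2 * len(votes) < len(predictions):
        return None, None, False

    xs, ys = zip(*votes)
    # The median ignores a minority of wildly wrong proposals,
    # whereas a mean would be dragged toward them.
    return median(xs), median(ys), True


# Four proposals agree near (100, 50); one is a gross outlier and
# one votes "not visible".
preds = [
    (100.0, 50.0, 0.90),
    (102.0, 51.0, 0.80),
    (98.0, 49.0, 0.95),
    (300.0, 400.0, 0.70),  # outlier proposal
    (101.0, 50.0, 0.20),   # low visibility, excluded from the vote
]
print(consensus_keypoint(preds))
```

In this toy input the averaged x-coordinate of the four visible votes would be 150, far from the cluster near 100, while the median stays inside it. That is the kind of robustness to outliers the abstract refers to, under the assumptions stated above.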

