CV, AI, LG
Sep 5, 2017

Fine-tuning deep CNN models on specific MS COCO categories

arXiv:1709.01476v1
15 citations
Originality: Synthesis-oriented
AI Analysis

This is an incremental tool for researchers or practitioners working with object detection on MS COCO, simplifying the fine-tuning process.

The paper introduces py-faster-rcnn-ft, a software library for fine-tuning the VGG_CNN_M_1024 model on custom subsets of the MS COCO dataset. It selects suitable images automatically, so users do not have to search for them by hand.

Fine-tuning of a deep convolutional neural network (CNN) is often desired. This paper provides an overview of our publicly available py-faster-rcnn-ft software library, which can be used to fine-tune the VGG_CNN_M_1024 model on custom subsets of the Microsoft Common Objects in Context (MS COCO) dataset. For example, we improved the procedure so that the user no longer has to search the dataset by hand for suitable image files to use in the demo program. Our implementation randomly selects images that contain at least one object of the categories on which the model is fine-tuned.
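
The random selection described in the abstract can be illustrated with a minimal sketch. This is not the py-faster-rcnn-ft code. It assumes a dictionary in the standard MS COCO annotation layout (`images`, `annotations`, `categories`), and the function name `select_images`, its parameters, and the toy data are hypothetical, chosen only for illustration.

```python
import random


def select_images(coco, category_names, k, seed=None):
    """Pick up to k random image file names that contain at least one
    object of the given categories (hypothetical helper, not the
    library's API)."""
    # Map the requested category names to their numeric COCO ids.
    wanted_ids = {c["id"] for c in coco["categories"]
                  if c["name"] in category_names}
    # Collect every image that has at least one annotation of a wanted category.
    image_ids = {a["image_id"] for a in coco["annotations"]
                 if a["category_id"] in wanted_ids}
    # Sort so that a fixed seed gives a reproducible draw.
    candidates = sorted(img["file_name"] for img in coco["images"]
                        if img["id"] in image_ids)
    rng = random.Random(seed)
    return rng.sample(candidates, min(k, len(candidates)))


# Tiny hand-made stand-in for a COCO annotation file (assumed data).
toy = {
    "categories": [
        {"id": 1, "name": "person"},
        {"id": 17, "name": "cat"},
        {"id": 18, "name": "dog"},
    ],
    "images": [
        {"id": 1, "file_name": "a.jpg"},
        {"id": 2, "file_name": "b.jpg"},
        {"id": 3, "file_name": "c.jpg"},
    ],
    "annotations": [
        {"image_id": 1, "category_id": 17},
        {"image_id": 2, "category_id": 18},
        {"image_id": 2, "category_id": 1},
        {"image_id": 3, "category_id": 1},
    ],
}

if __name__ == "__main__":
    print(select_images(toy, {"cat", "dog"}, k=2, seed=0))
```

With a real dataset, the dictionary would come from loading an MS COCO annotation JSON file. Filtering on annotations rather than on file names is what guarantees that each selected image contains at least one object of a fine-tuned category.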

Code Implementations: 1 repo
