CV | IR | Feb 1, 2017

ImageNet MPEG-7 Visual Descriptors - Technical Report

arXiv:1702.00187v1 | 3 citations
Originality: Synthesis-oriented
AI Analysis

This work provides additional standardized features for researchers using ImageNet, but it is incremental: it extends an existing resource without introducing new methods.

The authors address the limited range of visual descriptors available for the ImageNet database by extracting MPEG-7 visual descriptors from its images and making those descriptors publicly available to support research in visual recognition applications.

ImageNet is a large-scale, publicly available image database. It currently offers more than 14 million images, organised according to the WordNet hierarchy. One of the main objectives of its creators is to provide the research community with a relevant database for visual recognition applications such as object recognition, image classification, or object localisation. However, only a few visual descriptors of the images are available to researchers. Only SIFT-based features have been extracted, and only for a subset of the collection. This technical report presents the extraction of some MPEG-7 visual descriptors from the ImageNet database. These descriptors are made publicly available in an effort towards open research.
