MMIROct 10, 2018

V3C - a Research Video Collection

arXiv:1810.04401v2111 citations
AI Analysis

This provides a new dataset for video retrieval and analysis research, addressing gaps in existing collections, but it is incremental as it focuses on data collection rather than novel methods.

The authors tackled the lack of large-scale, representative video datasets for research by introducing V3C, a collection of 28,450 videos (about 3,800 hours) from Vimeo under creative commons license, which includes shot segmentation, keyframes, and metadata for use in TRECVid from 2019.

With the widespread use of smartphones as recording devices and the massive growth in bandwidth, the number and volume of video collections has increased significantly in the last years. This poses novel challenges to the management of these large-scale video data and especially to the analysis of and retrieval from such video collections. At the same time, existing video datasets used for research and experimentation are either not large enough to represent current collections or do not reflect the properties of video commonly found on the Internet in terms of content, length, or resolution. In this paper, we introduce the Vimeo Creative Commons Collection, in short V3C, a collection of 28'450 videos (with overall length of about 3'800 hours) published under creative commons license on Vimeo. V3C comes with a shot segmentation for each video, together with the resulting keyframes in original as well as reduced resolution and additional metadata. It is intended to be used from 2019 at the International large-scale TREC Video Retrieval Evaluation campaign (TRECVid).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes