IRCVFeb 27, 2016

Content-based Video Indexing and Retrieval Using Corr-LDA

arXiv:1602.08581v223 citations
AI Analysis

This addresses the issue of sparse user-provided tagging for video search on multimedia websites, offering an incremental improvement through semantic content matching.

The paper tackled the problem of video indexing and retrieval by proposing a content-based method using the corr-LDA probabilistic framework to auto-annotate videos with textual descriptors, achieving increased accuracy in retrieval compared to an SVM-based approach.

Existing video indexing and retrieval methods on popular web-based multimedia sharing websites are based on user-provided sparse tagging. This paper proposes a very specific way of searching for video clips, based on the content of the video. We present our work on Content-based Video Indexing and Retrieval using the Correspondence-Latent Dirichlet Allocation (corr-LDA) probabilistic framework. This is a model that provides for auto-annotation of videos in a database with textual descriptors, and brings the added benefit of utilizing the semantic relations between the content of the video and text. We use the concept-level matching provided by corr-LDA to build correspondences between text and multimedia, with the objective of retrieving content with increased accuracy. In our experiments, we employ only the audio components of the individual recordings and compare our results with an SVM-based approach.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes