IR CVFeb 27, 2016

Content-based Video Indexing and Retrieval Using Corr-LDA

Rahul Radhakrishnan Iyer, Sanjeel Parekh, Vikas Mohandoss, Anush Ramsurat, Bhiksha Raj, Rita Singh

arXiv:1602.08581v210.823 citations

Originality Incremental advance

AI Analysis

This addresses the issue of sparse user-provided tagging for video search on multimedia websites, offering an incremental improvement through semantic content matching.

The paper tackled the problem of video indexing and retrieval by proposing a content-based method using the corr-LDA probabilistic framework to auto-annotate videos with textual descriptors, achieving increased accuracy in retrieval compared to an SVM-based approach.

Existing video indexing and retrieval methods on popular web-based multimedia sharing websites are based on user-provided sparse tagging. This paper proposes a very specific way of searching for video clips, based on the content of the video. We present our work on Content-based Video Indexing and Retrieval using the Correspondence-Latent Dirichlet Allocation (corr-LDA) probabilistic framework. This is a model that provides for auto-annotation of videos in a database with textual descriptors, and brings the added benefit of utilizing the semantic relations between the content of the video and text. We use the concept-level matching provided by corr-LDA to build correspondences between text and multimedia, with the objective of retrieving content with increased accuracy. In our experiments, we employ only the audio components of the individual recordings and compare our results with an SVM-based approach.

View on arXiv PDF

Similar