CVDec 2, 2017

Lecture video indexing using boosted margin maximizing neural networks

arXiv:1712.00575v19 citations
Originality Incremental advance
AI Analysis

This addresses the problem of efficient lecture video indexing for educational applications, representing an incremental improvement with a novel hybrid method.

The paper tackles lecture video indexing by matching slide images to video frames using a boosted deep convolutional neural network system, achieving significantly better handling of occlusion, spatial transformations, and noise compared to existing methods.

This paper presents a novel approach for lecture video indexing using a boosted deep convolutional neural network system. The indexing is performed by matching high quality slide images, for which text is either known or extracted, to lower resolution video frames with possible noise, perspective distortion, and occlusions. We propose a deep neural network integrated with a boosting framework composed of two sub-networks targeting feature extraction and similarity determination to perform the matching. The trained network is given as input a pair of slide image and a candidate video frame image and produces the similarity between them. A boosting framework is integrated into our proposed network during the training process. Experimental results show that the proposed approach is much more capable of handling occlusion, spatial transformations, and other types of noises when compared with known approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes