Sergey Podlesnyy

CVJul 17, 2019

Towards Data-Driven Automatic Video Editing

Sergey Podlesnyy

Automatic video editing involving at least the steps of selecting the most valuable footage from points of view of visual quality and the importance of action filmed; and cutting the footage into a brief and coherent visual story that would be interesting to watch is implemented in a purely data-driven manner. Visual semantic and aesthetic features are extracted by the ImageNet-trained convolutional neural network, and the editing controller is trained by an imitation learning algorithm. As a result, at test time the controller shows the signs of observing basic cinematography editing rules learned from the corpus of motion pictures masterpieces.

IRJan 28, 2016

Deep Learning Based Semantic Video Indexing and Retrieval

Anna Podlesnaya, Sergey Podlesnyy

We share the implementation details and testing results for video retrieval system based exclusively on features extracted by convolutional neural networks. We show that deep learned features might serve as universal signature for semantic content of video useful in many search and retrieval tasks. We further show that graph-based storage structure for video index allows to efficiently retrieving the content with complicated spatial and temporal search queries.

Sergey Podlesnyy

2 Papers