Miha Skalic

2papers

2 Papers

CVOct 25, 2019
Learning to Localize Temporal Events in Large-scale Video Data

Mikel Bober-Irizar, Miha Skalic, David Austin

We address temporal localization of events in large-scale video data, in the context of the Youtube-8M Segments dataset. This emerging field within video recognition can enable applications to identify the precise time a specified event occurs in a video, which has broad implications for video search. To address this we present two separate approaches: (1) a gradient boosted decision tree model on a crafted dataset and (2) a combination of deep learning models based on frame-level data, video-level data, and a localization model. The combinations of these two approaches achieved 5th place in the 3rd Youtube-8M video recognition challenge.

MLJun 14, 2017
Deep Learning Methods for Efficient Large Scale Video Labeling

Miha Skalic, Marcin Pekalski, Xingguo E. Pan

We present a solution to "Google Cloud and YouTube-8M Video Understanding Challenge" that ranked 5th place. The proposed model is an ensemble of three model families, two frame level and one video level. The training was performed on augmented dataset, with cross validation.