CVJun 14, 2019

Trimmed Action Recognition, Dense-Captioning Events in Videos, and Spatio-temporal Action Localization with Focus on ActivityNet Challenge 2019

Zhaofan Qiu, Dong Li, Yehao Li, Qi Cai, Yingwei Pan, Ting Yao

arXiv:1906.07016v14.14 citations

Originality Synthesis-oriented

AI Analysis

This work addresses video understanding tasks for researchers and practitioners in computer vision, but it is incremental as it focuses on applying existing methods to a specific challenge.

The paper presents systems for three tasks in the ActivityNet Challenge 2019: trimmed action recognition, dense-captioning events in videos, and spatio-temporal action localization, providing an overview and comparative analysis of their approaches.

This notebook paper presents an overview and comparative analysis of our systems designed for the following three tasks in ActivityNet Challenge 2019: trimmed action recognition, dense-captioning events in videos, and spatio-temporal action localization.

View on arXiv PDF

Similar