CLIRAug 7, 2017

Video Highlights Detection and Summarization with Lag-Calibration based on Concept-Emotion Mapping of Crowd-sourced Time-Sync Comments

arXiv:1708.02210v118 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of automatic video digestion for users of video-sharing platforms, though it is incremental as it builds on existing methods for handling noisy data.

The paper tackled the problem of detecting and summarizing video highlights from noisy and lagging crowdsourced time-sync comments by proposing a framework that uses concept-emotion mapping and lag calibration, resulting in highlight detection and summarization methods that outperform benchmarks with considerable margins.

With the prevalence of video sharing, there are increasing demands for automatic video digestion such as highlight detection. Recently, platforms with crowdsourced time-sync video comments have emerged worldwide, providing a good opportunity for highlight detection. However, this task is non-trivial: (1) time-sync comments often lag behind their corresponding shot; (2) time-sync comments are semantically sparse and noisy; (3) to determine which shots are highlights is highly subjective. The present paper aims to tackle these challenges by proposing a framework that (1) uses concept-mapped lexical-chains for lag calibration; (2) models video highlights based on comment intensity and combination of emotion and concept concentration of each shot; (3) summarize each detected highlight using improved SumBasic with emotion and concept mapping. Experiments on large real-world datasets show that our highlight detection method and summarization method both outperform other benchmarks with considerable margins.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes