CVSPOct 21, 2021

Fast Graph Sampling for Short Video Summarization using Gershgorin Disc Alignment

arXiv:2110.11420v24 citations
Originality Incremental advance
AI Analysis

This addresses the need for efficient video summarization, particularly for short videos, but is incremental as it builds on existing graph sampling techniques.

The paper tackles the problem of efficiently summarizing short videos into keyframes by developing a fast graph sampling algorithm based on Gershgorin disc alignment, which achieves comparable performance to state-of-the-art methods with substantially reduced complexity.

We study the problem of efficiently summarizing a short video into several keyframes, leveraging recent progress in fast graph sampling. Specifically, we first construct a similarity path graph (SPG) $\mathcal{G}$, represented by graph Laplacian matrix $\mathbf{L}$, where the similarities between adjacent frames are encoded as positive edge weights. We show that maximizing the smallest eigenvalue $λ_{\min}(\mathbf{B})$ of a coefficient matrix $\mathbf{B} = \text{diag}(\mathbf{a}) + μ\mathbf{L}$, where $\mathbf{a}$ is the binary keyframe selection vector, is equivalent to minimizing a worst-case signal reconstruction error. We prove that, after partitioning $\mathcal{G}$ into $Q$ sub-graphs $\{\mathcal{G}^q\}^Q_{q=1}$, the smallest Gershgorin circle theorem (GCT) lower bound of $Q$ corresponding coefficient matrices -- $\min_q λ^-_{\min}(\mathbf{B}^q)$ -- is a lower bound for $λ_{\min}(\mathbf{B})$. This inspires a fast graph sampling algorithm to iteratively partition $\mathcal{G}$ into $Q$ sub-graphs using $Q$ samples (keyframes), while maximizing $λ^-_{\min}(\mathbf{B}^q)$ for each sub-graph $\mathcal{G}^q$. Experimental results show that our algorithm achieves comparable video summarization performance as state-of-the-art methods, at a substantially reduced complexity.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes