CV | Sep 26, 2019

Learning Energy-based Spatial-Temporal Generative ConvNets for Dynamic Patterns

arXiv:1909.11975v1 | 54 citations
Originality: Incremental advance
AI Analysis

This work targets video modeling in computer vision and offers a method that can learn from incomplete data. It appears incremental, since it builds on existing energy-based and ConvNet frameworks.

The authors tackle the problem of modeling and synthesizing dynamic patterns in video sequences, such as dynamic textures and actions. They propose an energy-based spatial-temporal generative ConvNet that can be learned from training videos, including incomplete ones, and can then synthesize realistic patterns.

Video sequences contain rich dynamic patterns, such as dynamic texture patterns that exhibit stationarity in the temporal domain, and action patterns that are non-stationary in either spatial or temporal domain. We show that an energy-based spatial-temporal generative ConvNet can be used to model and synthesize dynamic patterns. The model defines a probability distribution on the video sequence, and the log probability is defined by a spatial-temporal ConvNet that consists of multiple layers of spatial-temporal filters to capture spatial-temporal patterns of different scales. The model can be learned from the training video sequences by an "analysis by synthesis" learning algorithm that iterates the following two steps. Step 1 synthesizes video sequences from the currently learned model. Step 2 then updates the model parameters based on the difference between the synthesized video sequences and the observed training sequences. We show that the learning algorithm can synthesize realistic dynamic patterns. We also show that it is possible to learn the model from incomplete training sequences with either occluded pixels or missing frames, so that model learning and pattern completion can be accomplished simultaneously.
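The two-step "analysis by synthesis" loop described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation. It assumes a single layer of ReLU filters that each span the whole (tiny) clip instead of a multi-layer spatial-temporal ConvNet, a Gaussian white-noise reference distribution, and Langevin dynamics as the sampler for Step 1; the abstract itself does not specify the sampler. All function names, shapes, and hyperparameters (`score_and_grads`, `langevin_synthesize`, `train`, `n_filters`, `step_size`, and so on) are hypothetical and chosen only for illustration.

```python
import numpy as np


def score_and_grads(Y, W, b):
    """Toy score f(Y) = sum_k relu(<W_k, Y> + b_k) on flattened clips.

    Y: (n, d) clips, W: (K, d) filters, b: (K,) biases.
    Returns f (n,), df/dY (n, d), and batch-averaged df/dW (K, d), df/db (K,).
    """
    pre = Y @ W.T + b                  # filter responses, shape (n, K)
    act = (pre > 0).astype(Y.dtype)    # derivative of relu
    f = np.maximum(pre, 0.0).sum(axis=1)
    dY = act @ W                       # gradient w.r.t. each clip
    dW = act.T @ Y / len(Y)            # gradient w.r.t. filters, batch mean
    db = act.mean(axis=0)
    return f, dY, dW, db


def langevin_synthesize(Y, W, b, rng, steps=20, step_size=0.1, sigma=1.0):
    """Step 1 (assumed sampler): Langevin dynamics on
    log p(Y) = f(Y) - ||Y||^2 / (2 sigma^2) + const."""
    Y = Y.copy()
    for _ in range(steps):
        _, dY, _, _ = score_and_grads(Y, W, b)
        grad_log_p = dY - Y / sigma**2
        noise = rng.standard_normal(Y.shape)
        Y = Y + 0.5 * step_size**2 * grad_log_p + step_size * noise
    return Y


def train(observed, n_filters=8, n_syn=16, iters=50, lr=0.01, seed=0):
    """Alternate synthesis (Step 1) and parameter update (Step 2)."""
    rng = np.random.default_rng(seed)
    n, T, H, Wd = observed.shape       # clips, frames, height, width
    d = T * H * Wd
    Y_obs = observed.reshape(n, d)
    W = 0.01 * rng.standard_normal((n_filters, d))
    b = np.zeros(n_filters)
    Y_syn = rng.standard_normal((n_syn, d))   # chains kept across iterations
    for _ in range(iters):
        # Step 1: synthesize clips from the current model.
        Y_syn = langevin_synthesize(Y_syn, W, b, rng)
        # Step 2: move parameters along (observed stats - synthesized stats).
        _, _, dW_obs, db_obs = score_and_grads(Y_obs, W, b)
        _, _, dW_syn, db_syn = score_and_grads(Y_syn, W, b)
        W += lr * (dW_obs - dW_syn)
        b += lr * (db_obs - db_syn)
    return W, b, Y_syn.reshape(n_syn, T, H, Wd)


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    videos = rng.standard_normal((4, 3, 4, 4)) + 1.0  # 4 random toy clips
    W, b, syn = train(videos)
    print(syn.shape)
```

The update in Step 2 is the difference between the average parameter gradient on observed clips and on synthesized clips, which is what "the difference between the synthesized video sequences and the observed training sequences" amounts to under this toy score function. A realistic version would replace `score_and_grads` with a multi-layer spatial-temporal ConvNet and automatic differentiation.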

