CVAIJun 24, 2019

LMVP: Video Predictor with Leaked Motion Information

arXiv:1906.10101v1
Originality Incremental advance
AI Analysis

This addresses video prediction for applications like autonomous systems, but it appears incremental as it builds on existing methods with new components.

The paper tackles video frame prediction by introducing LMVP, which uses a motion guider to learn and guide temporal features and an adaptive filtering network for spatial consistency, achieving state-of-the-art results on synthetic and real data without human labeling.

We propose a Leaked Motion Video Predictor (LMVP) to predict future frames by capturing the spatial and temporal dependencies from given inputs. The motion is modeled by a newly proposed component, motion guider, which plays the role of both learner and teacher. Specifically, it {\em learns} the temporal features from real data and {\em guides} the generator to predict future frames. The spatial consistency in video is modeled by an adaptive filtering network. To further ensure the spatio-temporal consistency of the prediction, a discriminator is also adopted to distinguish the real and generated frames. Further, the discriminator leaks information to the motion guider and the generator to help the learning of motion. The proposed LMVP can effectively learn the static and temporal features in videos without the need for human labeling. Experiments on synthetic and real data demonstrate that LMVP can yield state-of-the-art results.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes