GRCVApr 29, 2020

Interactive Video Stylization Using Few-Shot Patch-Based Training

arXiv:2004.14489v185 citations
AI Analysis

This addresses the problem of efficient and interactive video stylization for artists, offering an incremental improvement over previous methods by reducing training requirements.

The paper tackles video stylization by enabling artists to propagate style from a few keyframes to entire sequences, achieving semantically meaningful results without lengthy pre-training or large datasets, and supports real-time inference and interactive editing.

In this paper, we present a learning-based method to the keyframe-based video stylization that allows an artist to propagate the style from a few selected keyframes to the rest of the sequence. Its key advantage is that the resulting stylization is semantically meaningful, i.e., specific parts of moving objects are stylized according to the artist's intention. In contrast to previous style transfer techniques, our approach does not require any lengthy pre-training process nor a large training dataset. We demonstrate how to train an appearance translation network from scratch using only a few stylized exemplars while implicitly preserving temporal consistency. This leads to a video stylization framework that supports real-time inference, parallel processing, and random access to an arbitrary output frame. It can also merge the content from multiple keyframes without the need to perform an explicit blending operation. We demonstrate its practical utility in various interactive scenarios, where the user paints over a selected keyframe and sees her style transferred to an existing recorded sequence or a live video stream.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes