CVJul 10, 2020

Optical Flow Distillation: Towards Efficient and Stable Video Style Transfer

arXiv:2007.05146v265 citations
AI Analysis

This work addresses efficiency and stability issues in video style transfer for mobile applications, representing an incremental improvement over existing methods.

The paper tackles the problem of inefficient and unstable video style transfer by proposing a knowledge distillation method that uses two teacher networks to highlight optical flow improvements, enabling a lightweight student network to generate stable videos without optical flow, achieving faster inference times.

Video style transfer techniques inspire many exciting applications on mobile devices. However, their efficiency and stability are still far from satisfactory. To boost the transfer stability across frames, optical flow is widely adopted, despite its high computational complexity, e.g. occupying over 97% inference time. This paper proposes to learn a lightweight video style transfer network via knowledge distillation paradigm. We adopt two teacher networks, one of which takes optical flow during inference while the other does not. The output difference between these two teacher networks highlights the improvements made by optical flow, which is then adopted to distill the target student network. Furthermore, a low-rank distillation loss is employed to stabilize the output of student network by mimicking the rank of input videos. Extensive experiments demonstrate that our student network without an optical flow module is still able to generate stable video and runs much faster than the teacher network.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes