CVMar 21, 2025

HyperNVD: Accelerating Neural Video Decomposition via Hypernetworks

Maria Pilligua, Danna Xue, Javier Vazquez-Corral

arXiv:2503.17276v18.42 citationsh-index: 2Has CodeCVPR

Originality Incremental advance

AI Analysis

This work addresses a bottleneck in video editing for creative industries by accelerating layer-based decomposition, though it is incremental as it builds on existing implicit neural representation methods.

The paper tackles the slow training of neural video decomposition models for new videos by proposing a hypernetwork-based meta-learning approach, which reduces convergence time by 60% compared to per-video training.

Decomposing a video into a layer-based representation is crucial for easy video editing for the creative industries, as it enables independent editing of specific layers. Existing video-layer decomposition models rely on implicit neural representations (INRs) trained independently for each video, making the process time-consuming when applied to new videos. Noticing this limitation, we propose a meta-learning strategy to learn a generic video decomposition model to speed up the training on new videos. Our model is based on a hypernetwork architecture which, given a video-encoder embedding, generates the parameters for a compact INR-based neural video decomposition model. Our strategy mitigates the problem of single-video overfitting and, importantly, shortens the convergence of video decomposition on new, unseen videos. Our code is available at: https://hypernvd.github.io/

View on arXiv PDF

Similar