CVMar 21, 2025

HyperNVD: Accelerating Neural Video Decomposition via Hypernetworks

arXiv:2503.17276v12 citationsh-index: 2Has CodeCVPR
Originality Incremental advance
AI Analysis

This work addresses a bottleneck in video editing for creative industries by accelerating layer-based decomposition, though it is incremental as it builds on existing implicit neural representation methods.

The paper tackles the slow training of neural video decomposition models for new videos by proposing a hypernetwork-based meta-learning approach, which reduces convergence time by 60% compared to per-video training.

Decomposing a video into a layer-based representation is crucial for easy video editing for the creative industries, as it enables independent editing of specific layers. Existing video-layer decomposition models rely on implicit neural representations (INRs) trained independently for each video, making the process time-consuming when applied to new videos. Noticing this limitation, we propose a meta-learning strategy to learn a generic video decomposition model to speed up the training on new videos. Our model is based on a hypernetwork architecture which, given a video-encoder embedding, generates the parameters for a compact INR-based neural video decomposition model. Our strategy mitigates the problem of single-video overfitting and, importantly, shortens the convergence of video decomposition on new, unseen videos. Our code is available at: https://hypernvd.github.io/

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes