CVJan 25

Frequency-aware Neural Representation for Videos

arXiv:2601.17741v11 citations
Originality Incremental advance
AI Analysis

This work addresses video compression for applications requiring high fidelity, representing an incremental improvement over existing INR methods.

The paper tackled the problem of spectral bias in implicit neural representations for video compression, which leads to over-smoothed reconstructions, and proposed FaNeRV to decouple low- and high-frequency components, achieving competitive rate-distortion performance against traditional codecs.

Implicit Neural Representations (INRs) have emerged as a promising paradigm for video compression. However, existing INR-based frameworks typically suffer from inherent spectral bias, which favors low-frequency components and leads to over-smoothed reconstructions and suboptimal rate-distortion performance. In this paper, we propose FaNeRV, a Frequency-aware Neural Representation for videos, which explicitly decouples low- and high-frequency components to enable efficient and faithful video reconstruction. FaNeRV introduces a multi-resolution supervision strategy that guides the network to progressively capture global structures and fine-grained textures through staged supervision . To further enhance high-frequency reconstruction, we propose a dynamic high-frequency injection mechanism that adaptively emphasizes challenging regions. In addition, we design a frequency-decomposed network module to improve feature modeling across different spectral bands. Extensive experiments on standard benchmarks demonstrate that FaNeRV significantly outperforms state-of-the-art INR methods and achieves competitive rate-distortion performance against traditional codecs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes