CV AINov 9, 2025

MambaOVSR: Multiscale Fusion with Global Motion Modeling for Chinese Opera Video Super-Resolution

Hua Chang, Xin Xu, Wei Liu, Wei Wang, Xin Yuan, Kui Jiang

arXiv:2511.06172v16.21 citationsh-index: 4

Originality Incremental advance

AI Analysis

This addresses archival preservation of culturally significant Chinese opera videos, though it appears to be a domain-specific incremental advancement.

The paper tackles the challenge of restoring degraded Chinese opera videos by creating a new dataset (COVC) and proposing MambaOVSR, a method that achieves an average 1.86 dB PSNR improvement over state-of-the-art STVSR methods.

Chinese opera is celebrated for preserving classical art. However, early filming equipment limitations have degraded videos of last-century performances by renowned artists (e.g., low frame rates and resolution), hindering archival efforts. Although space-time video super-resolution (STVSR) has advanced significantly, applying it directly to opera videos remains challenging. The scarcity of datasets impedes the recovery of high frequency details, and existing STVSR methods lack global modeling capabilities, compromising visual quality when handling opera's characteristic large motions. To address these challenges, we pioneer a large scale Chinese Opera Video Clip (COVC) dataset and propose the Mamba-based multiscale fusion network for space-time Opera Video Super-Resolution (MambaOVSR). Specifically, MambaOVSR involves three novel components: the Global Fusion Module (GFM) for motion modeling through a multiscale alternating scanning mechanism, and the Multiscale Synergistic Mamba Module (MSMM) for alignment across different sequence lengths. Additionally, our MambaVR block resolves feature artifacts and positional information loss during alignment. Experimental results on the COVC dataset show that MambaOVSR significantly outperforms the SOTA STVSR method by an average of 1.86 dB in terms of PSNR. Dataset and Code will be publicly released.

View on arXiv PDF

Similar