CVAIDec 11, 2025

RobustSora: De-Watermarked Benchmark for Robust AI-Generated Video Detection

arXiv:2512.10248v11 citations
Originality Synthesis-oriented
AI Analysis

This addresses the need for more robust detection methods in AI-generated content, but it is incremental as it focuses on benchmarking an overlooked factor rather than proposing a new detection solution.

The paper tackled the problem of AI-generated video detection being influenced by digital watermarks, by introducing RobustSora, a benchmark that evaluates watermark robustness, and found performance variations of 2-8 percentage points across models under watermark manipulation.

The proliferation of AI-generated video technologies poses challenges to information integrity. While recent benchmarks advance AIGC video detection, they overlook a critical factor: many state-of-the-art generative models embed digital watermarks in outputs, and detectors may partially rely on these patterns. To evaluate this influence, we present RobustSora, the benchmark designed to assess watermark robustness in AIGC video detection. We systematically construct a dataset of 6,500 videos comprising four types: Authentic-Clean (A-C), Authentic-Spoofed with fake watermarks (A-S), Generated-Watermarked (G-W), and Generated-DeWatermarked (G-DeW). Our benchmark introduces two evaluation tasks: Task-I tests performance on watermark-removed AI videos, while Task-II assesses false alarm rates on authentic videos with fake watermarks. Experiments with ten models spanning specialized AIGC detectors, transformer architectures, and MLLM approaches reveal performance variations of 2-8pp under watermark manipulation. Transformer-based models show consistent moderate dependency (6-8pp), while MLLMs exhibit diverse patterns (2-8pp). These findings indicate partial watermark dependency and highlight the need for watermark-aware training strategies. RobustSora provides essential tools to advance robust AIGC detection research.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes