CVJun 23, 2025

RDPO: Real Data Preference Optimization for Physics Consistency Video Generation

arXiv:2506.18655v110 citationsh-index: 14
Originality Incremental advance
AI Analysis

This work addresses the challenge of physical consistency in video generation for applications requiring realistic simulations, though it is incremental as it builds on existing preference-based optimization methods.

The paper tackles the problem of generating videos that are physically consistent by introducing Real Data Preference Optimization (RDPO), an annotation-free framework that distills physical priors from real-world videos, resulting in significant improvements in action coherence and physical realism as demonstrated by evaluations on multiple benchmarks.

Video generation techniques have achieved remarkable advancements in visual quality, yet faithfully reproducing real-world physics remains elusive. Preference-based model post-training may improve physical consistency, but requires costly human-annotated datasets or reward models that are not yet feasible. To address these challenges, we present Real Data Preference Optimisation (RDPO), an annotation-free framework that distills physical priors directly from real-world videos. Specifically, the proposed RDPO reverse-samples real video sequences with a pre-trained generator to automatically build preference pairs that are statistically distinguishable in terms of physical correctness. A multi-stage iterative training schedule then guides the generator to obey physical laws increasingly well. Benefiting from the dynamic information explored from real videos, our proposed RDPO significantly improves the action coherence and physical realism of the generated videos. Evaluations on multiple benchmarks and human evaluations have demonstrated that RDPO achieves improvements across multiple dimensions. The source code and demonstration of this paper are available at: https://wwenxu.github.io/RDPO/

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes