CV · AI · CE · LG · RO · Jun 9, 2025

FreeGave: 3D Physics Learning from Dynamic Videos by Gaussian Velocity

arXiv:2506.07865v19 citations · h-index: 3 · Has Code · CVPR
Originality: Highly original
AI Analysis

This paper addresses the challenge of learning complex physical motions in dynamic 3D scenes for computer vision applications, and it presents a novel method rather than an incremental improvement.

The paper tackles the problem of modeling 3D scene geometry, appearance, and physics from multi-view videos without object priors, achieving superior performance in future frame extrapolation and motion segmentation on multiple datasets.

In this paper, we aim to model 3D scene geometry, appearance, and the underlying physics purely from multi-view videos. By applying various governing PDEs as PINN losses or incorporating physics simulation into neural networks, existing works often fail to learn complex physical motions at boundaries or require object priors such as masks or types. In this paper, we propose FreeGave to learn the physics of complex dynamic 3D scenes without needing any object priors. The key to our approach is to introduce a physics code followed by a carefully designed divergence-free module for estimating a per-Gaussian velocity field, without relying on the inefficient PINN losses. Extensive experiments on three public datasets and a newly collected challenging real-world dataset demonstrate the superior performance of our method for future frame extrapolation and motion segmentation. Most notably, our investigation into the learned physics codes reveals that they truly learn meaningful 3D physical motion patterns in the absence of any human labels in training.
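The abstract does not spell out how the divergence-free module is built, so the following is only a minimal sketch of one standard way to obtain a divergence-free velocity field: take the curl of a smooth vector potential, since the divergence of a curl is identically zero. Everything here is an assumption made for illustration and is not taken from the FreeGave code: the toy `potential` function, the finite-difference derivatives, and the `advect` helper that moves a single Gaussian center with one explicit Euler step.

```python
import math

def potential(p):
    """Hypothetical smooth vector potential A(x, y, z); any smooth field works."""
    x, y, z = p
    return (math.sin(y) * z, math.cos(z) * x, math.sin(x) * y)

def partial(f, p, axis, comp, h=1e-4):
    """Central-difference estimate of d f[comp] / d p[axis] at point p."""
    hi, lo = list(p), list(p)
    hi[axis] += h
    lo[axis] -= h
    return (f(hi)[comp] - f(lo)[comp]) / (2 * h)

def velocity(p):
    """v = curl(A), which is divergence-free by construction."""
    return (
        partial(potential, p, 1, 2) - partial(potential, p, 2, 1),
        partial(potential, p, 2, 0) - partial(potential, p, 0, 2),
        partial(potential, p, 0, 1) - partial(potential, p, 1, 0),
    )

def divergence(f, p, h=1e-3):
    """Numerical divergence of a vector field f at point p."""
    return sum(partial(f, p, i, i, h) for i in range(3))

def advect(center, dt):
    """Move one Gaussian center along the velocity field (explicit Euler step)."""
    v = velocity(center)
    return tuple(c + dt * vc for c, vc in zip(center, v))

if __name__ == "__main__":
    center = (0.3, -1.2, 0.7)
    print("velocity:", velocity(center))
    print("divergence:", divergence(velocity, center))
    print("center after dt=0.1:", advect(center, 0.1))
```

In a learned setting the potential would presumably be a network output conditioned on something like the physics code, and the derivatives would come from automatic differentiation rather than finite differences; the sketch only illustrates why such a construction needs no PINN-style divergence penalty.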

Code Implementations: 1 repo
