CVJun 5, 2025

ProJo4D: Progressive Joint Optimization for Sparse-View Inverse Physics Estimation

arXiv:2506.05317v21 citationsh-index: 6
Originality Highly original
AI Analysis

This addresses the problem of scalable, physically accurate scene understanding for robotics and XR, offering a novel method to reduce error accumulation in sparse-view settings.

The paper tackles the challenge of estimating physics from sparse multi-view videos for applications like digital twins, proposing ProJo4D, a progressive joint optimization framework that outperforms prior methods in 4D future state prediction, novel view rendering, and material parameter estimation.

Neural rendering has made significant strides in 3D reconstruction and novel view synthesis. With the integration with physics, it opens up new applications. The inverse problem of estimating physics from visual data, however, still remains challenging, limiting its effectiveness for applications like physically accurate digital twin creation in robotics and XR. Existing methods that incorporate physics into neural rendering frameworks typically require dense multi-view videos as input, making them impractical for scalable, real-world use. When presented with sparse multi-view videos, the sequential optimization strategy used by existing approaches introduces significant error accumulation, e.g., poor initial 3D reconstruction leads to bad material parameter estimation in subsequent stages. Instead of sequential optimization, directly optimizing all parameters at the same time also fails due to the highly non-convex and often non-differentiable nature of the problem. We propose ProJo4D, a progressive joint optimization framework that gradually increases the set of jointly optimized parameters guided by their sensitivity, leading to fully joint optimization over geometry, appearance, physical state, and material property. Evaluations on PAC-NeRF and Spring-Gaus datasets show that ProJo4D outperforms prior work in 4D future state prediction, novel view rendering of future state, and material parameter estimation, demonstrating its effectiveness in physically grounded 4D scene understanding. For demos, please visit the project webpage: https://daniel03c1.github.io/ProJo4D/

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes