CVMar 15, 2022

OcclusionFusion: Occlusion-aware Motion Estimation for Real-time Dynamic 3D Reconstruction

Wenbin Lin, Chengwei Zheng, Jun-Hai Yong, Feng Xu

arXiv:2203.07977v117.047 citationsh-index: 28Has Code

Originality Incremental advance

AI Analysis

This addresses the problem of robust real-time 3D reconstruction for applications like robotics or AR/VR, though it appears incremental as it builds on existing single-view methods.

The paper tackles inaccurate inter-frame motion estimation in single-view RGBD-based real-time dynamic 3D reconstruction, which suffers from error accumulation and occlusions, by proposing OcclusionFusion to compute occlusion-aware 3D motion, resulting in outperforming existing methods by a large margin on public datasets and handling long, challenging sequences.

RGBD-based real-time dynamic 3D reconstruction suffers from inaccurate inter-frame motion estimation as errors may accumulate with online tracking. This problem is even more severe for single-view-based systems due to strong occlusions. Based on these observations, we propose OcclusionFusion, a novel method to calculate occlusion-aware 3D motion to guide the reconstruction. In our technique, the motion of visible regions is first estimated and combined with temporal information to infer the motion of the occluded regions through an LSTM-involved graph neural network. Furthermore, our method computes the confidence of the estimated motion by modeling the network output with a probabilistic model, which alleviates untrustworthy motions and enables robust tracking. Experimental results on public datasets and our own recorded data show that our technique outperforms existing single-view-based real-time methods by a large margin. With the reduction of the motion errors, the proposed technique can handle long and challenging motion sequences. Please check out the project page for sequence results: https://wenbin-lin.github.io/OcclusionFusion.

View on arXiv PDF Code

Similar