CVJul 30, 2025

Image-Guided Shape-from-Template Using Mesh Inextensibility Constraints

arXiv:2507.22699v1h-index: 9Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of real-time 3D shape reconstruction for applications like robotics or augmented reality, though it appears incremental as it builds on prior unsupervised approaches.

The paper tackles the problem of reconstructing 3D shapes of deforming objects from images without requiring point correspondences or large supervised datasets, achieving a 400× faster reconstruction speed than the best-performing unsupervised method and significantly outperforming existing methods in handling finer details and severe occlusions.

Shape-from-Template (SfT) refers to the class of methods that reconstruct the 3D shape of a deforming object from images/videos using a 3D template. Traditional SfT methods require point correspondences between images and the texture of the 3D template in order to reconstruct 3D shapes from images/videos in real time. Their performance severely degrades when encountered with severe occlusions in the images because of the unavailability of correspondences. In contrast, modern SfT methods use a correspondence-free approach by incorporating deep neural networks to reconstruct 3D objects, thus requiring huge amounts of data for supervision. Recent advances use a fully unsupervised or self-supervised approach by combining differentiable physics and graphics to deform 3D template to match input images. In this paper, we propose an unsupervised SfT which uses only image observations: color features, gradients and silhouettes along with a mesh inextensibility constraint to reconstruct at a $400\times$ faster pace than (best-performing) unsupervised SfT. Moreover, when it comes to generating finer details and severe occlusions, our method outperforms the existing methodologies by a large margin. Code is available at https://github.com/dvttran/nsft.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes