CVJan 30, 2024

An Embeddable Implicit IUVD Representation for Part-based 3D Human Surface Reconstruction

arXiv:2401.16810v25 citationsh-index: 2IEEE Transactions on Image Processing
Originality Incremental advance
AI Analysis

This work addresses computational bottlenecks in implicit human reconstruction pipelines for computer vision applications, representing an incremental improvement over existing methods.

The paper tackles the problem of inefficient 3D human surface reconstruction from single images by proposing an IUVD-Feedback representation that replaces signed distance calculations with linear transformations in IUVD space and reduces redundant query points, achieving three times faster acceleration while improving robustness on the THuman2.0 dataset.

To reconstruct a 3D human surface from a single image, it is crucial to simultaneously consider human pose, shape, and clothing details. Recent approaches have combined parametric body models (such as SMPL), which capture body pose and shape priors, with neural implicit functions that flexibly learn clothing details. However, this combined representation introduces additional computation, e.g. signed distance calculation in 3D body feature extraction, leading to redundancy in the implicit query-and-infer process and failing to preserve the underlying body shape prior. To address these issues, we propose a novel IUVD-Feedback representation, consisting of an IUVD occupancy function and a feedback query algorithm. This representation replaces the time-consuming signed distance calculation with a simple linear transformation in the IUVD space, leveraging the SMPL UV maps. Additionally, it reduces redundant query points through a feedback mechanism, leading to more reasonable 3D body features and more effective query points, thereby preserving the parametric body prior. Moreover, the IUVD-Feedback representation can be embedded into any existing implicit human reconstruction pipeline without requiring modifications to the trained neural networks. Experiments on the THuman2.0 dataset demonstrate that the proposed IUVD-Feedback representation improves the robustness of results and achieves three times faster acceleration in the query-and-infer process. Furthermore, this representation holds potential for generative applications by leveraging its inherent semantic information from the parametric body model.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes