CVApr 22, 2024

MaterialSeg3D: Segmenting Dense Materials from 2D Priors for 3D Assets

Zeyu Li, Ruitong Gan, Chuanchen Luo, Yuxi Wang, Jiaheng Liu, Ziwei Zhu Man Zhang, Qing Li, Xucheng Yin, Zhaoxiang Zhang, Junran Peng

arXiv:2404.13923v39.69 citationsh-index: 15MM

Originality Incremental advance

AI Analysis

This addresses a domain-specific problem for 3D asset generation, enabling better relighting for downstream applications like gaming or VR, but it is incremental as it builds on prior 2D-to-3D methods.

The paper tackles the problem of generating 3D assets with accurate material maps from 2D priors, as existing methods produce spurious components due to illumination effects, limiting relighting in novel scenes. The result is MaterialSeg3D, a framework that infers materials from 2D semantic priors, validated through extensive experiments.

Driven by powerful image diffusion models, recent research has achieved the automatic creation of 3D objects from textual or visual guidance. By performing score distillation sampling (SDS) iteratively across different views, these methods succeed in lifting 2D generative prior to the 3D space. However, such a 2D generative image prior bakes the effect of illumination and shadow into the texture. As a result, material maps optimized by SDS inevitably involve spurious correlated components. The absence of precise material definition makes it infeasible to relight the generated assets reasonably in novel scenes, which limits their application in downstream scenarios. In contrast, humans can effortlessly circumvent this ambiguity by deducing the material of the object from its appearance and semantics. Motivated by this insight, we propose MaterialSeg3D, a 3D asset material generation framework to infer underlying material from the 2D semantic prior. Based on such a prior model, we devise a mechanism to parse material in 3D space. We maintain a UV stack, each map of which is unprojected from a specific viewpoint. After traversing all viewpoints, we fuse the stack through a weighted voting scheme and then employ region unification to ensure the coherence of the object parts. To fuel the learning of semantics prior, we collect a material dataset, named Materialized Individual Objects (MIO), which features abundant images, diverse categories, and accurate annotations. Extensive quantitative and qualitative experiments demonstrate the effectiveness of our method.

View on arXiv PDF

Similar