InTeX: Interactive Text-to-texture Synthesis via Unified Depth-aware Inpainting
This addresses the problem of creating high-quality 3D textures from text for 3D content creators, representing an incremental improvement over existing methods.
The paper tackles 3D inconsistency and limited controllability in text-to-texture synthesis by introducing InteX, a framework with an interactive interface and unified depth-aware inpainting model, which improves generation speed and enables precise texture editing.
Text-to-texture synthesis has become a new frontier in 3D content creation thanks to the recent advances in text-to-image models. Existing methods primarily adopt a combination of pretrained depth-aware diffusion and inpainting models, yet they exhibit shortcomings such as 3D inconsistency and limited controllability. To address these challenges, we introduce InteX, a novel framework for interactive text-to-texture synthesis. 1) InteX includes a user-friendly interface that facilitates interaction and control throughout the synthesis process, enabling region-specific repainting and precise texture editing. 2) Additionally, we develop a unified depth-aware inpainting model that integrates depth information with inpainting cues, effectively mitigating 3D inconsistencies and improving generation speed. Through extensive experiments, our framework has proven to be both practical and effective in text-to-texture synthesis, paving the way for high-quality 3D content creation.