CVLGMay 3, 2023

Shap-E: Generating Conditional 3D Implicit Functions

arXiv:2305.02463v1461 citationsHas Code
Originality Highly original
AI Analysis

This addresses the need for efficient and high-quality 3D content creation for applications in gaming, VR, and design, representing a novel method rather than an incremental improvement.

The paper tackles the problem of generating 3D assets by introducing Shap-E, a conditional generative model that directly produces parameters for implicit functions, enabling rendering as textured meshes and neural radiance fields. The result is a model that generates complex and diverse 3D assets in seconds, with faster convergence and comparable or better sample quality than Point-E.

We present Shap-E, a conditional generative model for 3D assets. Unlike recent work on 3D generative models which produce a single output representation, Shap-E directly generates the parameters of implicit functions that can be rendered as both textured meshes and neural radiance fields. We train Shap-E in two stages: first, we train an encoder that deterministically maps 3D assets into the parameters of an implicit function; second, we train a conditional diffusion model on outputs of the encoder. When trained on a large dataset of paired 3D and text data, our resulting models are capable of generating complex and diverse 3D assets in a matter of seconds. When compared to Point-E, an explicit generative model over point clouds, Shap-E converges faster and reaches comparable or better sample quality despite modeling a higher-dimensional, multi-representation output space. We release model weights, inference code, and samples at https://github.com/openai/shap-e.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes