CV AI GRDec 18, 2025

Using Gaussian Splats to Create High-Fidelity Facial Geometry and Texture

arXiv:2512.16397v1h-index: 2

Originality Incremental advance

AI Analysis

This provides a practical solution for creating detailed 3D facial assets from limited images, benefiting graphics and animation industries, though it builds incrementally on existing neural representation methods.

The paper tackles the problem of reconstructing high-fidelity facial geometry and texture from a small set of uncalibrated images by using Gaussian Splatting with segmentation and surface constraints, achieving reconstruction from only 11 images and enabling integration into standard graphics pipelines.

We leverage increasingly popular three-dimensional neural representations in order to construct a unified and consistent explanation of a collection of uncalibrated images of the human face. Our approach utilizes Gaussian Splatting, since it is more explicit and thus more amenable to constraints than NeRFs. We leverage segmentation annotations to align the semantic regions of the face, facilitating the reconstruction of a neutral pose from only 11 images (as opposed to requiring a long video). We soft constrain the Gaussians to an underlying triangulated surface in order to provide a more structured Gaussian Splat reconstruction, which in turn informs subsequent perturbations to increase the accuracy of the underlying triangulated surface. The resulting triangulated surface can then be used in a standard graphics pipeline. In addition, and perhaps most impactful, we show how accurate geometry enables the Gaussian Splats to be transformed into texture space where they can be treated as a view-dependent neural texture. This allows one to use high visual fidelity Gaussian Splatting on any asset in a scene without the need to modify any other asset or any other aspect (geometry, lighting, renderer, etc.) of the graphics pipeline. We utilize a relightable Gaussian model to disentangle texture from lighting in order to obtain a delit high-resolution albedo texture that is also readily usable in a standard graphics pipeline. The flexibility of our system allows for training with disparate images, even with incompatible lighting, facilitating robust regularization. Finally, we demonstrate the efficacy of our approach by illustrating its use in a text-driven asset creation pipeline.

View on arXiv PDF

Similar