CVGRDec 20, 2024

3D Shape Tokenization via Latent Flow Matching

arXiv:2412.15618v38 citationsh-index: 47
Originality Incremental advance
AI Analysis

This provides a more efficient 3D representation for machine learning applications, though it appears incremental as it builds on existing flow-matching techniques.

The paper tackles 3D shape representation by modeling surfaces as probability density functions using flow matching, resulting in a method that achieves competitive performance across tasks like 3D-CLIP and generative modeling while requiring minimal preprocessing.

We introduce a latent 3D representation that models 3D surfaces as probability density functions in 3D, i.e., p(x,y,z), with flow-matching. Our representation is specifically designed for consumption by machine learning models, offering continuity and compactness by construction while requiring only point clouds and minimal data preprocessing. Despite being a data-driven method, our use of flow matching in the 3D space enables interesting geometry properties, including the capabilities to perform zero-shot estimation of surface normal and deformation field. We evaluate with several machine learning tasks, including 3D-CLIP, unconditional generative models, single-image conditioned generative model, and intersection-point estimation. Across all experiments, our models achieve competitive performance to existing baselines, while requiring less preprocessing and auxiliary information from training data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes