LG · MTRL-SCI · BM · Mar 20, 2025

Unified Cross-Scale 3D Generation and Understanding via Autoregressive Modeling

Peking U
arXiv:2503.16278v3 · 6 citations · h-index: 12
Originality: Highly original
AI Analysis

Uni-3DAR addresses the problem of specialized, non-generalizable 3D models, offering a unified solution for researchers and practitioners in fields such as fluid simulation, protein folding, and 3D reconstruction.

The paper tackles the fragmentation in 3D structure modeling by proposing Uni-3DAR, a unified autoregressive framework for cross-scale 3D generation and understanding, achieving up to 256% relative improvement over previous state-of-the-art diffusion models and inference speeds up to 21.8x faster.

3D structure modeling is essential across scales, enabling applications from fluid simulation and 3D reconstruction to protein folding and molecular docking. Yet, despite shared 3D spatial patterns, current approaches remain fragmented, with models narrowly specialized for specific domains and unable to generalize across tasks or scales. We propose Uni-3DAR, a unified autoregressive framework for cross-scale 3D generation and understanding. At its core is a coarse-to-fine tokenizer based on octree data structures, which compresses diverse 3D structures into compact 1D token sequences. We further propose a two-level subtree compression strategy, which reduces the octree token sequence by up to 8x. To address the challenge of dynamically varying token positions introduced by compression, we introduce a masked next-token prediction strategy that ensures accurate positional modeling, significantly boosting model performance. Extensive experiments across multiple 3D generation and understanding tasks, including small molecules, proteins, polymers, crystals, and macroscopic 3D objects, validate its effectiveness and versatility. Notably, Uni-3DAR surpasses previous state-of-the-art diffusion models by a substantial margin, achieving up to 256% relative improvement while delivering inference speeds up to 21.8x faster.
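
To make the idea of a coarse-to-fine octree tokenizer concrete, the following is a minimal, generic sketch of octree serialization. It is not the Uni-3DAR tokenizer. The function name `octree_tokens`, the unit-cube input convention, and the choice of one 8-bit occupancy mask per cell are all assumptions made for illustration. Cells are visited breadth-first, so tokens for coarse levels come before tokens for fine levels, and empty regions are never subdivided.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def octree_tokens(points: Sequence[Point], depth: int) -> List[int]:
    """Serialize points in [0, 1)^3 into a coarse-to-fine token sequence.

    Hypothetical illustration: each token is an 8-bit occupancy mask
    (0..255) recording which of a cell's eight children contain at
    least one point. Only occupied children are subdivided further.
    """
    # Each entry is (cell origin, points inside that cell).
    level = [((0.0, 0.0, 0.0), list(points))]
    size = 1.0
    tokens: List[int] = []
    for _ in range(depth):
        half = size / 2.0
        next_level = []
        for (ox, oy, oz), pts in level:
            # Sort the cell's points into its eight octants.
            buckets: List[List[Point]] = [[] for _ in range(8)]
            for (x, y, z) in pts:
                idx = (int(x >= ox + half)
                       | (int(y >= oy + half) << 1)
                       | (int(z >= oz + half) << 2))
                buckets[idx].append((x, y, z))
            # Build the occupancy mask and queue occupied children.
            mask = 0
            for idx, bucket in enumerate(buckets):
                if bucket:
                    mask |= 1 << idx
                    child_origin = (ox + half * (idx & 1),
                                    oy + half * ((idx >> 1) & 1),
                                    oz + half * ((idx >> 2) & 1))
                    next_level.append((child_origin, bucket))
            tokens.append(mask)
        level = next_level
        size = half
    return tokens


tokens = octree_tokens([(0.1, 0.1, 0.1), (0.9, 0.9, 0.9)], depth=2)
print(tokens)  # [129, 1, 128]
```

In this sketch, packing the eight child-occupancy bits of a cell into one mask yields one token per occupied cell rather than one per child. That is one way an eightfold shorter sequence can arise, but whether it matches the paper's two-level subtree compression is an assumption, not something the abstract states.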

Code Implementations: 1 repo