CVROMar 9, 2024

SPAFormer: Sequential 3D Part Assembly with Transformers

arXiv:2403.05874v39 citationsh-index: 7Has Code3DV
Originality Highly original
AI Analysis

This addresses the problem of efficient and accurate 3D part assembly for robotics and manufacturing, representing an incremental improvement with a novel method for a known bottleneck.

The paper tackles the combinatorial explosion challenge in 3D Part Assembly by introducing SPAFormer, which leverages weak constraints from assembly sequences to reduce solution space complexity, achieving superior generalization in multi-tasking and long-horizon assembly scenarios.

We introduce SPAFormer, an innovative model designed to overcome the combinatorial explosion challenge in the 3D Part Assembly (3D-PA) task. This task requires accurate prediction of each part's poses in sequential steps. As the number of parts increases, the possible assembly combinations increase exponentially, leading to a combinatorial explosion that severely hinders the efficacy of 3D-PA. SPAFormer addresses this problem by leveraging weak constraints from assembly sequences, effectively reducing the solution space's complexity. Since the sequence of parts conveys construction rules similar to sentences structured through words, our model explores both parallel and autoregressive generation. We further strengthen SPAFormer through knowledge enhancement strategies that utilize the attributes of parts and their sequence information, enabling it to capture the inherent assembly pattern and relationships among sequentially ordered parts. We also construct a more challenging benchmark named PartNet-Assembly covering 21 varied categories to more comprehensively validate the effectiveness of SPAFormer. Extensive experiments demonstrate the superior generalization capabilities of SPAFormer, particularly with multi-tasking and in scenarios requiring long-horizon assembly. Code is available at https://github.com/xuboshen/SPAFormer.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes