CVGRLGNov 25, 2020

PLAD: Learning to Infer Shape Programs with Pseudo-Labels and Approximate Distributions

arXiv:2011.13045v430 citations
AI Analysis

This work is significant for researchers and practitioners in computer graphics and reverse engineering who need to infer shape programs from visual data, offering a faster and more accurate method.

This paper addresses the challenge of inferring programs that generate 2D and 3D shapes, where paired (shape, program) data is scarce. The authors propose PLAD, a self-training variant that pairs program pseudo-labels with their executed output shapes, avoiding label mismatch. PLAD techniques infer more accurate shape programs and converge significantly faster compared to policy gradient reinforcement learning.

Inferring programs which generate 2D and 3D shapes is important for reverse engineering, editing, and more. Training models to perform this task is complicated because paired (shape, program) data is not readily available for many domains, making exact supervised learning infeasible. However, it is possible to get paired data by compromising the accuracy of either the assigned program labels or the shape distribution. Wake-sleep methods use samples from a generative model of shape programs to approximate the distribution of real shapes. In self-training, shapes are passed through a recognition model, which predicts programs that are treated as "pseudo-labels" for those shapes. Related to these approaches, we introduce a novel self-training variant unique to program inference, where program pseudo-labels are paired with their executed output shapes, avoiding label mismatch at the cost of an approximate shape distribution. We propose to group these regimes under a single conceptual framework, where training is performed with maximum likelihood updates sourced from either Pseudo-Labels or an Approximate Distribution (PLAD). We evaluate these techniques on multiple 2D and 3D shape program inference domains. Compared with policy gradient reinforcement learning, we show that PLAD techniques infer more accurate shape programs and converge significantly faster. Finally, we propose to combine updates from different PLAD methods within the training of a single model, and find that this approach outperforms any individual technique.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes