CVMar 22, 2024

DragAPart: Learning a Part-Level Motion Prior for Articulated Objects

Oxford
arXiv:2403.15382v233 citationsh-index: 18ECCV
Originality Incremental advance
AI Analysis

This addresses the challenge of realistic part-level motion synthesis in images for applications in robotics and graphics, though it builds incrementally on pre-trained generators.

The paper tackles the problem of generating images of articulated objects responding to part-level drag interactions, such as opening drawers, by introducing DragAPart, which learns a generalist motion model and shows improved part-level motion understanding compared to prior methods.

We introduce DragAPart, a method that, given an image and a set of drags as input, generates a new image of the same object that responds to the action of the drags. Differently from prior works that focused on repositioning objects, DragAPart predicts part-level interactions, such as opening and closing a drawer. We study this problem as a proxy for learning a generalist motion model, not restricted to a specific kinematic structure or object category. We start from a pre-trained image generator and fine-tune it on a new synthetic dataset, Drag-a-Move, which we introduce. Combined with a new encoding for the drags and dataset randomization, the model generalizes well to real images and different categories. Compared to prior motion-controlled generators, we demonstrate much better part-level motion understanding.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes