CVGRNov 4, 2022

CASA: Category-agnostic Skeletal Animal Reconstruction

arXiv:2211.03568v142 citationsh-index: 35
Originality Incremental advance
AI Analysis

This addresses the challenge of category-agnostic animal reconstruction for applications like animation, though it appears incremental by building on retrieval and inverse graphics techniques.

The paper tackles the problem of reconstructing skeletal shapes of animals from monocular videos by introducing CASA, a method that retrieves an articulated shape from a 3D character bank and optimizes shape deformation, skeleton structure, and skinning weights, achieving validated efficacy in shape reconstruction and articulation.

Recovering the skeletal shape of an animal from a monocular video is a longstanding challenge. Prevailing animal reconstruction methods often adopt a control-point driven animation model and optimize bone transforms individually without considering skeletal topology, yielding unsatisfactory shape and articulation. In contrast, humans can easily infer the articulation structure of an unknown animal by associating it with a seen articulated character in their memory. Inspired by this fact, we present CASA, a novel Category-Agnostic Skeletal Animal reconstruction method consisting of two major components: a video-to-shape retrieval process and a neural inverse graphics framework. During inference, CASA first retrieves an articulated shape from a 3D character assets bank so that the input video scores highly with the rendered image, according to a pretrained language-vision model. CASA then integrates the retrieved character into an inverse graphics framework and jointly infers the shape deformation, skeleton structure, and skinning weights through optimization. Experiments validate the efficacy of CASA regarding shape reconstruction and articulation. We further demonstrate that the resulting skeletal-animated characters can be used for re-animation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes