CVNov 22, 2023

Two-stage Synthetic Supervising and Multi-view Consistency Self-supervising based Animal 3D Reconstruction by Single Image

arXiv:2311.13199v3h-index: 3Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of 3D animal reconstruction for researchers and applications in fields like biology and animation, but it is incremental as it adapts existing human-based methods to animals.

The paper tackles the challenge of 3D reconstruction of live animals from single images by combining synthetic supervised and multi-view consistency self-supervised training, achieving state-of-the-art performance in bird 3D digitization with improvements in quantitative and qualitative metrics.

Pixel-aligned Implicit Function (PIFu) effectively captures subtle variations in body shape within a low-dimensional space through extensive training with human 3D scans, its application to live animals presents formidable challenges due to the difficulty of obtaining animal cooperation for 3D scanning. To address this challenge, we propose the combination of two-stage supervised and self-supervised training to address the challenge of obtaining animal cooperation for 3D scanning. In the first stage, we leverage synthetic animal models for supervised learning. This allows the model to learn from a diverse set of virtual animal instances. In the second stage, we use 2D multi-view consistency as a self-supervised training method. This further enhances the model's ability to reconstruct accurate and realistic 3D shape and texture from largely available single-view images of real animals. The results of our study demonstrate that our approach outperforms state-of-the-art methods in both quantitative and qualitative aspects of bird 3D digitization. The source code is available at https://github.com/kuangzijian/drifu-for-animals.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes