CVAug 28, 2021

AP-10K: A Benchmark for Animal Pose Estimation in the Wild

Hang Yu, Yufei Xu, Jing Zhang, Wei Zhao, Ziyu Guan, Dacheng Tao

arXiv:2108.12617v226.3193 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This addresses the need for better tools in wildlife conservation and behavior analysis by providing a foundational dataset, though it is incremental as it builds on existing pose estimation methods.

The paper tackles the problem of limited generalization in animal pose estimation by introducing AP-10K, a large-scale benchmark with 10,015 images across 54 mammal species, and shows that learning from diverse species improves accuracy and generalization.

Accurate animal pose estimation is an essential step towards understanding animal behavior, and can potentially benefit many downstream applications, such as wildlife conservation. Previous works only focus on specific animals while ignoring the diversity of animal species, limiting the generalization ability. In this paper, we propose AP-10K, the first large-scale benchmark for mammal animal pose estimation, to facilitate the research in animal pose estimation. AP-10K consists of 10,015 images collected and filtered from 23 animal families and 54 species following the taxonomic rank and high-quality keypoint annotations labeled and checked manually. Based on AP-10K, we benchmark representative pose estimation models on the following three tracks: (1) supervised learning for animal pose estimation, (2) cross-domain transfer learning from human pose estimation to animal pose estimation, and (3) intra- and inter-family domain generalization for unseen animals. The experimental results provide sound empirical evidence on the superiority of learning from diverse animals species in terms of both accuracy and generalization ability. It opens new directions for facilitating future research in animal pose estimation. AP-10k is publicly available at https://github.com/AlexTheBad/AP10K.

View on arXiv PDF Code

Similar