CVNov 15, 2023

WildlifeDatasets: An open-source toolkit for animal re-identification

arXiv:2311.09118v258 citationsh-index: 15Has Code
Originality Incremental advance
AI Analysis

This toolkit addresses the problem of animal re-identification for ecologists and computer vision researchers, providing an incremental improvement through comprehensive benchmarking and a new model.

The authors introduced WildlifeDatasets, an open-source toolkit for animal re-identification, and developed MegaDescriptor, a foundation model that achieves state-of-the-art performance, significantly outperforming models like CLIP and DINOv2.

In this paper, we present WildlifeDatasets (https://github.com/WildlifeDatasets/wildlife-datasets) - an open-source toolkit intended primarily for ecologists and computer-vision / machine-learning researchers. The WildlifeDatasets is written in Python, allows straightforward access to publicly available wildlife datasets, and provides a wide variety of methods for dataset pre-processing, performance analysis, and model fine-tuning. We showcase the toolkit in various scenarios and baseline experiments, including, to the best of our knowledge, the most comprehensive experimental comparison of datasets and methods for wildlife re-identification, including both local descriptors and deep learning approaches. Furthermore, we provide the first-ever foundation model for individual re-identification within a wide range of species - MegaDescriptor - that provides state-of-the-art performance on animal re-identification datasets and outperforms other pre-trained models such as CLIP and DINOv2 by a significant margin. To make the model available to the general public and to allow easy integration with any existing wildlife monitoring applications, we provide multiple MegaDescriptor flavors (i.e., Small, Medium, and Large) through the HuggingFace hub (https://huggingface.co/BVRA).

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes