LGQMMay 18, 2023

A benchmark for computational analysis of animal behavior, using animal-borne tags

arXiv:2305.10740v321 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This provides a standardized benchmark for researchers in ecology and conservation to evaluate and improve machine learning methods for animal behavior analysis, though it is incremental as it builds on existing data and techniques.

The authors tackled the lack of a common framework for comparing machine learning techniques in analyzing animal behavior from bio-logger data by introducing the Bio-logger Ethogram Benchmark (BEBE), the largest and most taxonomically diverse public benchmark of its type, and found that deep neural networks outperform classical methods across all nine datasets, with self-supervised learning showing particular strength in low-data settings.

Animal-borne sensors (`bio-loggers') can record a suite of kinematic and environmental data, which are used to elucidate animal ecophysiology and improve conservation efforts. Machine learning techniques are used for interpreting the large amounts of data recorded by bio-loggers, but there exists no common framework for comparing the different machine learning techniques in this domain. This makes it difficult to, for example, identify patterns in what works well for machine learning-based analysis of bio-logger data. It also makes it difficult to evaluate the effectiveness of novel methods developed by the machine learning community. To address this, we present the Bio-logger Ethogram Benchmark (BEBE), a collection of datasets with behavioral annotations, as well as a modeling task and evaluation metrics. BEBE is to date the largest, most taxonomically diverse, publicly available benchmark of this type. Using BEBE, we compare the performance of deep and classical machine learning methods for identifying animal behaviors based on bio-logger data. As an example usage of BEBE, we test an approach based on self-supervised learning. To apply this approach to animal behavior classification, we adapt a deep neural network pre-trained with 700,000 hours of data collected from human wrist-worn accelerometers. We find that deep neural networks out-perform the classical machine learning methods we tested across all nine datasets in BEBE. We additionally find that the approach based on self-supervised learning out-performs the alternatives we tested, especially in settings when there is a low amount of training data available. In light of this, we are able to make concrete suggestions for designing studies that rely on machine learning to infer behavior from bio-logger data. Datasets and code are available at https://github.com/earthspecies/BEBE.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes