SDCVMay 20

A strongly annotated passive acoustic dataset for tropical bird monitoring

arXiv:2605.2057832.7
Predicted impact top 73% in SD · last 90 daysOriginality Incremental advance
AI Analysis

This dataset addresses the scarcity of time-resolved annotated datasets for tropical bird monitoring, enabling supervised machine learning in complex soundscapes.

The authors present PteroSet, a strongly annotated dataset of Neotropical bird vocalizations with 15,372 annotations across 168 species, and provide a deep learning baseline for binary bird detection, highlighting challenges like acoustic co-occurrence and domain shift.

Passive acoustic monitoring enables continuous, non-invasive biodiversity assessment across diverse ecosystems. The scale of these datasets has driven the adoption of machine learning, with supervised approaches showing strong performance. However, supervised methods require time-resolved annotated datasets, which remain scarce, especially in complex tropical soundscapes. We present PteroSet, a curated dataset of strongly annotated Neotropical bird vocalizations recorded in Puerto Asis (Putumayo) and Pivijay (Magdalena), Colombia, between 2023 and 2025. The dataset comprises 563 recordings (73.62 h) and 15,372 time-frequency annotations, including 6,702 events identified to the species level across 168 species. We release the annotations in a COCO-inspired JSON schema that unifies audio files, taxonomic categories, and labels for machine learning workflows. Beyond providing annotated data, PteroSet serves as a realistic benchmark that highlights key characteristics of tropical soundscapes, including acoustic co-occurrence and domain shift across recording sites. We provide a deep learning baseline for binary bird detection, demonstrating PteroSet's usability and the challenges it presents.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes