Machine Learning-based Classification of Birds through Birdsong
This work addresses bird species identification for ecological monitoring, but it is incremental as it applies existing methods to new data.
The paper tackled the problem of identifying birds from their songs using machine learning, achieving 91% accuracy for a top-5 classification of 30 Australian birds and 58% accuracy for a more challenging dataset of 152 species.
Audio sound recognition and classification is used for many tasks and applications including human voice recognition, music recognition and audio tagging. In this paper we apply Mel Frequency Cepstral Coefficients (MFCC) in combination with a range of machine learning models to identify (Australian) birds from publicly available audio files of their birdsong. We present approaches used for data processing and augmentation and compare the results of various state of the art machine learning models. We achieve an overall accuracy of 91% for the top-5 birds from the 30 selected as the case study. Applying the models to more challenging and diverse audio files comprising 152 bird species, we achieve an accuracy of 58%