An open dataset for research on audio field recording archives: freefield1010
This work provides a standardised dataset for researchers working on audio archive mining, though the contribution is incremental because it repurposes existing data.
The authors introduced a free and open dataset of 7690 audio clips from field recordings to facilitate research in data mining for audio archives, and demonstrated its utility through an auto-tagging experiment.
We introduce a free and open dataset of 7690 audio clips sampled from the field-recording tag in the Freesound audio archive. The dataset is designed for use in research related to data mining in audio archives of field recordings / soundscapes. Audio is standardised, and audio and metadata are Creative Commons licensed. We describe the data preparation process, characterise the dataset descriptively, and illustrate its use through an auto-tagging experiment.