BIRD: Big Impulse Response Dataset
BIRD provides a useful resource for researchers in audio processing, though the contribution is incremental: it extends existing simulation methods to create a larger dataset.
The paper tackles the lack of large open datasets for multichannel room impulse responses by introducing BIRD, a dataset of 100,000 simulated RIRs, which enables efficient online data augmentation for multi-microphone and multi-source audio scenarios.
This paper introduces BIRD, the Big Impulse Response Dataset. This open dataset consists of 100,000 multichannel room impulse responses (RIRs) generated from simulations using the Image Method, making it the largest multichannel open dataset currently available. These RIRs can be used to perform efficient online data augmentation for scenarios that involve two microphones and multiple sound sources. The paper also presents use cases that illustrate how BIRD can be used for data augmentation with existing speech corpora.
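To make the augmentation idea concrete, the following is a minimal sketch of how a set of RIRs could be applied to dry signals to produce a two-microphone, multi-source mixture. It does not use BIRD's actual loading interface, which is not described here. The function name `augment`, the array layout `(n_sources, n_mics, rir_len)`, and the decaying-noise placeholder RIRs are all assumptions made for illustration.

```python
import numpy as np


def augment(sources, rirs):
    """Mix dry sources into a multichannel signal (hypothetical helper).

    sources: list of 1-D arrays, one dry signal per source
    rirs:    array of shape (n_sources, n_mics, rir_len), assumed layout
    returns: array of shape (n_mics, longest_source + rir_len - 1)
    """
    n_sources, n_mics, rir_len = rirs.shape
    out_len = max(len(src) for src in sources) + rir_len - 1
    mix = np.zeros((n_mics, out_len))
    for s, src in enumerate(sources):
        for m in range(n_mics):
            # Convolving a dry signal with an RIR simulates what
            # microphone m would record from source s in that room.
            wet = np.convolve(src, rirs[s, m])
            mix[m, :len(wet)] += wet
    return mix


rng = np.random.default_rng(0)

# Placeholder RIRs (exponentially decaying noise), NOT real BIRD data.
rir_len = 512
decay = np.exp(-np.arange(rir_len) / 100.0)
rirs = rng.standard_normal((2, 2, rir_len)) * decay

# Placeholder dry signals standing in for utterances from a speech corpus.
sources = [rng.standard_normal(16000), rng.standard_normal(12000)]

mix = augment(sources, rirs)
print(mix.shape)  # (2, 16511)
```

Drawing a different set of RIRs for each training example is what makes the augmentation "online": the dry corpus stays fixed while the simulated room changes. For long signals, an FFT-based convolution would likely be faster than `np.convolve`.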