A Semantic Timbre Dataset for the Electric Guitar
This addresses the problem of limited timbre data for researchers in audio synthesis and generative AI, though it is incremental as it focuses on a specific instrument.
The authors tackled the lack of annotated datasets for timbre in machine learning by creating the Semantic Timbre Dataset for electric guitar sounds, labeled with semantic descriptors, and validated it using a VAE that captured timbral structure and enabled interpolation.
Understanding and manipulating timbre is central to audio synthesis, yet this remains under-explored in machine learning due to a lack of annotated datasets linking perceptual timbre dimensions to semantic descriptors. We present the Semantic Timbre Dataset, a curated collection of monophonic electric guitar sounds, each labeled with one of 19 semantic timbre descriptors and corresponding magnitudes. These descriptors were derived from a qualitative analysis of physical and virtual guitar effect units and applied systematically to clean guitar tones. The dataset bridges perceptual timbre and machine learning representations, supporting learning for timbre control and semantic audio generation. We validate the dataset by training a variational autoencoder (VAE) on its latent space and evaluating it using human perceptual judgments and descriptor classifiers. Results show that the VAE captures timbral structure and enables smooth interpolation across descriptors. We release the dataset, code, and evaluation protocols to support timbre-aware generative AI research.