SDMay 15, 2017

Texture features for the reproduction of the perceptual organization of sound

arXiv:1705.05271v12 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses sound categorization for applications in audio processing and perception, but it is incremental as it builds on existing filterbank methods.

The paper tackled the problem of reproducing human perceptual organization of sound by proposing a novel analysis method that separates sound into tonal, pulsal, and noisy textures, and found that energy-based tonality and pulsality strongly correlate with the first perceptual dimension in human subjects.

Human categorization of sound seems predominantly based on sound source properties. To estimate these source properties we propose a novel sound analysis method, which separates sound into different sonic textures: tones, pulses, and broadband noises. The audible presence of tones or pulses corresponds to more extended cochleagram patterns than can be expected on the basis of correlations introduced by the gammachirp filterbank alone. We design tract features to respond to these extended patterns, and use these to identify areas of the time-frequency plane as tonal, pulsal, and noisy. Where an area is marked as noisy if it is neither tonal nor pulsal. To investigate whether a similar separation indeed underlies human perceptual organization we introduce tract based descriptors: tonality, pulsality, and noisiness. These descriptors keep track of either the total energy or the cochleagram area marked as respectively tonal, pulsal, and noisy. Energy based tonality and pulsality is strongly correlated with the first perceptual dimension of human subjects, while energy based noisiness correlates moderately with the second perceptual dimension. We conclude that harmonic, impact and continuous process sounds can be largely separated with energy based tonality, pulsality and noisiness.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes