Phase-aware Harmonic/Percussive Source Separation via Convex Optimization
This is an incremental improvement for audio processing applications, enhancing pre-processing tools by addressing phase information in separation.
The paper tackled the problem of harmonic/percussive source separation in audio by proposing a phase-aware method based on convex optimization, which improved separation quality by simultaneously handling amplitude and phase, with numerical experiments validating its effectiveness.
Decomposition of an audio mixture into harmonic and percussive components, namely harmonic/percussive source separation (HPSS), is a useful pre-processing tool for many audio applications. Popular approaches to HPSS exploit the distinctive source-specific structures of power spectrograms. However, such approaches consider only power spectrograms, and the phase remains intact for resynthesizing the separated signals. In this paper, we propose a phase-aware HPSS method based on the structure of the phase of harmonic components. It is formulated as a convex optimization problem in the time domain, which enables the simultaneous treatment of both amplitude and phase. The numerical experiment validates the effectiveness of the proposed method.