ASAISPMar 21, 2024

Crowdsourced Multilingual Speech Intelligibility Testing

arXiv:2403.14817v18 citationsh-index: 2ICASSP
Originality Synthesis-oriented
AI Analysis

This work addresses the need for scalable and multilingual intelligibility testing in audio generation, but it is incremental as it builds on existing crowdsourcing concepts without a major breakthrough.

The paper tackles the problem of evaluating speech intelligibility for generative audio features by proposing a crowdsourced assessment approach, resulting in the public release of multilingual speech data and early experimental results.

With the advent of generative audio features, there is an increasing need for rapid evaluation of their impact on speech intelligibility. Beyond the existing laboratory measures, which are expensive and do not scale well, there has been comparatively little work on crowdsourced assessment of intelligibility. Standards and recommendations are yet to be defined, and publicly available multilingual test materials are lacking. In response to this challenge, we propose an approach for a crowdsourced intelligibility assessment. We detail the test design, the collection and public release of the multilingual speech data, and the results of our early experiments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes