Challenges and Opportunities in Multi-device Speech Processing
This work targets researchers and practitioners working on speech processing in multi-device environments; its contribution is incremental, since it primarily reviews existing knowledge.
The paper reviews current solutions and challenges in multi-device speech processing tasks, such as automatic speech recognition and device arbitration, and identifies the datasets needed to support future research in this domain.
We review current solutions and technical challenges for automatic speech recognition, keyword spotting, device arbitration, speech enhancement, and source localization in multi-device home environments to provide context for the INTERSPEECH 2022 special session, "Challenges and opportunities for signal processing and machine learning for multiple smart devices". We also identify the datasets needed to support these research areas. Based on the review and our research experience in the multi-device domain, we conclude with an outlook on the future evolution