CR SD ASApr 13, 2019

Towards Vulnerability Analysis of Voice-Driven Interfaces and Countermeasures for Replay

Khalid Mahmood Malik, Hafiz Malik, Roland Baumann

arXiv:1904.06591v16.836 citations

Originality Incremental advance

AI Analysis

This addresses security risks for users of smart speakers and chatbots, though it is incremental as it builds on existing replay attack detection methods.

The paper tackles the vulnerability of voice-driven interfaces to replay attacks by proposing a non-learning-based detection method that models replay distortion as higher-order nonlinearity, achieving successful detection on platforms like Google Home and Amazon Alexa.

Fake audio detection is expected to become an important research area in the field of smart speakers such as Google Home, Amazon Echo and chatbots developed for these platforms. This paper presents replay attack vulnerability of voice-driven interfaces and proposes a countermeasure to detect replay attack on these platforms. This paper presents a novel framework to model replay attack distortion, and then use a non-learning-based method for replay attack detection on smart speakers. The reply attack distortion is modeled as a higher-order nonlinearity in the replay attack audio. Higher-order spectral analysis (HOSA) is used to capture characteristics distortions in the replay audio. Effectiveness of the proposed countermeasure scheme is evaluated on original speech as well as corresponding replayed recordings. The replay attack recordings are successfully injected into the Google Home device via Amazon Alexa using the drop-in conferencing feature.

View on arXiv PDF

Similar