Robotic Speech Synthesis: Perspectives on Interactions, Scenarios, and Ethics
This addresses the practical issue of voice design for human-robot interaction researchers, but it is incremental as it reviews existing findings without presenting new results.
The paper tackles the problem of synthesizing robotic voices for conversational robots in various scenarios, discussing challenges in non-verbal speech, scenario classification, and ethical considerations, based on literature review and prior work.
In recent years, many works have investigated the feasibility of conversational robots for performing specific tasks, such as healthcare and interview. Along with this development comes a practical issue: how should we synthesize robotic voices to meet the needs of different situations? In this paper, we discuss this issue from three perspectives: 1) the difficulties of synthesizing non-verbal and interaction-oriented speech signals, particularly backchannels; 2) the scenario classification for robotic voice synthesis; 3) the ethical issues regarding the design of robot voice for its emotion and identity. We present the findings of relevant literature and our prior work, trying to bring the attention of human-robot interaction researchers to design better conversational robots in the future.