ROOct 13, 2020

Labeling the Phrases of a Conversational Agent with a Unique Personalized Vocabulary

arXiv:2010.06194v22 citations
Originality Incremental advance
AI Analysis

This study addresses the challenge of gesture generation for conversational agents with unique vocabularies, but it is incremental as it builds on existing concept-based methods.

This paper tackles the problem of mapping spoken text to gestures for conversational agents by using a concept-based approach, comparing automatic NLP-derived concepts with manual sociological ones for the agent Rinna and identifying limitations in handling personalized vocabulary, with an experiment showing that concept-based gestures leave a better impression than random ones.

Mapping spoken text to gestures is an important research topic for robots with conversation capabilities. According to studies on human co-speech gestures, a reasonable solution for mapping is using a concept-based approach in which a text is first mapped to a semantic cluster (i.e., a concept) containing texts with similar meanings. Subsequently, each concept is mapped to a predefined gesture. By using a concept-based approach, this paper discusses the practical issue of obtaining concepts for a unique vocabulary personalized for a conversational agent. Using Microsoft Rinna as an agent, we qualitatively compare concepts obtained automatically through a natural language processing (NLP) approach to those obtained manually through a sociological approach. We then identify three limitations of the NLP approach: at the semantic level with emojis and symbols; at the semantic level with slang, new words, and buzzwords; and at the pragmatic level. We attribute these limitations to the personalized vocabulary of Rinna. A follow-up experiment demonstrates that robot gestures selected using a concept-based approach leave a better impression than randomly selected gestures for the Rinna vocabulary, suggesting the usefulness of a concept-based gesture generation system for personalized vocabularies. This study provides insights into the development of gesture generation systems for conversational agents with personalized vocabularies.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes