AVIN-Chat: An Audio-Visual Interactive Chatbot System with Emotional State Tuning
This work addresses the need for more engaging human-chatbot interaction, particularly in applications that require an emotional connection, although the contribution appears incremental: it adds audio-visual and emotional features to existing chatbot frameworks.
The paper addresses the limited interactivity of text-only and speech-only chatbots by developing AVIN-Chat, an audio-visual system in which 3D avatars respond to the user's emotional state in real time. Subjective user tests indicate higher immersion than with previous chatbot systems.
This work presents an audio-visual interactive chatbot (AVIN-Chat) system that allows users to have face-to-face conversations with 3D avatars in real time. Whereas previous chatbot services provide text-only or speech-only communication, AVIN-Chat offers audio-visual communication, giving users a higher quality of experience. In addition, AVIN-Chat speaks and expresses itself emotionally according to the user's emotional state. This enables users to establish a strong bond with the chatbot system and increases their immersion. Subjective user tests demonstrate that the proposed system provides a higher sense of immersion than previous chatbot systems. The demonstration video is available at https://www.youtube.com/watch?v=Z74uIV9k7_k.