CL CV Jun 1, 2018

Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7

Huda Alamri, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Jue Wang, Irfan Essa, Dhruv Batra, Devi Parikh, Anoop Cherian, Tim K. Marks, Chiori Hori

arXiv:1806.00525v13.037 citations Has Code

Originality Synthesis-oriented

AI Analysis

This paper addresses the problem of enabling AI systems to hold conversations about real-world scenes, which is relevant to researchers and developers, but its contribution is incremental because it builds on existing technologies.

The paper introduces the Audio Visual Scene-Aware Dialog (AVSD) challenge and dataset to advance scene-aware dialog systems by integrating technologies from end-to-end dialog, visual dialog, and video description, with the task of generating responses in dialogs about input videos.

Scene-aware dialog systems will be able to have conversations with users about the objects and events around them. Progress on such systems can be made by integrating state-of-the-art technologies from multiple research areas, including end-to-end dialog systems, visual dialog, and video description. We introduce the Audio Visual Scene-Aware Dialog (AVSD) challenge and dataset. In this challenge, which is one track of the 7th Dialog System Technology Challenges (DSTC7) workshop, the task is to build a system that generates responses in a dialog about an input video.
