CLCVJun 1, 2018

Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7

arXiv:1806.00525v137 citations
Originality Synthesis-oriented
AI Analysis

This addresses the problem of enabling AI systems to have conversations about real-world scenes for researchers and developers, but it is incremental as it builds on existing technologies.

The paper introduces the Audio Visual Scene-Aware Dialog (AVSD) challenge and dataset to advance scene-aware dialog systems by integrating technologies from end-to-end dialog, visual dialog, and video description, with the task of generating responses in dialogs about input videos.

Scene-aware dialog systems will be able to have conversations with users about the objects and events around them. Progress on such systems can be made by integrating state-of-the-art technologies from multiple research areas including end-to-end dialog systems visual dialog, and video description. We introduce the Audio Visual Scene Aware Dialog (AVSD) challenge and dataset. In this challenge, which is one track of the 7th Dialog System Technology Challenges (DSTC7) workshop1, the task is to build a system that generates responses in a dialog about an input video

Code Implementations4 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes