AICLLGMANov 10, 2016

Learning to Play Guess Who? and Inventing a Grounded Language as a Consequence

arXiv:1611.03218v469 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of developing grounded language for AI agents, though it is incremental as it builds on existing interactive frameworks.

The paper tackles the problem of learning grounded communication by having agents play Guess Who? using Deep Recurrent Q-Networks, resulting in agents that encode physical concepts in their words and engage in multi-step dialogues.

Acquiring your first language is an incredible feat and not easily duplicated. Learning to communicate using nothing but a few pictureless books, a corpus, would likely be impossible even for humans. Nevertheless, this is the dominating approach in most natural language processing today. As an alternative, we propose the use of situated interactions between agents as a driving force for communication, and the framework of Deep Recurrent Q-Networks for evolving a shared language grounded in the provided environment. We task the agents with interactive image search in the form of the game Guess Who?. The images from the game provide a non trivial environment for the agents to discuss and a natural grounding for the concepts they decide to encode in their communication. Our experiments show that the agents learn not only to encode physical concepts in their words, i.e. grounding, but also that the agents learn to hold a multi-step dialogue remembering the state of the dialogue from step to step.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes