CLAILGMAOct 11, 2021

Calibrate your listeners! Robust communication-based training for pragmatic speakers

arXiv:2110.05422v1663 citations
Originality Highly original
AI Analysis

This addresses the issue of training NLP systems to produce contextually useful utterances without diverging from natural language, which is incremental as it builds on prior communication-based training methods.

The paper tackles the problem of semantic drift in NLP systems trained with communication-based objectives by proposing a method that uses a population of neural listeners to regularize speaker training, showing that the ensemble-based approach enables pragmatic utterance generation while scaling to large vocabularies and generalizing to new scenarios.

To be good conversational partners, natural language processing (NLP) systems should be trained to produce contextually useful utterances. Prior work has investigated training NLP systems with communication-based objectives, where a neural listener stands in as a communication partner. However, these systems commonly suffer from semantic drift where the learned language diverges radically from natural language. We propose a method that uses a population of neural listeners to regularize speaker training. We first show that language drift originates from the poor uncertainty calibration of a neural listener, which makes high-certainty predictions on novel sentences. We explore ensemble- and dropout-based populations of listeners and find that the former results in better uncertainty quantification. We evaluate both population-based objectives on reference games, and show that the ensemble method with better calibration enables the speaker to generate pragmatic utterances while scaling to a large vocabulary and generalizing to new games and listeners.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes