CLLGMLOct 9, 2020

Multichannel Generative Language Model: Learning All Possible Factorizations Within and Across Channels

arXiv:2010.04438v1993 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of flexible multilingual generation and inference for natural language processing, though it appears incremental as it builds on existing generative and multilingual modeling concepts.

The paper tackles the problem of modeling multiple linguistic channels (e.g., languages) as transformations of underlying meaning by introducing the Multichannel Generative Language Model (MGLM), which learns a joint distribution over channels and marginalizes over all factorizations, and it shows that MGLM outperforms traditional bilingual discriminative models on the Multi30K dataset in quality-diversity trade-offs.

A channel corresponds to a viewpoint or transformation of an underlying meaning. A pair of parallel sentences in English and French express the same underlying meaning, but through two separate channels corresponding to their languages. In this work, we present the Multichannel Generative Language Model (MGLM). MGLM is a generative joint distribution model over channels. MGLM marginalizes over all possible factorizations within and across all channels. MGLM endows flexible inference, including unconditional generation, conditional generation (where 1 channel is observed and other channels are generated), and partially observed generation (where incomplete observations are spread across all the channels). We experiment with the Multi30K dataset containing English, French, Czech, and German. We demonstrate experiments with unconditional, conditional, and partially conditional generation. We provide qualitative samples sampled unconditionally from the generative joint distribution. We also quantitatively analyze the quality-diversity trade-offs and find MGLM outperforms traditional bilingual discriminative models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes