CLOct 15, 2020

Pretrained Language Models for Dialogue Generation with Multiple Input Sources

arXiv:2010.07576v11003 citations
Originality Incremental advance
AI Analysis

This work addresses a specific challenge in dialogue systems for AI applications, but it is incremental as it builds on existing models like GPT2.

The paper tackled the problem of applying pretrained language models to dialogue generation with multiple input sources, finding that proper fusion methods improved relevance to dialogue history compared to simple baselines.

Large-scale pretrained language models have achieved outstanding performance on natural language understanding tasks. However, it is still under investigating how to apply them to dialogue generation tasks, especially those with responses conditioned on multiple sources. Previous work simply concatenates all input sources or averages information from different input sources. In this work, we study dialogue models with multiple input sources adapted from the pretrained language model GPT2. We explore various methods to fuse multiple separate attention information corresponding to different sources. Our experimental results show that proper fusion methods deliver higher relevance with dialogue history than simple fusion baselines.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes