CLMar 17, 2022

An Imitation Learning Curriculum for Text Editing with Non-Autoregressive Models

arXiv:2203.09486v1641 citationsh-index: 36
Originality Incremental advance
AI Analysis

This addresses a specific challenge in text editing for NLP applications, offering incremental improvements in model training efficiency and output control.

The paper tackled the problem of training non-autoregressive sequence-to-sequence models for text editing tasks, where existing imitation learning methods cause mismatches leading to poor generalization, and proposed a framework with roll-in policies and a curriculum that significantly improved output quality on text simplification and abstractive summarization.

We propose a framework for training non-autoregressive sequence-to-sequence models for editing tasks, where the original input sequence is iteratively edited to produce the output. We show that the imitation learning algorithms designed to train such models for machine translation introduces mismatches between training and inference that lead to undertraining and poor generalization in editing scenarios. We address this issue with two complementary strategies: 1) a roll-in policy that exposes the model to intermediate training sequences that it is more likely to encounter during inference, 2) a curriculum that presents easy-to-learn edit operations first, gradually increasing the difficulty of training samples as the model becomes competent. We show the efficacy of these strategies on two challenging English editing tasks: controllable text simplification and abstractive summarization. Our approach significantly improves output quality on both tasks and controls output complexity better on the simplification task.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes