LGMLJun 14, 2017

SEARNN: Training RNNs with Global-Local Losses

arXiv:1706.04499v351 citations
AI Analysis

This addresses the problem of suboptimal training for RNNs in structured prediction applications like machine translation, offering a novel method to reduce training-test discrepancies, though it appears incremental as it builds on existing learning-to-search approaches.

The paper tackles the problem of training recurrent neural networks (RNNs) for structured prediction tasks, where maximum likelihood estimation (MLE) is suboptimal due to discrepancies like exposure bias and poor surrogate loss alignment. The result is SEARNN, a training algorithm using global-local losses that improves performance over MLE on OCR, spelling correction, and machine translation tasks.

We propose SEARNN, a novel training algorithm for recurrent neural networks (RNNs) inspired by the "learning to search" (L2S) approach to structured prediction. RNNs have been widely successful in structured prediction applications such as machine translation or parsing, and are commonly trained using maximum likelihood estimation (MLE). Unfortunately, this training loss is not always an appropriate surrogate for the test error: by only maximizing the ground truth probability, it fails to exploit the wealth of information offered by structured losses. Further, it introduces discrepancies between training and predicting (such as exposure bias) that may hurt test performance. Instead, SEARNN leverages test-alike search space exploration to introduce global-local losses that are closer to the test error. We first demonstrate improved performance over MLE on two different tasks: OCR and spelling correction. Then, we propose a subsampling strategy to enable SEARNN to scale to large vocabulary sizes. This allows us to validate the benefits of our approach on a machine translation task.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes