ML LGDec 4, 2018

A Retrieve-and-Edit Framework for Predicting Structured Outputs

Tatsunori B. Hashimoto, Kelvin Guu, Yonatan Oren, Percy Liang

arXiv:1812.01194v127.7188 citations

Originality Incremental advance

AI Analysis

This addresses the challenge of generating complex outputs more efficiently for tasks like code generation, though it is incremental as it builds on existing sequence-to-sequence models.

The paper tackles the problem of generating complex structured outputs like source code by proposing a retrieve-and-edit framework that first retrieves a training example based on input and then edits it, which boosts performance on GitHub Python code autocomplete and Hearthstone cards benchmarks compared to a vanilla sequence-to-sequence model.

For the task of generating complex outputs such as source code, editing existing outputs can be easier than generating complex outputs from scratch. With this motivation, we propose an approach that first retrieves a training example based on the input (e.g., natural language description) and then edits it to the desired output (e.g., code). Our contribution is a computationally efficient method for learning a retrieval model that embeds the input in a task-dependent way without relying on a hand-crafted metric or incurring the expense of jointly training the retriever with the editor. Our retrieve-and-edit framework can be applied on top of any base model. We show that on a new autocomplete task for GitHub Python code and the Hearthstone cards benchmark, retrieve-and-edit significantly boosts the performance of a vanilla sequence-to-sequence model on both tasks.

View on arXiv PDF

Similar