LGPLJul 26, 2023

ExeDec: Execution Decomposition for Compositional Generalization in Neural Program Synthesis

arXiv:2307.13883v222 citationsh-index: 42
AI Analysis

This addresses the challenge of enabling neural models to generalize to complex tasks by decomposing them into simpler subtasks, which is incremental but important for improving program synthesis methods.

The paper tackles the problem of compositional generalization in neural program synthesis by proposing ExeDec, a decomposition-based synthesis strategy that predicts execution subgoals step-by-step, resulting in better synthesis performance and greatly improved compositional generalization compared to baselines on datasets like RobustFill and DeepCoder.

When writing programs, people have the ability to tackle a new complex task by decomposing it into smaller and more familiar subtasks. While it is difficult to measure whether neural program synthesis methods have similar capabilities, we can measure whether they compositionally generalize, that is, whether a model that has been trained on the simpler subtasks is subsequently able to solve more complex tasks. In this paper, we characterize several different forms of compositional generalization that are desirable in program synthesis, forming a meta-benchmark which we use to create generalization tasks for two popular datasets, RobustFill and DeepCoder. We then propose ExeDec, a novel decomposition-based synthesis strategy that predicts execution subgoals to solve problems step-by-step informed by program execution at each step. When used with Transformer models trained from scratch, ExeDec has better synthesis performance and greatly improved compositional generalization ability compared to baselines. Finally, we use our benchmarks to demonstrate that LLMs struggle to compositionally generalize when asked to do programming-by-example in a few-shot setting, but an ExeDec-style prompting approach can improve the generalization ability and overall performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes