CLOct 15, 2021

On The Ingredients of an Effective Zero-shot Semantic Parser

arXiv:2110.08381v1638 citations
Originality Incremental advance
AI Analysis

This work addresses the data scarcity issue in semantic parsing for NLP applications, offering an incremental improvement over existing zero-shot methods.

The paper tackles the problem of training semantic parsers without labeled data by analyzing and bridging language and logical gaps between synthetic and real user utterances, achieving strong zero-shot performance on two benchmarks.

Semantic parsers map natural language utterances into meaning representations (e.g., programs). Such models are typically bottlenecked by the paucity of training data due to the required laborious annotation efforts. Recent studies have performed zero-shot learning by synthesizing training examples of canonical utterances and programs from a grammar, and further paraphrasing these utterances to improve linguistic diversity. However, such synthetic examples cannot fully capture patterns in real data. In this paper we analyze zero-shot parsers through the lenses of the language and logical gaps (Herzig and Berant, 2019), which quantify the discrepancy of language and programmatic patterns between the canonical examples and real-world user-issued ones. We propose bridging these gaps using improved grammars, stronger paraphrasers, and efficient learning methods using canonical examples that most likely reflect real user intents. Our model achieves strong performance on two semantic parsing benchmarks (Scholar, Geo) with zero labeled data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes