CLAIDec 19, 2022

Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments

Microsoft
arXiv:2212.09736v2248 citationsh-index: 42
Originality Highly original
AI Analysis

This addresses the challenge of ensuring grammaticality, faithfulness, and controllability in grounded language understanding for applications like knowledge base question answering, representing a novel approach rather than an incremental improvement.

The paper tackles the problem of grounding language models to real-world environments by proposing Pangu, a framework that uses LMs discriminatively to evaluate plans instead of generating them, achieving state-of-the-art results on knowledge base question answering datasets with a BERT-base model and enabling few-shot learning with larger LMs.

A key missing capacity of current language models (LMs) is grounding to real-world environments. Most existing work for grounded language understanding uses LMs to directly generate plans that can be executed in the environment to achieve the desired effects. It thereby casts the burden of ensuring grammaticality, faithfulness, and controllability all on the LMs. We propose Pangu, a generic framework for grounded language understanding that capitalizes on the discriminative ability of LMs instead of their generative ability. Pangu consists of a symbolic agent and a neural LM working in a concerted fashion: The agent explores the environment to incrementally construct valid plans, and the LM evaluates the plausibility of the candidate plans to guide the search process. A case study on the challenging problem of knowledge base question answering (KBQA), which features a massive environment, demonstrates the remarkable effectiveness and flexibility of Pangu: A BERT-base LM is sufficient for setting a new record on standard KBQA datasets, and larger LMs further bring substantial gains. Pangu also enables, for the first time, effective few-shot in-context learning for KBQA with large LMs such as Codex.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes