CLNov 16, 2022

Towards Computationally Verifiable Semantic Grounding for Language Models

DeepMind
arXiv:2211.09070v1h-index: 36
Originality Incremental advance
AI Analysis

This work addresses the challenge of ensuring language models produce text that accurately reflects specified semantics, which is important for applications like data-to-text generation, though it is incremental as it builds on existing auto-encoder and parsing techniques.

The paper tackles the problem of semantic grounding for language models by conceptualizing them as conditional models generating text from entity-relationship triples, and it demonstrates that sampling multiple candidate sequences and training the LM with a frozen semantic parser significantly improve fluency and semantic accuracy over a greedy search baseline on the WebNLG 3.0 dataset.

The paper presents an approach to semantic grounding of language models (LMs) that conceptualizes the LM as a conditional model generating text given a desired semantic message formalized as a set of entity-relationship triples. It embeds the LM in an auto-encoder by feeding its output to a semantic parser whose output is in the same representation domain as the input message. Compared to a baseline that generates text using greedy search, we demonstrate two techniques that improve the fluency and semantic accuracy of the generated text: The first technique samples multiple candidate text sequences from which the semantic parser chooses. The second trains the language model while keeping the semantic parser frozen to improve the semantic accuracy of the auto-encoder. We carry out experiments on the English WebNLG 3.0 data set, using BLEU to measure the fluency of generated text and standard parsing metrics to measure semantic accuracy. We show that our proposed approaches significantly improve on the greedy search baseline. Human evaluation corroborates the results of the automatic evaluation experiments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes