CLLGNov 5, 2021

Grounded Graph Decoding Improves Compositional Generalization in Question Answering

arXiv:2111.03642v1663 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses a key limitation in QA models for handling complex inputs, though it is incremental as it builds on prior work on permutation invariant models.

The paper tackles the problem of compositional generalization in question answering, where models struggle with novel combinations of training patterns, and proposes Grounded Graph Decoding to improve this by grounding structured predictions with attention, achieving 98% accuracy on the MCD1 split of the CFQ dataset.

Question answering models struggle to generalize to novel compositions of training patterns, such to longer sequences or more complex test structures. Current end-to-end models learn a flat input embedding which can lose input syntax context. Prior approaches improve generalization by learning permutation invariant models, but these methods do not scale to more complex train-test splits. We propose Grounded Graph Decoding, a method to improve compositional generalization of language representations by grounding structured predictions with an attention mechanism. Grounding enables the model to retain syntax information from the input in thereby significantly improving generalization over complex inputs. By predicting a structured graph containing conjunctions of query clauses, we learn a group invariant representation without making assumptions on the target domain. Our model significantly outperforms state-of-the-art baselines on the Compositional Freebase Questions (CFQ) dataset, a challenging benchmark for compositional generalization in question answering. Moreover, we effectively solve the MCD1 split with 98% accuracy.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes