CL LGNov 5, 2021

Grounded Graph Decoding Improves Compositional Generalization in Question Answering

Yu Gai, Paras Jain, Wendi Zhang, Joseph E. Gonzalez, Dawn Song, Ion Stoica

arXiv:2111.03642v130.8663 citationsh-index: 134Has Code

Originality Incremental advance

AI Analysis

This addresses a key limitation in QA models for handling complex inputs, though it is incremental as it builds on prior work on permutation invariant models.

The paper tackles the problem of compositional generalization in question answering, where models struggle with novel combinations of training patterns, and proposes Grounded Graph Decoding to improve this by grounding structured predictions with attention, achieving 98% accuracy on the MCD1 split of the CFQ dataset.

Question answering models struggle to generalize to novel compositions of training patterns, such to longer sequences or more complex test structures. Current end-to-end models learn a flat input embedding which can lose input syntax context. Prior approaches improve generalization by learning permutation invariant models, but these methods do not scale to more complex train-test splits. We propose Grounded Graph Decoding, a method to improve compositional generalization of language representations by grounding structured predictions with an attention mechanism. Grounding enables the model to retain syntax information from the input in thereby significantly improving generalization over complex inputs. By predicting a structured graph containing conjunctions of query clauses, we learn a group invariant representation without making assumptions on the target domain. Our model significantly outperforms state-of-the-art baselines on the Compositional Freebase Questions (CFQ) dataset, a challenging benchmark for compositional generalization in question answering. Moreover, we effectively solve the MCD1 split with 98% accuracy.

View on arXiv PDF Code

Similar