CL AIOct 7, 2019

Commonsense Knowledge Base Completion with Structural and Semantic Context

Chaitanya Malaviya, Chandra Bhagavatula, Antoine Bosselut, Yejin Choi

arXiv:1910.02915v28.8155 citations

Originality Incremental advance

AI Analysis

This work addresses the problem of automatic knowledge base completion for sparser, free-text commonsense graphs, which is incremental by building on existing methods with novel adaptations.

The paper tackles the challenge of completing commonsense knowledge graphs, which are sparser and have more nodes than conventional KBs, by proposing a model that combines graph structure learning with pre-trained language model representations. The result shows improved link prediction performance, including a +1.5 MRR gain on ConceptNet, and provides insights into the types of commonsense knowledge captured by language models.

Automatic KB completion for commonsense knowledge graphs (e.g., ATOMIC and ConceptNet) poses unique challenges compared to the much studied conventional knowledge bases (e.g., Freebase). Commonsense knowledge graphs use free-form text to represent nodes, resulting in orders of magnitude more nodes compared to conventional KBs (18x more nodes in ATOMIC compared to Freebase (FB15K-237)). Importantly, this implies significantly sparser graph structures - a major challenge for existing KB completion methods that assume densely connected graphs over a relatively smaller set of nodes. In this paper, we present novel KB completion models that can address these challenges by exploiting the structural and semantic context of nodes. Specifically, we investigate two key ideas: (1) learning from local graph structure, using graph convolutional networks and automatic graph densification and (2) transfer learning from pre-trained language models to knowledge graphs for enhanced contextual representation of knowledge. We describe our method to incorporate information from both these sources in a joint model and provide the first empirical results for KB completion on ATOMIC and evaluation with ranking metrics on ConceptNet. Our results demonstrate the effectiveness of language model representations in boosting link prediction performance and the advantages of learning from local graph structure (+1.5 points in MRR for ConceptNet) when training on subgraphs for computational efficiency. Further analysis on model predictions shines light on the types of commonsense knowledge that language models capture well.

View on arXiv PDF

Similar