CL AIAug 4, 2022

Integrating Knowledge Graph embedding and pretrained Language Models in Hypercomplex Spaces

Mojtaba Nayyeri, Zihao Wang, Mst. Mahfuja Akter, Mirza Mohtashim Alam, Md Rashad Al Hasan Rony, Jens Lehmann, Steffen Staab

arXiv:2208.02743v30.88 citationsh-index: 18Has Code

Originality Incremental advance

AI Analysis

This work addresses the challenge of multimodal knowledge integration for researchers in knowledge representation and link prediction, offering an incremental improvement over existing methods.

The paper tackled the problem of integrating structural and textual knowledge from Knowledge Graphs by proposing hypercomplex algebra representations to unify four modalities, resulting in superior link prediction performance on standard benchmarks.

Knowledge Graphs, such as Wikidata, comprise structural and textual knowledge in order to represent knowledge. For each of the two modalities dedicated approaches for graph embedding and language models learn patterns that allow for predicting novel structural knowledge. Few approaches have integrated learning and inference with both modalities and these existing ones could only partially exploit the interaction of structural and textual knowledge. In our approach, we build on existing strong representations of single modalities and we use hypercomplex algebra to represent both, (i), single-modality embedding as well as, (ii), the interaction between different modalities and their complementary means of knowledge representation. More specifically, we suggest Dihedron and Quaternion representations of 4D hypercomplex numbers to integrate four modalities namely structural knowledge graph embedding, word-level representations (e.g.\ Word2vec, Fasttext), sentence-level representations (Sentence transformer), and document-level representations (sentence transformer, Doc2vec). Our unified vector representation scores the plausibility of labelled edges via Hamilton and Dihedron products, thus modeling pairwise interactions between different modalities. Extensive experimental evaluation on standard benchmark datasets shows the superiority of our two new models using abundant textual information besides sparse structural knowledge to enhance performance in link prediction tasks.

View on arXiv PDF Code

Similar