LG · AI · CL · Jun 7, 2022

Confidence-aware Self-Semantic Distillation on Knowledge Graph Embedding

arXiv:2206.02963v32 citations · h-index: 46
Originality: Incremental advance
AI Analysis

This work addresses the computation and memory cost of knowledge graph embedding. It is an incremental advance, since it builds on existing distillation techniques.

The paper tackles the performance degradation of low-dimensional knowledge graph embeddings by proposing a confidence-aware self-knowledge distillation method that learns from the model's own previous iterations, achieving improved results without needing a pre-trained teacher model.

Knowledge Graph Embedding (KGE), which projects entities and relations into continuous vector spaces, has garnered significant attention. High-dimensional KGE methods perform better, but at the cost of substantial computation and memory overhead, and reducing the embedding dimension significantly degrades model performance. Several recent efforts use knowledge distillation or non-Euclidean representation learning to improve low-dimensional KGE, but they either require a pre-trained high-dimensional teacher model or involve complex non-Euclidean operations, and so incur considerable additional computational cost. To address this, this work proposes Confidence-aware Self-Knowledge Distillation (CSD), which learns from the model itself to enhance KGE in a low-dimensional space. Specifically, CSD extracts knowledge from the embeddings of previous iterations and uses it to supervise the model in subsequent iterations. In addition, a dedicated semantic module filters for reliable knowledge by estimating the confidence of the previously learned embeddings. This straightforward strategy avoids the time-consuming pre-training of a teacher model and can be integrated into various KGE methods to improve their performance. Comprehensive experiments on six KGE backbones and four datasets demonstrate the effectiveness of the proposed CSD.
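The idea of supervising the current embeddings with a confidence-filtered snapshot of earlier ones can be illustrated with a small sketch. This is not the paper's implementation: the TransE-style scoring, the KL-divergence objective, the use of the snapshot's probability for the true tail as the confidence weight, and all function names below are assumptions chosen only to make the mechanism concrete.

```python
import numpy as np

def transe_scores(ent, rel, h, r):
    """Score every entity as a candidate tail for (h, r): -||h + r - t||_1."""
    query = ent[h] + rel[r]                                   # (batch, dim)
    diff = query[:, None, :] - ent[None, :, :]                # (batch, n_ent, dim)
    return -np.abs(diff).sum(axis=-1)                         # (batch, n_ent)

def softmax(x):
    z = x - x.max(axis=-1, keepdims=True)                     # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def self_distillation_loss(ent, rel, prev_ent, prev_rel, h, r, t):
    """Confidence-weighted KL(teacher || student) over candidate tails.

    The "teacher" is a frozen snapshot of the same model from an earlier
    iteration. The confidence weight is an assumption made for this sketch:
    the probability the snapshot assigns to the true tail t.
    """
    eps = 1e-12
    p_teacher = softmax(transe_scores(prev_ent, prev_rel, h, r))
    p_student = softmax(transe_scores(ent, rel, h, r))
    confidence = p_teacher[np.arange(len(t)), t]              # (batch,)
    kl = (p_teacher * (np.log(p_teacher + eps) - np.log(p_student + eps))).sum(axis=-1)
    return float((confidence * kl).mean())

# Toy example: 5 entities, 2 relations, 4-dimensional embeddings.
rng = np.random.default_rng(0)
prev_ent = rng.normal(size=(5, 4))
prev_rel = rng.normal(size=(2, 4))
h, r, t = np.array([0, 1]), np.array([0, 1]), np.array([2, 3])

# Identical student and snapshot: nothing to distil.
loss_same = self_distillation_loss(prev_ent, prev_rel, prev_ent, prev_rel, h, r, t)

# Student that has drifted from the snapshot: positive penalty.
ent = prev_ent + rng.normal(size=prev_ent.shape)
loss_drift = self_distillation_loss(ent, prev_rel, prev_ent, prev_rel, h, r, t)
```

In a training loop, a term like this would be added to the usual KGE ranking loss, with the snapshot refreshed periodically. Triples on which the snapshot is unsure of the true tail contribute little, which is the filtering role the abstract attributes to the confidence estimate.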
