LGAIMay 10, 2023

ACTC: Active Threshold Calibration for Cold-Start Knowledge Graph Completion

arXiv:2305.06395v3222 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of improving prediction quality in knowledge graph completion for applications requiring efficient calibration with minimal labeled data, though it is incremental as it builds on existing calibration methods.

The paper tackles the problem of cold-start calibration for knowledge graph completion, where no annotated examples are initially available, by proposing ACTC to efficiently find per-relation thresholds using limited annotations and unlabeled data, resulting in improvements of up to 7% points with a budget of 10 tuples and an average of 4% points across budgets.

Self-supervised knowledge-graph completion (KGC) relies on estimating a scoring model over (entity, relation, entity)-tuples, for example, by embedding an initial knowledge graph. Prediction quality can be improved by calibrating the scoring model, typically by adjusting the prediction thresholds using manually annotated examples. In this paper, we attempt for the first time cold-start calibration for KGC, where no annotated examples exist initially for calibration, and only a limited number of tuples can be selected for annotation. Our new method ACTC finds good per-relation thresholds efficiently based on a limited set of annotated tuples. Additionally to a few annotated tuples, ACTC also leverages unlabeled tuples by estimating their correctness with Logistic Regression or Gaussian Process classifiers. We also experiment with different methods for selecting candidate tuples for annotation: density-based and random selection. Experiments with five scoring models and an oracle annotator show an improvement of 7% points when using ACTC in the challenging setting with an annotation budget of only 10 tuples, and an average improvement of 4% points over different budgets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes