AI CLApr 7, 2015

Large Margin Nearest Neighbor Embedding for Knowledge Representation

Miao Fan, Qiang Zhou, Thomas Fang Zheng, Ralph Grishman

arXiv:1504.01684v18.69 citations

Originality Incremental advance

AI Analysis

This work addresses the challenge of enabling AI machines to compute and reason with knowledge more efficiently, representing an incremental improvement over existing embedding methods.

The paper tackles the problem of knowledge representation for AI by proposing a model that learns embeddings to satisfy h + r ≈ t, achieving state-of-the-art performance in link prediction and triplet classification on benchmark datasets with fewer computational resources.

Traditional way of storing facts in triplets ({\it head\_entity, relation, tail\_entity}), abbreviated as ({\it h, r, t}), makes the knowledge intuitively displayed and easily acquired by mankind, but hardly computed or even reasoned by AI machines. Inspired by the success in applying {\it Distributed Representations} to AI-related fields, recent studies expect to represent each entity and relation with a unique low-dimensional embedding, which is different from the symbolic and atomic framework of displaying knowledge in triplets. In this way, the knowledge computing and reasoning can be essentially facilitated by means of a simple {\it vector calculation}, i.e. ${\bf h} + {\bf r} \approx {\bf t}$. We thus contribute an effective model to learn better embeddings satisfying the formula by pulling the positive tail entities ${\bf t^{+}}$ to get together and close to {\bf h} + {\bf r} ({\it Nearest Neighbor}), and simultaneously pushing the negatives ${\bf t^{-}}$ away from the positives ${\bf t^{+}}$ via keeping a {\it Large Margin}. We also design a corresponding learning algorithm to efficiently find the optimal solution based on {\it Stochastic Gradient Descent} in iterative fashion. Quantitative experiments illustrate that our approach can achieve the state-of-the-art performance, compared with several latest methods on some benchmark datasets for two classical applications, i.e. {\it Link prediction} and {\it Triplet classification}. Moreover, we analyze the parameter complexities among all the evaluated models, and analytical results indicate that our model needs fewer computational resources on outperforming the other methods.

View on arXiv PDF

Similar