AICLDLDec 17, 2018

TechKG: A Large-Scale Chinese Technology-Oriented Knowledge Graph

arXiv:1812.06722v13 citations
Originality Synthesis-oriented
AI Analysis

This provides a domain-specific resource for AI applications in Chinese technology contexts, but it is incremental as it adapts existing knowledge graph methods to new data.

The authors tackled the lack of large-scale Chinese technology-oriented knowledge graphs by building TechKG, a dataset with over 260 million triplets from 52 million entities across 38 research domains, extracted from Chinese academic papers using heuristic rules.

Knowledge graph is a kind of valuable knowledge base which would benefit lots of AI-related applications. Up to now, lots of large-scale knowledge graphs have been built. However, most of them are non-Chinese and designed for general purpose. In this work, we introduce TechKG, a large scale Chinese knowledge graph that is technology-oriented. It is built automatically from massive technical papers that are published in Chinese academic journals of different research domains. Some carefully designed heuristic rules are used to extract high quality entities and relations. Totally, it comprises of over 260 million triplets that are built upon more than 52 million entities which come from 38 research domains. Our preliminary ex-periments indicate that TechKG has high adaptability and can be used as a dataset for many diverse AI-related applications. We released TechKG at: http://www.techkg.cn.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes