CL, AI
Nov 14, 2022

Technological taxonomies for hypernym and hyponym retrieval in patent texts

arXiv:2212.06039v2
h-index: 20
Originality: Incremental advance
AI Analysis

This paper addresses the need for updateable taxonomies in the constantly evolving field of technological terminology, particularly for patent analysis. The advance is incremental because it builds on an existing classification scheme and existing models.

The paper tackles the problem of automatically creating taxonomies for technical terms in patent texts. It yields a freely available taxonomy with about 170k nodes across 9 branches and a fine-tuned T5 model that generates hypernyms and hyponyms with relatively high precision.

This paper presents an automatic approach to creating taxonomies of technical terms based on the Cooperative Patent Classification (CPC). The resulting taxonomy contains about 170k nodes in 9 separate technological branches and is freely available. We also show that a Text-to-Text Transfer Transformer (T5) model can be fine-tuned to generate hypernyms and hyponyms with relatively high precision, confirming the manually assessed quality of the resource. The T5 model opens the taxonomy to any new technological terms for which a hypernym can be generated, thus making the resource updateable with new terms, an essential feature for the constantly evolving field of technological terminology.
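The update mechanism described in the abstract can be illustrated with a small sketch. It is not the authors' code: the `Taxonomy` class, its method names, the example terms, and the single-parent tree structure are all assumptions made for illustration, and `fake_generator` is a hypothetical stand-in for the fine-tuned T5 model. The sketch only shows the idea that a new term can be attached to the taxonomy when a generated hypernym already exists as a node.

```python
from collections import defaultdict
from typing import Callable, Dict, List, Optional, Set


class Taxonomy:
    """Toy taxonomy in which each term has at most one hypernym (an assumption)."""

    def __init__(self) -> None:
        self.parent: Dict[str, Optional[str]] = {}
        self.children: Dict[str, Set[str]] = defaultdict(set)

    def add(self, term: str, hypernym: Optional[str] = None) -> None:
        """Add a term under an existing hypernym, or as a branch root if None."""
        if term in self.parent:
            raise ValueError(f"term already present: {term}")
        if hypernym is not None and hypernym not in self.parent:
            raise KeyError(f"unknown hypernym: {hypernym}")
        self.parent[term] = hypernym
        if hypernym is not None:
            self.children[hypernym].add(term)

    def hypernyms(self, term: str) -> List[str]:
        """Return the chain of hypernyms from the direct parent up to the root."""
        chain: List[str] = []
        current = self.parent[term]
        while current is not None:
            chain.append(current)
            current = self.parent[current]
        return chain

    def hyponyms(self, term: str) -> List[str]:
        """Return the direct hyponyms of a term in sorted order."""
        return sorted(self.children.get(term, set()))

    def insert_new_term(
        self, term: str, generate_hypernym: Callable[[str], str]
    ) -> bool:
        """Attach a new term if the generated hypernym is already a node."""
        candidate = generate_hypernym(term)
        if term in self.parent or candidate not in self.parent:
            return False
        self.add(term, candidate)
        return True


def fake_generator(term: str) -> str:
    """Hypothetical stand-in for a hypernym-generating model."""
    return {"quadcopter": "aircraft"}.get(term, "unknown")


# Invented example terms, not taken from the published taxonomy.
tax = Taxonomy()
tax.add("vehicle")
tax.add("aircraft", "vehicle")
inserted = tax.insert_new_term("quadcopter", fake_generator)
rejected = tax.insert_new_term("widget", fake_generator)
```

In this sketch, a term whose generated hypernym is not yet a node is simply rejected. How the published resource handles that case is not stated in the abstract.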

Code Implementations: 1 repo
