CLJan 22, 2021

Unsupervised Technical Domain Terms Extraction using Term Extractor

arXiv:2101.09015v1712 citations
Originality Synthesis-oriented
AI Analysis

This work addresses terminology extraction for natural language processing applications, but it appears incremental as it builds on existing unsupervised methods for a specific shared task.

The paper tackled unsupervised extraction of technical domain terms by proposing a method that uses chunking, preprocessing, and ranking with relevance and cohesion functions, achieving results for the ICON 2020 shared task 2.

Terminology extraction, also known as term extraction, is a subtask of information extraction. The goal of terminology extraction is to extract relevant words or phrases from a given corpus automatically. This paper focuses on the unsupervised automated domain term extraction method that considers chunking, preprocessing, and ranking domain-specific terms using relevance and cohesion functions for ICON 2020 shared task 2: TermTraction.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes