Unsupervised Technical Domain Terms Extraction using Term Extractor
This work addresses terminology extraction for natural language processing applications, but it appears incremental as it builds on existing unsupervised methods for a specific shared task.
The paper tackled unsupervised extraction of technical domain terms by proposing a method that uses chunking, preprocessing, and ranking with relevance and cohesion functions, achieving results for the ICON 2020 shared task 2.
Terminology extraction, also known as term extraction, is a subtask of information extraction. The goal of terminology extraction is to extract relevant words or phrases from a given corpus automatically. This paper focuses on the unsupervised automated domain term extraction method that considers chunking, preprocessing, and ranking domain-specific terms using relevance and cohesion functions for ICON 2020 shared task 2: TermTraction.