Building a Norwegian Lexical Resource for Medical Entity Recognition
The resource gives Norwegian medical NLP a domain-specific tool, but the contribution is incremental because it builds on existing databases and methods.
The authors address the lack of a Norwegian lexical resource for medical entity recognition by building one with over 77,000 unique entries. A domain expert who manually evaluated a subset of the automatic mappings judged about 80% of them correct.
We present a large Norwegian lexical resource of categorized medical terms. The resource merges information from large medical databases, and contains over 77,000 unique entries, including automatically mapped terms from a Norwegian medical dictionary. We describe the methodology behind this automatic dictionary entry mapping based on keywords and suffixes and further present the results of a manual evaluation performed on a subset by a domain expert. The evaluation indicated that ca. 80% of the mappings were correct.
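The keyword- and suffix-based mapping described in the abstract can be illustrated with a minimal sketch. The abstract does not list the actual rules or category names, so the suffix table, the keyword table, the category labels, and the function name `map_entry` below are all hypothetical, and the order in which the two rule types are tried is an assumption rather than the authors' method.

```python
from typing import Optional

# Hypothetical suffix rules (term ending -> category); illustrative only,
# not the rule set used to build the resource.
SUFFIX_RULES = {
    "itt": "CONDITION",      # e.g. "bronkitt"
    "ose": "CONDITION",      # e.g. "trombose"
    "ektomi": "PROCEDURE",   # e.g. "appendektomi"
    "skopi": "PROCEDURE",    # e.g. "gastroskopi"
}

# Hypothetical keyword rules (word in the dictionary definition -> category).
KEYWORD_RULES = {
    "sykdom": "CONDITION",
    "betennelse": "CONDITION",
    "legemiddel": "SUBSTANCE",
    "operasjon": "PROCEDURE",
}


def map_entry(term: str, definition: str) -> Optional[str]:
    """Return a category for a dictionary entry, or None if no rule fires."""
    term_lc = term.lower()
    # Assumption: suffix rules are tried first, longest suffix first.
    for suffix in sorted(SUFFIX_RULES, key=len, reverse=True):
        if term_lc.endswith(suffix):
            return SUFFIX_RULES[suffix]
    # Otherwise fall back to the first matching keyword in the definition.
    for word in definition.lower().split():
        token = word.strip(".,;:()")
        if token in KEYWORD_RULES:
            return KEYWORD_RULES[token]
    return None


if __name__ == "__main__":
    print(map_entry("bronkitt", "betennelse i bronkiene"))
    print(map_entry("paracetamol", "smertestillende legemiddel"))
    print(map_entry("stetoskop", "instrument for lytting"))
```

Surface rules of this kind can misfire (a sugar name ending in "-ose" would be labelled as a condition under the hypothetical table above), which is why checking a sample of the mappings by hand, as the abstract describes, is a useful complement.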