Shangchun Si

0.2CLDec 5, 2018

An enhanced computational feature selection method for medical synonym identification via bilingualism and multi-corpus training

K. Lei, S. Si, D. Wen et al.

Medical synonym identification has been an important part of medical natural language processing (NLP). However, in the field of Chinese medical synonym identification, there are problems like low precision and low recall rate. To solve the problem, in this paper, we propose a method for identifying Chinese medical synonyms. We first selected 13 features including Chinese and English features. Then we studied the synonym identification results of each feature alone and different combinations of the features. Through the comparison among identification results, we present an optimal combination of features for Chinese medical synonym identification. Experiments show that our selected features have achieved 97.37% precision rate, 96.00% recall rate and 97.33% F1 score.

Shangchun Si

1 Paper