IR AIMar 21, 2022

Semantic Similarity Computing for Scientific Academic Conferences fused with domain features

arXiv:2203.12593v13.71 citationsh-index: 13

Originality Incremental advance

AI Analysis

This work addresses the challenge of leveraging semantic information in conference data for researchers and practitioners, but it is incremental as it builds on existing BERT and Siamese network techniques.

The paper tackled the problem of computing semantic similarity for scientific academic conferences by fusing domain features, achieving improved Spearman correlation coefficients on different datasets compared to existing methods.

Aiming at the problem that the current general-purpose semantic text similarity calculation methods are difficult to use the semantic information of scientific academic conference data, a semantic similarity calculation algorithm for scientific academic conferences by fusion with domain features is proposed. First, the domain feature information of the conference is obtained through entity recognition and keyword extraction, and it is input into the BERT network as a feature and the conference information. The structure of the Siamese network is used to solve the anisotropy problem of BERT. The output of the network is pooled and normalized, and finally the cosine similarity is used to calculate the similarity between the two sessions. Experimental results show that the SBFD algorithm has achieved good results on different data sets, and the Spearman correlation coefficient has a certain improvement compared with the comparison algorithm.

View on arXiv PDF

Similar