CLOct 11, 2022

Towards Structure-aware Paraphrase Identification with Phrase Alignment Using Sentence Encoders

arXiv:2210.05302v130.9580 citationsh-index: 27

Originality Incremental advance

AI Analysis

This work addresses the need for more accurate and interpretable paraphrase identification in natural language processing, though it is incremental as it builds on existing sentence encoder methods.

The paper tackles the problem of paraphrase identification by addressing the weak sensitivity of sentence encoders to word order and structural differences, proposing a method that combines sentence encoders with a phrase alignment component to improve performance and interpretability.

Previous works have demonstrated the effectiveness of utilising pre-trained sentence encoders based on their sentence representations for meaning comparison tasks. Though such representations are shown to capture hidden syntax structures, the direct similarity comparison between them exhibits weak sensitivity to word order and structural differences in given sentences. A single similarity score further makes the comparison process hard to interpret. Therefore, we here propose to combine sentence encoders with an alignment component by representing each sentence as a list of predicate-argument spans (where their span representations are derived from sentence encoders), and decomposing the sentence-level meaning comparison into the alignment between their spans for paraphrase identification tasks. Empirical results show that the alignment component brings in both improved performance and interpretability for various sentence encoders. After closer investigation, the proposed approach indicates increased sensitivity to structural difference and enhanced ability to distinguish non-paraphrases with high lexical overlap.

View on arXiv PDF

Similar