CLJul 27, 2017

Determining Semantic Textual Similarity using Natural Deduction Proofs

Hitomi Yanaka, Koji Mineshima, Pascual Martinez-Gomez, Daisuke Bekki

arXiv:1707.08713v139.21092 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the challenge of capturing accurate semantics in NLP for tasks like similarity assessment, but it is incremental as it builds on existing logical methods.

The authors tackled the problem of determining semantic textual similarity by combining shallow features with features from natural deduction proofs of bidirectional entailment, resulting in a system that outperformed other logic-based systems and demonstrated the effectiveness of proof-derived features.

Determining semantic textual similarity is a core research subject in natural language processing. Since vector-based models for sentence representation often use shallow information, capturing accurate semantics is difficult. By contrast, logical semantic representations capture deeper levels of sentence semantics, but their symbolic nature does not offer graded notions of textual similarity. We propose a method for determining semantic textual similarity by combining shallow features with features extracted from natural deduction proofs of bidirectional entailment relations between sentence pairs. For the natural deduction proofs, we use ccg2lambda, a higher-order automatic inference system, which converts Combinatory Categorial Grammar (CCG) derivation trees into semantic representations and conducts natural deduction proofs. Experiments show that our system was able to outperform other logic-based systems and that features derived from the proofs are effective for learning textual similarity.

View on arXiv PDF Code

Similar