Identifying Phrasemes via Interlingual Association Measures -- A Data-driven Approach on Dependency-parsed and Word-aligned Parallel Corpora
This work addresses the challenge of phraseme identification for computational linguistics and natural language processing, but it appears incremental as it builds on existing methods with specific data enhancements.
The paper tackled the problem of identifying phrasemes (multi-word expressions) by using interlingual association measures on dependency-parsed and word-aligned parallel corpora, resulting in a data-driven approach that improves accuracy in multilingual contexts.
This is a preprint of the article "Identifying Phrasemes via Interlingual Association Measures" that was presented in February 2016 at the LeKo (Lexical combinations and typified speech in a multilingual context) conference in Innsbruck.