CLMar 24, 2022

Multitasking Framework for Unsupervised Simple Definition Generation

arXiv:2203.12926v1642 citationsh-index: 55
Originality Incremental advance
AI Analysis

This addresses the lack of learner's dictionaries for supervised training in many languages, though it is incremental as it builds on existing definition generation tasks.

The paper tackled the problem of generating simple definitions for language learners and low-literacy readers by proposing a multitasking framework that uses only standard dictionaries and simple text corpora, achieving a 1.77 SARI score improvement on English data and increasing low-level word usage in Chinese definitions by 3.87%.

The definition generation task can help language learners by providing explanations for unfamiliar words. This task has attracted much attention in recent years. We propose a novel task of Simple Definition Generation (SDG) to help language learners and low literacy readers. A significant challenge of this task is the lack of learner's dictionaries in many languages, and therefore the lack of data for supervised training. We explore this task and propose a multitasking framework SimpDefiner that only requires a standard dictionary with complex definitions and a corpus containing arbitrary simple texts. We disentangle the complexity factors from the text by carefully designing a parameter sharing scheme between two decoders. By jointly training these components, the framework can generate both complex and simple definitions simultaneously. We demonstrate that the framework can generate relevant, simple definitions for the target words through automatic and manual evaluations on English and Chinese datasets. Our method outperforms the baseline model by a 1.77 SARI score on the English dataset, and raises the proportion of the low level (HSK level 1-3) words in Chinese definitions by 3.87%.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes