CLLGOct 18, 2018

Unsupervised Neural Text Simplification

arXiv:1810.07931v61117 citations
Originality Incremental advance
AI Analysis

This addresses the problem of text simplification for accessibility without requiring large labeled datasets, though it is incremental as it builds on existing neural methods.

The paper tackles unsupervised text simplification using only unlabeled Wikipedia text, achieving competitive performance to supervised methods in lexical and syntactic simplification, with further improvements from adding a few labeled pairs.

The paper presents a first attempt towards unsupervised neural text simplification that relies only on unlabeled text corpora. The core framework is composed of a shared encoder and a pair of attentional-decoders and gains knowledge of simplification through discrimination based-losses and denoising. The framework is trained using unlabeled text collected from en-Wikipedia dump. Our analysis (both quantitative and qualitative involving human evaluators) on a public test data shows that the proposed model can perform text-simplification at both lexical and syntactic levels, competitive to existing supervised methods. Addition of a few labelled pairs also improves the performance further.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes