CLLGMay 19, 2022

Transformers as Neural Augmentors: Class Conditional Sentence Generation via Variational Bayes

arXiv:2205.09391v12 citationsh-index: 16
Originality Incremental advance
AI Analysis

This work addresses data augmentation challenges for NLP practitioners, offering an incremental improvement in generating diverse, class-conditional sentences to enhance downstream tasks.

The paper tackles the problem of limited diversity and applicability in sentence-level data augmentation for NLP by proposing a neural method combining Conditional Variational Autoencoder and Transformer models, which improves performance over other augmentation techniques with minimal computational cost.

Data augmentation methods for Natural Language Processing tasks are explored in recent years, however they are limited and it is hard to capture the diversity on sentence level. Besides, it is not always possible to perform data augmentation on supervised tasks. To address those problems, we propose a neural data augmentation method, which is a combination of Conditional Variational Autoencoder and encoder-decoder Transformer model. While encoding and decoding the input sentence, our model captures the syntactic and semantic representation of the input language with its class condition. Following the developments in the past years on pre-trained language models, we train and evaluate our models on several benchmarks to strengthen the downstream tasks. We compare our method with 3 different augmentation techniques. The presented results show that, our model increases the performance of current models compared to other data augmentation techniques with a small amount of computation power.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes