CLLGSep 10, 2021

Efficient Contrastive Learning via Novel Data Augmentation and Curriculum Learning

arXiv:2109.05941v2662 citations
Originality Incremental advance
AI Analysis

This work addresses computational efficiency in NLP pretraining, offering incremental improvements for researchers and practitioners in the field.

The paper tackles efficient continual pretraining for contrastive learning by introducing novel data augmentation and curriculum learning, resulting in improved performance on GLUE benchmark tasks, especially sentence-level ones, with a 30% reduction in computational memory usage.

We introduce EfficientCL, a memory-efficient continual pretraining method that applies contrastive learning with novel data augmentation and curriculum learning. For data augmentation, we stack two types of operation sequentially: cutoff and PCA jittering. While pretraining steps proceed, we apply curriculum learning by incrementing the augmentation degree for each difficulty step. After data augmentation is finished, contrastive learning is applied on projected embeddings of original and augmented examples. When finetuned on GLUE benchmark, our model outperforms baseline models, especially for sentence-level tasks. Additionally, this improvement is capable with only 70% of computational memory compared to the baseline model.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes