CLMay 16, 2023

ContrastNet: A Contrastive Learning Framework for Few-Shot Text Classification

arXiv:2305.09269v1100 citations
Originality Incremental advance
AI Analysis

This work addresses few-shot text classification, a domain-specific problem for natural language processing, with incremental improvements over existing methods.

The paper tackles the problems of learning discriminative text representations and overfitting in few-shot text classification by proposing ContrastNet, a contrastive learning framework that improves performance, as shown by experiments on 8 datasets where it outperforms state-of-the-art models.

Few-shot text classification has recently been promoted by the meta-learning paradigm which aims to identify target classes with knowledge transferred from source classes with sets of small tasks named episodes. Despite their success, existing works building their meta-learner based on Prototypical Networks are unsatisfactory in learning discriminative text representations between similar classes, which may lead to contradictions during label prediction. In addition, the tasklevel and instance-level overfitting problems in few-shot text classification caused by a few training examples are not sufficiently tackled. In this work, we propose a contrastive learning framework named ContrastNet to tackle both discriminative representation and overfitting problems in few-shot text classification. ContrastNet learns to pull closer text representations belonging to the same class and push away text representations belonging to different classes, while simultaneously introducing unsupervised contrastive regularization at both task-level and instance-level to prevent overfitting. Experiments on 8 few-shot text classification datasets show that ContrastNet outperforms the current state-of-the-art models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes