CLAIJun 3, 2021

Self-Guided Contrastive Learning for BERT Sentence Representations

arXiv:2106.07345v1744 citations
Originality Incremental advance
AI Analysis

This work addresses a key challenge in NLP for researchers and practitioners by enhancing sentence representation quality, though it is incremental as it builds on existing contrastive learning and BERT frameworks.

The paper tackles the problem of deriving effective sentence embeddings from pre-trained BERT models by proposing a self-guided contrastive learning method, resulting in improved performance on diverse sentence-related tasks compared to competitive baselines.

Although BERT and its variants have reshaped the NLP landscape, it still remains unclear how best to derive sentence embeddings from such pre-trained Transformers. In this work, we propose a contrastive learning method that utilizes self-guidance for improving the quality of BERT sentence representations. Our method fine-tunes BERT in a self-supervised fashion, does not rely on data augmentation, and enables the usual [CLS] token embeddings to function as sentence vectors. Moreover, we redesign the contrastive learning objective (NT-Xent) and apply it to sentence representation learning. We demonstrate with extensive experiments that our approach is more effective than competitive baselines on diverse sentence-related tasks. We also show it is efficient at inference and robust to domain shifts.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes