CL AI LGMar 17, 2024

RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning

Javad Rafiei Asl, Prajwal Panzade, Eduardo Blanco, Daniel Takabi, Zhipeng Cai

arXiv:2403.11082v115.432 citationsh-index: 10Has CodeNAACL-HLT

Originality Incremental advance

AI Analysis

It addresses robustness issues in sentence embeddings for NLP applications, offering incremental improvements over existing methods.

The paper tackles the problem of poor robustness in pre-trained language model-based sentence embeddings under adversarial attacks, introducing RobustSentEmbed which reduces the BERTAttack success rate from 75.51% to 38.81% and improves semantic textual similarity by 1.59%.

Pre-trained language models (PLMs) have consistently demonstrated outstanding performance across a diverse spectrum of natural language processing tasks. Nevertheless, despite their success with unseen data, current PLM-based representations often exhibit poor robustness in adversarial settings. In this paper, we introduce RobustSentEmbed, a self-supervised sentence embedding framework designed to improve both generalization and robustness in diverse text representation tasks and against a diverse set of adversarial attacks. Through the generation of high-risk adversarial perturbations and their utilization in a novel objective function, RobustSentEmbed adeptly learns high-quality and robust sentence embeddings. Our experiments confirm the superiority of RobustSentEmbed over state-of-the-art representations. Specifically, Our framework achieves a significant reduction in the success rate of various adversarial attacks, notably reducing the BERTAttack success rate by almost half (from 75.51\% to 38.81\%). The framework also yields improvements of 1.59\% and 0.23\% in semantic textual similarity tasks and various transfer tasks, respectively.

View on arXiv PDF Code

Similar