CLAIAug 1, 2021

Improving Social Meaning Detection with Pragmatic Masking and Surrogate Fine-Tuning

arXiv:2108.00356v4639 citations
Originality Incremental advance
AI Analysis

This work addresses social meaning detection in social media for NLP applications, but it is incremental as it builds on existing masked language models.

The paper tackles the mismatch between pre-training and fine-tuning objectives in masked language models for social meaning detection by proposing pragmatic masking and surrogate fine-tuning, achieving a 2.34% F1 improvement over baselines and 68.54% average F1 in few-shot learning.

Masked language models (MLMs) are pre-trained with a denoising objective that is in a mismatch with the objective of downstream fine-tuning. We propose pragmatic masking and surrogate fine-tuning as two complementing strategies that exploit social cues to drive pre-trained representations toward a broad set of concepts useful for a wide class of social meaning tasks. We test our models on $15$ different Twitter datasets for social meaning detection. Our methods achieve $2.34\%$ $F_1$ over a competitive baseline, while outperforming domain-specific language models pre-trained on large datasets. Our methods also excel in few-shot learning: with only $5\%$ of training data (severely few-shot), our methods enable an impressive $68.54\%$ average $F_1$. The methods are also language agnostic, as we show in a zero-shot setting involving six datasets from three different languages.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes