CLAIApr 16, 2021

A Million Tweets Are Worth a Few Points: Tuning Transformers for Customer Service Tasks

arXiv:2104.07944v1727 citations
Originality Incremental advance
AI Analysis

This addresses a practical challenge for companies in customer service applications, but it is incremental as it builds on existing migration strategies for domain-specific tasks.

The paper tackled the problem of deploying NLP models in multilingual customer service with limited, noisy data by collecting 865k tweets and comparing pretraining and finetuning pipelines across 5 tasks, showing that in-domain pretraining before finetuning boosts performance, particularly for non-English settings.

In online domain-specific customer service applications, many companies struggle to deploy advanced NLP models successfully, due to the limited availability of and noise in their datasets. While prior research demonstrated the potential of migrating large open-domain pretrained models for domain-specific tasks, the appropriate (pre)training strategies have not yet been rigorously evaluated in such social media customer service settings, especially under multilingual conditions. We address this gap by collecting a multilingual social media corpus containing customer service conversations (865k tweets), comparing various pipelines of pretraining and finetuning approaches, applying them on 5 different end tasks. We show that pretraining a generic multilingual transformer model on our in-domain dataset, before finetuning on specific end tasks, consistently boosts performance, especially in non-English settings.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes