CLAILGSep 23, 2022

Whodunit? Learning to Contrast for Authorship Attribution

arXiv:2209.11887v2303 citationsh-index: 16
Originality Incremental advance
AI Analysis

It addresses the problem of inconsistent performance in authorship attribution for researchers and practitioners, though it is incremental as it builds on existing methods.

The paper tackles authorship attribution by learning author-specific representations through contrastive fine-tuning of pre-trained language models, achieving up to 6.8% improvement over cross-entropy fine-tuning on multiple benchmarks.

Authorship attribution is the task of identifying the author of a given text. The key is finding representations that can differentiate between authors. Existing approaches typically use manually designed features that capture a dataset's content and style, but these approaches are dataset-dependent and yield inconsistent performance across corpora. In this work, we propose \textit{learning} author-specific representations by fine-tuning pre-trained generic language representations with a contrastive objective (Contra-X). We show that Contra-X learns representations that form highly separable clusters for different authors. It advances the state-of-the-art on multiple human and machine authorship attribution benchmarks, enabling improvements of up to 6.8% over cross-entropy fine-tuning. However, we find that Contra-X improves overall accuracy at the cost of sacrificing performance for some authors. Resolving this tension will be an important direction for future work. To the best of our knowledge, we are the first to integrate contrastive learning with pre-trained language model fine-tuning for authorship attribution.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes