CLLGApr 2, 2022

Exploiting Local and Global Features in Transformer-based Extreme Multi-label Text Classification

CMU
arXiv:2204.00933v12 citationsh-index: 71
Originality Incremental advance
AI Analysis

This addresses the challenge of accurately tagging documents with many labels for applications like recommendation systems, though it is incremental as it builds on existing Transformer-based approaches.

The paper tackled the problem of extreme multi-label text classification by proposing a model that combines local word-level and global document-level features from Transformers, showing it either outperforms or matches state-of-the-art methods on benchmark datasets.

Extreme multi-label text classification (XMTC) is the task of tagging each document with the relevant labels from a very large space of predefined categories. Recently, large pre-trained Transformer models have made significant performance improvements in XMTC, which typically use the embedding of the special CLS token to represent the entire document semantics as a global feature vector, and match it against candidate labels. However, we argue that such a global feature vector may not be sufficient to represent different granularity levels of semantics in the document, and that complementing it with the local word-level features could bring additional gains. Based on this insight, we propose an approach that combines both the local and global features produced by Transformer models to improve the prediction power of the classifier. Our experiments show that the proposed model either outperforms or is comparable to the state-of-the-art methods on benchmark datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes