LGCLMLOct 29, 2022

CascadeXML: Rethinking Transformers for End-to-end Multi-resolution Training in Extreme Multi-label Classification

Microsoft
arXiv:2211.00640v121 citationsh-index: 15Has Code
Originality Incremental advance
AI Analysis

This work addresses computational and performance trade-offs in XMC for applications requiring classification over millions of labels, representing an incremental improvement over existing transformer-based methods.

The paper tackles the problem of extreme multi-label text classification (XMC) by proposing CascadeXML, an end-to-end multi-resolution learning pipeline that uses transformers to attend to different label resolutions with separate feature representations, achieving significant performance gains on benchmark datasets with up to three million labels.

Extreme Multi-label Text Classification (XMC) involves learning a classifier that can assign an input with a subset of most relevant labels from millions of label choices. Recent approaches, such as XR-Transformer and LightXML, leverage a transformer instance to achieve state-of-the-art performance. However, in this process, these approaches need to make various trade-offs between performance and computational requirements. A major shortcoming, as compared to the Bi-LSTM based AttentionXML, is that they fail to keep separate feature representations for each resolution in a label tree. We thus propose CascadeXML, an end-to-end multi-resolution learning pipeline, which can harness the multi-layered architecture of a transformer model for attending to different label resolutions with separate feature representations. CascadeXML significantly outperforms all existing approaches with non-trivial gains obtained on benchmark datasets consisting of up to three million labels. Code for CascadeXML will be made publicly available at \url{https://github.com/xmc-aalto/cascadexml}.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes