CLLGAug 16, 2019

Few-shot Text Classification with Distributional Signatures

arXiv:1908.06039v3183 citations
AI Analysis

This work addresses the problem of few-shot learning in NLP for researchers and practitioners, offering a novel approach that improves performance but is incremental in its adaptation of meta-learning techniques.

The paper tackles the challenge of applying meta-learning to few-shot text classification by incorporating distributional signatures to capture word occurrence patterns, achieving an average improvement of 20.0% in 1-shot classification over existing methods.

In this paper, we explore meta-learning for few-shot text classification. Meta-learning has shown strong performance in computer vision, where low-level patterns are transferable across learning tasks. However, directly applying this approach to text is challenging--lexical features highly informative for one task may be insignificant for another. Thus, rather than learning solely from words, our model also leverages their distributional signatures, which encode pertinent word occurrence patterns. Our model is trained within a meta-learning framework to map these signatures into attention scores, which are then used to weight the lexical representations of words. We demonstrate that our model consistently outperforms prototypical networks learned on lexical knowledge (Snell et al., 2017) in both few-shot text classification and relation classification by a significant margin across six benchmark datasets (20.0% on average in 1-shot classification).

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes