CL · AI · Aug 25, 2017

$k$-Nearest Neighbor Augmented Neural Networks for Text Classification

arXiv:1708.07863v1 · 17 citations
Originality: Incremental advance
AI Analysis

An incremental improvement to neural text classification that boosts performance when training data is scarce or unbalanced.

Neural text classifiers fit the training set statistically but make no direct use of individual training instances. The paper addresses this by augmenting the network with a k-nearest neighbor (kNN) memory built from the training set. The resulting model outperforms its baseline on all benchmark datasets tested and beats a 29-layer deep model on several of them, with the largest gains when training data is scarce or unbalanced.

In recent years, many deep-learning-based models have been proposed for text classification. Models of this kind fit the training set well from a statistical point of view. However, they lack the capacity to use instance-level information from individual instances in the training set. In this work, we propose to enhance neural network models by allowing them to leverage information from the $k$ nearest neighbors (kNN) of the input text. Our model employs a neural network that encodes texts into text embeddings. In addition, we use the $k$ nearest neighbors of the input text as an external memory that captures instance-level information from the training set. The final prediction is based on features from both the neural network encoder and the kNN memory. Experimental results on several standard benchmark datasets show that our model outperforms the baseline model on all the datasets, and it even beats a very deep neural network model (with 29 layers) on several datasets. Our model also shows superior performance when training instances are scarce and when the training set is severely unbalanced. It also works well with techniques such as semi-supervised training and transfer learning.
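
The combination of encoder features and kNN memory described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function name `knn_memory_features`, the cosine-similarity retrieval, the softmax weighting of neighbors, and the plain concatenation of features are all assumptions made for illustration, and the text embeddings are taken as given rather than produced by a trained encoder.

```python
import numpy as np

def knn_memory_features(query_emb, train_embs, train_labels, num_classes, k=3):
    """Hypothetical helper: build features from a query embedding and its k nearest neighbors."""
    # Cosine similarity between the query and every training embedding.
    q = query_emb / np.linalg.norm(query_emb)
    t = train_embs / np.linalg.norm(train_embs, axis=1, keepdims=True)
    sims = t @ q
    # Indices of the k most similar training instances (the "memory").
    nn_idx = np.argsort(-sims)[:k]
    # Softmax over neighbor similarities (an assumed weighting scheme).
    w = np.exp(sims[nn_idx] - sims[nn_idx].max())
    w /= w.sum()
    # Weighted neighbor embedding and weighted neighbor label distribution.
    text_mem = w @ train_embs[nn_idx]
    label_mem = w @ np.eye(num_classes)[train_labels[nn_idx]]
    # Features from the encoder and from the kNN memory, side by side.
    return np.concatenate([query_emb, text_mem, label_mem]), nn_idx

# Toy data: four 2-d "text embeddings" with two classes.
train_embs = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
train_labels = np.array([0, 0, 1, 1])
query_emb = np.array([1.0, 0.05])

features, nn_idx = knn_memory_features(query_emb, train_embs, train_labels,
                                       num_classes=2, k=2)
```

In a full model, the concatenated `features` vector would feed a final classification layer trained jointly with the encoder; here it only shows how instance-level information (neighbor texts and their labels) can sit alongside the encoder output.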

