CVLGOct 28, 2019

Neighborhood Watch: Representation Learning with Local-Margin Triplet Loss and Sampling Strategy for K-Nearest-Neighbor Image Classification

arXiv:1911.07940v1
Originality Incremental advance
AI Analysis

This work addresses challenges in tuning and theoretical foundation for triplet networks, particularly useful for small sample data with limited augmentation, though it is incremental.

The paper tackled the problem of deep representation learning for classification by proposing a local-margin triplet loss and sampling strategy, which outperformed end-to-end softmax and typical triplet loss on datasets like MNIST and Cifar-10 without data augmentation.

Deep representation learning using triplet network for classification suffers from a lack of theoretical foundation and difficulty in tuning both the network and classifiers for performance. To address the problem, local-margin triplet loss along with local positive and negative mining strategy is proposed with theory on how the strategy integrate nearest-neighbor hyper-parameter with triplet learning to increase subsequent classification performance. Results in experiments with 2 public datasets, MNIST and Cifar-10, and 2 small medical image datasets demonstrate that proposed strategy outperforms end-to-end softmax and typical triplet loss in settings without data augmentation while maintaining utility of transferable feature for related tasks. The method serves as a good performance baseline where end-to-end methods encounter difficulties such as small sample data with limited allowable data augmentation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes