LG · ML · Feb 15, 2021

One Line To Rule Them All: Generating LO-Shot Soft-Label Prototypes

arXiv:2102.07834v1 · 8 citations · Has Code
Originality: Incremental advance
AI Analysis

This work improves computational efficiency for machine learning practitioners by enabling learning with fewer prototypes than classes. The advance is incremental: it builds on existing prototype generation and soft-label concepts.

The paper tackles the high computational cost of machine learning by proposing a method for generating soft-label prototypes that accurately represent a training dataset with fewer prototypes than classes, while maintaining high classification accuracy even on severely imbalanced and difficult data.

Increasingly large datasets are rapidly driving up the computational costs of machine learning. Prototype generation methods aim to create a small set of synthetic observations that accurately represent a training dataset but greatly reduce the computational cost of learning from it. Assigning soft labels to prototypes can allow increasingly small sets of prototypes to accurately represent the original training dataset. Although foundational work on 'less than one'-shot learning has proven the theoretical plausibility of learning with fewer than one observation per class, developing practical algorithms for generating such prototypes remains unexplored territory. We propose a novel, modular method for generating soft-label prototypical lines that maintains representational accuracy even when there are fewer prototypes than classes in the data. In addition, we propose the Hierarchical Soft-Label Prototype k-Nearest Neighbor classification algorithm based on these prototypical lines. We show that our method maintains high classification accuracy while greatly reducing the number of prototypes required to represent a dataset, even when working with severely imbalanced and difficult data. Our code is available at https://github.com/ilia10000/SLkNN.
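
To make the idea of "fewer prototypes than classes" concrete, here is a minimal sketch of distance-weighted soft-label prototype classification, assuming inverse-distance weighting over all prototypes. It is an illustration only, not the authors' implementation and not the hierarchical algorithm from the paper; the function name `classify`, the prototype coordinates, and the soft-label values are hypothetical choices made for this example. Two prototypes placed at the endpoints of a line segment each carry a probability vector over three classes, and a query point is assigned the class with the largest weighted label sum.

```python
import math


def classify(point, prototypes, soft_labels, eps=1e-12):
    """Return the class index with the largest inverse-distance-weighted
    sum of prototype soft labels (illustrative helper, not from the paper's repo)."""
    n_classes = len(soft_labels[0])
    scores = [0.0] * n_classes
    for proto, label in zip(prototypes, soft_labels):
        # Closer prototypes contribute more; eps avoids division by zero.
        weight = 1.0 / (math.dist(point, proto) + eps)
        for c, prob in enumerate(label):
            scores[c] += weight * prob
    return max(range(n_classes), key=scores.__getitem__)


# Assumed toy setup: two prototypes (endpoints of one line), three classes.
prototypes = [(0.0, 0.0), (1.0, 0.0)]
soft_labels = [(0.6, 0.4, 0.0), (0.0, 0.4, 0.6)]

# Query points near the left end, the middle, and the right end of the line.
predictions = [classify((x, 0.0), prototypes, soft_labels) for x in (0.1, 0.5, 0.9)]
print(predictions)  # [0, 1, 2]
```

Near either endpoint the nearest prototype's dominant class wins, while in the middle the shared second class accumulates weight from both prototypes and overtakes the others, so two prototypes separate three classes along the line.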

Code Implementations: 1 repo
