LGJul 10, 2017

TAPAS: Two-pass Approximate Adaptive Sampling for Softmax

arXiv:1707.03073v24.35 citations

Originality Incremental advance

AI Analysis

This work addresses efficiency issues in multi-class classification for applications with extensive label sets, representing an incremental improvement over existing sampling techniques.

The paper tackles the computational challenge of training softmax models with very large label spaces by introducing TAPAS, a two-pass adaptive sampling method that reduces computational overhead and effectively minimizes rank loss, as demonstrated on synthetic and large real datasets.

TAPAS is a novel adaptive sampling method for the softmax model. It uses a two pass sampling strategy where the examples used to approximate the gradient of the partition function are first sampled according to a squashed population distribution and then resampled adaptively using the context and current model. We describe an efficient distributed implementation of TAPAS. We show, on both synthetic data and a large real dataset, that TAPAS has low computational overhead and works well for minimizing the rank loss for multi-class classification problems with a very large label space.

View on arXiv PDF

Similar